I had a similar issue when I enabled SSL on HDFS. The issue was that we have internal network IP addresses (InfiniBand) and externally accessible IP addresses. We had a firewall, but even with it disabled the connection to HDFS failed. The ResourceManager and JobHistory UIs both worked fine. No external connection to HDFS could be made using curl/openssl/telnet. OpenSSL from inside the cluster returned 0 for the HDFS connection. I could run the HDFS UI from inside the cluster using X Window-enabled PuTTY and Xming. I could connect to the HDFS REST interface from the command line inside the cluster using curl.

When I try to launch the ZooKeeper CLI (./bin/zkCli.sh), I'm getting the following:

    Connecting to localhost:2181
    23:18:06,070 - INFO - Client environment:java.vendor=Oracle Corporation
    [...] Will not attempt to authenticate using SASL (unknown error)
    23:18:10,761 - INFO - Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect

The ResourceManager log shows the same ZooKeeper connection failure:

    20:26:11,400 INFO recovery.ZKRMStateStore (ZKRMStateStore.java:runWithRetries(1230)) - Retrying operation on ZK.
    20:26:11,472 INFO zookeeper.ClientCnxn (ClientCnxn.java:run(1142)) - Unable to read additional data from server sessionid 0x0, likely server has closed socket, closing socket connection and attempting reconnect
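The checks described above (curl/openssl/telnet against HDFS, zkCli.sh against localhost:2181) all come down to whether a TCP connection can be opened from a given vantage point. A minimal sketch of that probe follows. The function name `can_connect` and the target list are illustrative assumptions; only `localhost:2181` is taken from the log above, and any HDFS host and port would have to come from your own cluster configuration.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    This only tests reachability at the TCP level (roughly what
    telnet shows). It says nothing about SSL, SASL, or whether the
    service behind the port is healthy.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS failures.
        return False


if __name__ == "__main__":
    # Hypothetical target list: extend it with the internal and the
    # external address of each service you want to compare.
    targets = [("localhost", 2181)]
    for host, port in targets:
        state = "reachable" if can_connect(host, port) else "NOT reachable"
        print(f"{host}:{port} is {state}")
```

Running the same script once from a node inside the cluster and once from an external machine would show whether a service is reachable only on the internal addresses, which is the pattern described above.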
Hmmm, when I try to start RM, I'm getting this:

    20:26:11,400 INFO recovery.ZKRMStateStore (ZKRMStateStore.java:runWithRetries(1227)) - Exception while executing a ZK operation.
    $ConnectionLossException: KeeperErrorCode = ConnectionLoss for /rmstore
    at .create(KeeperException.java:99)
    at .create(KeeperException.java:51)
    at .create(ZooKeeper.java:783)
    at .$1.run(ZKRMStateStore.java:326)
    at .$1.run(ZKRMStateStore.java:322)
    at .$nWithCheck(ZKRMStateStore.java:1174)
    at .$nWithRetries(ZKRMStateStore.java:1207)
    at .createRootDir(ZKRMStateStore.java:336)
    at .createRootDirRecursively(ZKRMStateStore.java:1311)
    at .startInternal(ZKRMStateStore.java:303)
    at .serviceStart(RMStateStore.java:598)
    at .AbstractService.start(AbstractService.java:193)
    at .$rviceStart(ResourceManager.java:593)
    at .(ResourceManager.java:1008)
    at .$1.run(ResourceManager.java:1049)
    at .$1.run(ResourceManager.java:1045)
    at (Native Method)
    at .doAs(Subject.java:422)
    at .UserGroupInformation.doAs(UserGroupInformation.java:1869)
    at .(ResourceManager.java:1045)
    at .(ResourceManager.java:1085)
    at .(ResourceManager.java:1229)

Here are some of the errors for the respective components:

Oozie: Stack trace for the error was (for debug purposes):

    RemoteException(java.io.IOException): File /user/oozie/share/lib/lib_20180518045451/oozie/jackson-databind-2.4.4.jar could only be replicated to 0 nodes instead of minReplication (=1). There are 2 datanode(s) running and 2 node(s) are excluded in this operation.

HiveServer2:

    raise WebHDFSCallException(err_msg, result_dict)
    Resource_resource.WebHDFSCallException: Execution of 'curl -sS -L -w '%' -X PUT -data-binary -H 'Content-Type: application/octet-stream' ''' returned status_code=403.
    "message": "File /apps/zeppelin/zeppelin-spark-dependencies_2.11-0.7.3.2.6.4.0-91.jar could only be replicated to 0 nodes instead of minReplication (=1). There are 3 datanode(s) running and 3 node(s) are excluded in this operation.\n\tat
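Both the Oozie and the HiveServer2 errors above share one detail: the number of excluded nodes equals the number of running datanodes. That can happen when the writing client cannot reach any datanode, although full or unhealthy datanodes are another possible cause. The sketch below pulls those counts out of such a message so the pattern is easy to spot across many logs. The names `ReplicationFailure` and `parse_replication_failure` are illustrative, not part of any Hadoop API, and the regular expression is written only against the message wording quoted above.

```python
import re
from typing import NamedTuple, Optional


class ReplicationFailure(NamedTuple):
    path: str
    running: int
    excluded: int

    @property
    def all_excluded(self) -> bool:
        # True when every running datanode was excluded for this write.
        return self.running > 0 and self.excluded >= self.running


# Matches the wording of the error messages quoted above.
_PATTERN = re.compile(
    r"File (?P<path>\S+) could only be replicated to 0 nodes "
    r"instead of minReplication \(=\d+\)\.\s+"
    r"There are (?P<running>\d+) datanode\(s\) running and "
    r"(?P<excluded>\d+) node\(s\) are excluded in this operation"
)


def parse_replication_failure(message: str) -> Optional[ReplicationFailure]:
    """Extract path and datanode counts, or None if the text does not match."""
    match = _PATTERN.search(message)
    if match is None:
        return None
    return ReplicationFailure(
        path=match["path"],
        running=int(match["running"]),
        excluded=int(match["excluded"]),
    )


if __name__ == "__main__":
    sample = (
        "RemoteException(java.io.IOException): File "
        "/user/oozie/share/lib/lib_20180518045451/oozie/jackson-databind-2.4.4.jar "
        "could only be replicated to 0 nodes instead of minReplication (=1). "
        "There are 2 datanode(s) running and 2 node(s) are excluded in this operation."
    )
    print(parse_replication_failure(sample))
```

If `all_excluded` is true for every failing write, checking network reachability between the client and the datanodes is a reasonable next step before looking at disk space.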