How to connect to hopsworks instance created on single machine by an image

I have a single machine instance created from an ova image. There is no api key generator module. I wish to connect to this instance from python environment from my local machine. Can you help me with the fs.connect()

Hi amritsh,

What version of Hopsworks did you install?

the image was- hopsworks-0.10.0.ova

0.10 is an old release and does not support api keys. Please follow our instructions to install the latest version: https://hopsworks.readthedocs.io/en/latest/getting_started/installation_guide/installation.html

Is there an image available for newer version? The normal single machine setup is getting stuck

We stopped supporting ova. In what way is the setup getting stuck?

is it possible to change the port for hopsworks UI?
usually it’s deployed at 9090, but that is already being used
we are following this -> https://hopsworks.readthedocs.io/en/latest/getting_started/installation_guide/platforms/baremetal.html#quickstart-single-host-installation

There’s a karamel UI at 9090 port and I am getting these logs

(base) [subexgpu5@master subex_dss]$ tail -f installation.log
ERROR [2020-09-10 16:22:37,816] se.kth.karamel.webservice.KaramelServiceApplication: An instance of Karamel is already running. Exiting…
ERROR [2020-09-10 16:22:37,820] se.kth.karamel.webservice.KaramelServiceApplication: java.net.BindException: Address already in use (Bind failed)
! java.net.BindException: Address already in use (Bind failed)
! at java.net.PlainSocketImpl.socketBind(Native Method) ~[na:1.8.0_161]
! at java.net.AbstractPlainSocketImpl.bind(AbstractPlainSocketImpl.java:387) ~[na:1.8.0_161]
! at java.net.ServerSocket.bind(ServerSocket.java:375) ~[na:1.8.0_161]
! at java.net.ServerSocket.(ServerSocket.java:237) ~[na:1.8.0_161]
! at se.kth.karamel.webservice.KaramelServiceApplication.(KaramelServiceApplication.java:111) ~[karamel-ui-0.6.jar:na]

Karamel can unfortunately not use any other port than 9090.

If there is already an instance of karamel running then you should be able to identify the process with netstat -tulpen | grep 9090 and then kill it or use jps -l to find the relevant process.

I stopped the process using 9090 port and reran the setup. There’s just a karamel ui at port 9090, nothing like the hopsworks page

Karamel is the installer. Wait for it to finish and Hopsworks will appear on port 443. You can check the state of the installation in the karamel ui under Menu, Terminal, status.

the karamel status is stuck at-

Hops is RUNNING_DAG but it is on pause.

Passed Phases:


| Phase | Status | Duration|

  1. | PRECLEANING | SUCCEED| 2 |
  2. | FORKING_GROUPS | SUCCEED| 1 |
  3. | FORKING_MACHINES| SUCCEED| 131 |

Tasks’ Status:


| OS Family| Public IP | Private IP | SSH Port| SSH User | Life Status| Task Status|

| ? | 10.113.41.11| 10.113.41.11| 22 | subexgpu5| FORKED | ONGOING |
__________________________________________________________________
| Task | Status | Actions| Duration(ms)|
==================================================================

  1. | find os-type | READY | | |
  2. | apt-get essentials | WAITING| | |
  3. | install chefdk | WAITING| | |
  4. | make solo.rb | WAITING| | |
  5. | clone and vendor hopsworks-chef| WAITING| | |
  6. | epipe::install | WAITING| | |
  7. | epipe::default | WAITING| | |
  8. | hops_airflow::install | WAITING| | |
  9. | hops_airflow::default | WAITING| | |
  10. | hops_airflow::sqoop | WAITING| | |
  11. | hopslog::install | WAITING| | |
  12. | hopslog::_filebeat-spark | WAITING| | |
  13. | hopslog::_filebeat-beam | WAITING| | |
  14. | hopslog::default | WAITING| | |
  15. | hopslog::_filebeat-serving | WAITING| | |
  16. | hopslog::_filebeat-kagent | WAITING| | |
  17. | hops::install | WAITING| | |
  18. | hops::dn | WAITING| | |
  19. | hops::rm | WAITING| | |
  20. | livy::default | WAITING| | |
  21. | hops::nm | WAITING| | |
  22. | hops::jhs | WAITING| | |
  23. | flink::yarn | WAITING| | |
  24. | flink::historyserver | WAITING| | |
  25. | tensorflow::default | WAITING| | |
  26. | hadoop_spark::yarn | WAITING| | |
  27. | hadoop_spark::historyserver | WAITING| | |
  28. | hadoop_spark::certs | WAITING| | |
  29. | hive2::default | WAITING| | |
  30. | hops::nn | WAITING| | |
  31. | hopsworks::install | WAITING| | |
  32. | hopsworks::default | WAITING| | |
  33. | kagent::default | WAITING| | |
  34. | consul::master | WAITING| | |
  35. | kzookeeper::default | WAITING| | |
  36. | kkafka::default | WAITING| | |
  37. | elastic::default | WAITING| | |
  38. | hopsmonitor::default | WAITING| | |
  39. | hops::ndb | WAITING| | |
  40. | kkafka::install | WAITING| | |
  41. | kzookeeper::install | WAITING| | |
  42. | conda::install | WAITING| | |
  43. | conda::default | WAITING| | |
  44. | hopsmonitor::install | WAITING| | |
  45. | hopsmonitor::prometheus | WAITING| | |
  46. | hopsmonitor::node_exporter | WAITING| | |
  47. | hopsmonitor::alertmanager | WAITING| | |
  48. | hopsmonitor::purge_telegraf | WAITING| | |
  49. | ndb::install | WAITING| | |
  50. | ndb::ndbd | WAITING| | |
  51. | ndb::mysqld | WAITING| | |
  52. | ndb::mgmd | WAITING| | |
  53. | elastic::install | WAITING| | |
  54. | livy::install | WAITING| | |
  55. | hadoop_spark::install | WAITING| | |
  56. | tensorflow::install | WAITING| | |
  57. | flink::install | WAITING| | |
  58. | kagent::install | WAITING| | |
  59. | consul::install | WAITING| | |
  60. | hive2::install | WAITING| | |

also the installation logs-

ERROR [2020-09-11 06:09:05,449] se.kth.karamel.backend.machines.MachinesMonitor:
! net.schmizz.sshj.userauth.UserAuthException: Exhausted available authentication methods
! at net.schmizz.sshj.SSHClient.auth(SSHClient.java:225) ~[sshj-0.20.0.jar:na]
! at net.schmizz.sshj.SSHClient.authPublickey(SSHClient.java:340) ~[sshj-0.20.0.jar:na]
! at net.schmizz.sshj.SSHClient.authPublickey(SSHClient.java:359) ~[sshj-0.20.0.jar:na]
! at se.kth.karamel.backend.machines.SshMachine.connect(SshMachine.java:491) ~[karamel-core-0.6.jar:na]
! … 7 common frames omitted
! Causing: se.kth.karamel.common.exception.KaramelException: 10.113.41.11: Authentication problem using ssh keys.
! at se.kth.karamel.backend.machines.SshMachine.connect(SshMachine.java:523) ~[karamel-core-0.6.jar:na]
! at se.kth.karamel.backend.machines.SshMachine.ping(SshMachine.java:549) ~[karamel-core-0.6.jar:na]
! at se.kth.karamel.backend.machines.MachinesMonitor.run(MachinesMonitor.java:101) ~[karamel-core-0.6.jar:na]
! at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_161]
! at java.util.concurrent.FutureTask.run(FutureTask.java:266) [na:1.8.0_161]
! at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [na:1.8.0_161]
! at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [na:1.8.0_161]
! at java.lang.Thread.run(Thread.java:748) [na:1.8.0_161]

Does the user you use for installation have sudo rights? Also can you check if it is able to ssh into localhost?

Yes, the user has sudo privelage and can ssh onto localhost

The error is indicating that karamel is failing to SSH into localhost. It does that for running the setup. Can you double-check that ssh for that user is set up passwordless and localhost was added to trusted host by sshing into localhost with that user?

That helped, Thanks!

the “conda::install” task keeps failing, even after “retry”

REDHAT task has been in “paused” for some time

Tasks’ Status:


| OS Family| Public IP | Private IP | SSH Port| SSH User | Life Status| Task Status|

| REDHAT | 10.113.41.11| 10.113.41.11| 22 | subexgpu5| CONNECTED | PAUSED |

Can you send the logs/error from the conda install task?