I am installing Hopsworks on Ubuntu 18.04 in an Azure environment. I am getting the error below during the kagent::default installation.
time="2021-12-15T09:14:30Z" level=error msg="Failed to perform HTTP operation - status: 404 Retrying... {\"type\":\"restApiJsonResponse\",\"errorCode\":100025,\"errorMsg\":\"Host was not found.\",\"usrMsg\":\"hostname: h1.a.com\"}"
Please find below my /etc/hosts file
127.0.0.1 localhost
10.0.0.4 h1.a.com
# The following lines are desirable for IPv6 capable hosts
::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
ff02::3 ip6-allhosts
I think your issue is quite similar to my previous one: Hopsworks-installer.sh kagent:defaut register host error
Can you double-check which IP the kagent host runs on? Then you can change the localhost entry to point to that IP
and restart hostnamed:
sudo systemctl restart systemd-hostnamed
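To see which IP a host name currently maps to in /etc/hosts, a small script can help. This is only a minimal sketch: the function name `lookup_hosts` and the sample text are made up for illustration, and it reads hosts-file text directly rather than asking the system resolver, so it shows what the file says and not necessarily what kagent sees.

```python
def lookup_hosts(hosts_text, hostname):
    """Return every IP address that the given hosts-file text maps to hostname."""
    matches = []
    for raw in hosts_text.splitlines():
        # Drop comments and surrounding whitespace; skip empty lines.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        # First field is the address, the rest are names and aliases.
        ip, *names = line.split()
        if hostname in names:
            matches.append(ip)
    return matches


# Sample text copied from the first post in this thread.
sample = """\
127.0.0.1 localhost
10.0.0.4 h1.a.com
"""
print(lookup_hosts(sample, "h1.a.com"))
```

To run it against the real file, pass `open("/etc/hosts").read()` instead of `sample`.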
@Yingding
Please find below my /etc/hosts:
127.0.0.1 localhost 127.0.0.6
127.0.0.6 h1.a.com
I am still facing the issue.
I also observed that nothing is running on port 443, where the web server is supposed to run.
Some more details about your installation might help. Are you using the hopsworks-installer.sh from branch 2.4, or the hopsworks-cloud-installer.sh? I used hopsworks-installer.sh on my on-prem instance.
Here are a couple of things you should double-check before you start your installation.
- Are egress ports 443 and 80 open on your system?
- Is your sudo user set with NOPASSWD in /etc/sudoers with %sudo ALL=(ALL) NOPASSWD:ALL?
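If you are unsure about the egress ports, a quick outbound connection test can confirm it. The following is a rough sketch, not part of the Hopsworks installer: `can_connect` is a hypothetical helper, and you would need to pick a target host that your installation really has to reach.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

For example, `can_connect("example.com", 443)` and `can_connect("example.com", 80)` (with `example.com` as a placeholder) should both return `True` if outbound traffic on those ports is allowed.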
Here is some information to help debug your installation errors:
- Did you get output from hopsworks-installer.sh such as "The Karamel installer UI will soon start at private IP: http://Your_IP:9090/index.html"?
- Can you connect to http://Your_IP:9090/index.html during the installation?
- While you are connected to http://Your_IP:9090/index.html, please select menu -> Terminal -> List Clusters to examine errors during the DAG installation and restart the install tasks.
If you can see a detailed error, it would be great if you could post it in this thread.
I also observed that nothing was running in 443 port where they are running the web server
Port 443 is served by the GlassFish Java web server, where the Hopsworks feature store runs.
With
sudo systemctl status glassfish-domain1
you can see whether any GlassFish systemd service is active.
Wish you further success in your installation.
127.0.0.1 localhost 127.0.0.6
127.0.0.6 h1.a.com
Sorry, my earlier suggestion to point localhost to the DNS IP was misleading. What I meant is:
127.0.0.1 localhost
<your host ip or your dns name> <your host name>
...
Is your host name h1.a.com or h1?
Normally the host name points to 127.0.0.1 in a standard Ubuntu installation, but the kagent service listens on the host IP. Your issue might also be that the kagent service is not running, so you should log in to Karamel during the installation and get more details about the error.
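As a rough way to spot the two problems discussed here (an IP address placed in the alias column, and the host name mapped to a loopback address), something like the sketch below could be run over the hosts file. The helper names are hypothetical, and treating a loopback mapping as a problem is an assumption based on the point that kagent listens on the host IP.

```python
import ipaddress


def _is_ip(token):
    """Return True if token parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(token)
        return True
    except ValueError:
        return False


def check_hosts(hosts_text, hostname):
    """Return a list of human-readable warnings for suspicious hosts entries."""
    problems = []
    for number, raw in enumerate(hosts_text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # ignore comments and blank lines
        if not line:
            continue
        ip, *names = line.split()
        # An address in the name column is almost certainly a typo.
        for name in names:
            if _is_ip(name):
                problems.append(f"line {number}: '{name}' is an IP address used as a host name")
        # Assumption: the host name should map to the host IP, not to loopback.
        if hostname in names and _is_ip(ip) and ipaddress.ip_address(ip).is_loopback:
            problems.append(f"line {number}: {hostname} maps to loopback address {ip}")
    return problems
```

Calling `check_hosts(open("/etc/hosts").read(), "h1.a.com")` returns an empty list when neither pattern is found.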