From: snoze pa (snoze.pa_at_gmail.com)
Date: Fri Sep 30 2005 - 10:35:22 CDT
Hi,
I am a new user of namd and am running it on my 15-node cluster. When I
run it with a nodelist, I get an error message.
I think this error is related to the nodelist information. The main
machine from which I run namd has the IP address
210.10X.XX.XXX, while my nodes have IP addresses in the series 192.XX.XXX.1,
192.XX.XXX.2, ..., 192.XX.XXX.15.
When I run namd, it prints the following messages:
Charmrun> adding client 0: "node1", IP:192.XX.XXX.1
Charmrun> adding client 1: "node2", IP:192.XX.XXX.2
Charmrun> adding client 2: "node3", IP:192.XX.XXX.3
...
...
Charmrun> adding client 8: "node9", IP:192.XX.XXX.15
Charmrun> Charmrun = 210.10X.XX.XXX, port = 43420
Charmrun> Sending "0 210.10X.XX.XXX 43420 12445 0" to client 0.
Charmrun> find the node program "/usr/home/DEL/namd2" at
"/usr/home/DEL/10zero" for 0.
Charmrun> Starting rsh node1 -l XXX /bin/sh -f
Charmrun> Sending "1 210.10X.XX.XXX 43420 12445 0" to client 1.
Charmrun> find the node program "/usr/home/namd2" at "/usr/home/DEL/10zero"
for 1.
Charmrun> Starting rsh node2 -l XXX /bin/sh -f
Charmrun> Sending "2 210.10X.XX.XXX 43420 12445 0" to client 2.
......
......
but at the end it prints the following messages:
node3: Connection refused
node5: Connection refused
node6: Connection refused
node4: Connection refused
node2: Connection refused
node7: Connection refused
node9: Connection refused
node8: Connection refused
node12: Connection refused
node14: Connection refused
node13: Connection refused
node15: Connection refused
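
A "Connection refused" reply usually means that nothing is listening on the port the client tried. The log above shows charmrun starting the nodes with rsh, so one way to narrow this down is to test from the master machine whether each node accepts TCP connections on the rsh port. The following is a minimal sketch, not part of namd or charmrun. It assumes the standard rsh port 514 and that the node names resolve from the master; the function names are made up for illustration.

```python
import socket


def port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds, else False."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name lookup failures.
        return False


def check_nodes(hosts, port=514, timeout=2.0):
    """Map each host name to whether the given port accepted a connection.

    Port 514 is the conventional rsh (shell) port; this is an assumption
    about how the cluster is set up. Use 22 to test ssh instead.
    """
    return {host: port_open(host, port, timeout) for host in hosts}


# Example call (node names taken from the nodelist in this message):
#   check_nodes(["node%d" % i for i in range(1, 16)])
```

A node that reports False on port 514 but True on port 22 would suggest that only ssh is enabled there, not rsh.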
My main directory (on the master node), from which I run namd, is /usr/home/DEL/
and on the nodes the corresponding directory is
/cluster-share/DEL
and my nodelist has the following entries:
host node1
host node2
......
......
host node14
host node15
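
For reference, the nodelist format described in the Charm++ and NAMD documentation starts with a "group main" line followed by one "host" line per machine. The entries above do not show a group line, though it may simply be omitted from this excerpt. Below is a small sketch that writes out such a list for node1 to node15. The helper name make_nodelist is hypothetical, and the output covers only the basic layout, with no per-host options.

```python
def make_nodelist(hosts, group="main"):
    """Build nodelist text: a 'group' header, then one 'host' line per node."""
    lines = ["group %s" % group]
    lines += ["host %s" % host for host in hosts]
    return "\n".join(lines) + "\n"


# Node names follow the node1..node15 pattern used in this message.
nodes = ["node%d" % i for i in range(1, 16)]
nodelist_text = make_nodelist(nodes)
print(nodelist_text, end="")
```

The printed text can be redirected to a file and passed to charmrun as the nodelist. It does not address the difference between the directory on the master node and the one on the nodes.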
Any suggestions about the "Connection refused" error message? Or could anyone
provide me with a nodelist based on the above information?
Thanks in advance,
snoze