[INSTRM-935] Make actors re-connect after tron restart Created: 30/Mar/20  Updated: 30/Mar/20

Status: Open
Project: Instrument control development
Component/s: tron_actorcore
Affects Version/s: None
Fix Version/s: None

Type: Task Priority: Normal
Reporter: cloomis Assignee: cloomis
Resolution: Unresolved Votes: 0
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original Estimate: Not Specified

Story Points: 3

 Description   

The machine which hosts tron at JHU rebooted. The actors running on other machines did not re-connect after the server came back up. I'm not sure whether this ever worked, or whether we just assumed that that machine rebooting signals something dire.

Note that at JHU, one machine does everything: tron, postgresql, the archiver, DHCP, DNS, NFS, etc. etc. So it may be that things cannot recover cleanly only at JHU.

The quick workaround is to reboot the client machines (at JHU, only the BEEs) and let ics_launch reconnect things.

In any case, look into it.


Generated at Sat Feb 10 16:30:10 JST 2024 using Jira 8.3.4#803005-sha1:1f96e09b3c60279a408a2ae47be3c745f571388b.