Subject: notes from todays meeting |
From: Wahid Bhimji <wbhimji@staffmail.ed.ac.uk> |
Date: 27/08/2014 10:40 |
To: "SC) Jensen Jens (STFC RAL" <jens.jensen@stfc.ac.uk> |
wahid: (27/08/2014 10:01)
https://indico.cern.ch/event/324705/page/0
https://ggus.eu/index.php?mode=ticket_info&ticket_id=107884
Ewan Mac Mahon: (10:06 AM)
Apache is running on ~20% of my nodes.
It's variously dead or stopped on all the others.
Samuel Cadellin Skipsey: (10:07 AM)
We actually have a service to restart our httpd to keep them up.
wahid: (10:07 AM)
http://dashb-atlas-ssb.cern.ch/dashboard/request.py/siteview#currentView=FAX+endpoints&highlight=false
Ewan Mac Mahon: (10:08 AM)
Speaking of test failures; are there autmated tests for the webdav? Because if it doesn't appear in the dashboard, it's not realy reasonable to consider it a production service.
John Bland: (10:08 AM)
our puppet keeps httpd going on our headnode, but not the nodes. I've not seen any crashing
Duncan Rand: (10:09 AM)
The fact the chat window has no history is irritating...
Ah, if I type something it all magically appears..
Jeremy Coles: (10:10 AM)
Agreed. I have fed a few issues back.
Ewan Mac Mahon: (10:10 AM)
Ceph, ceph, ceph, ceph, ceph.
That's about it for storage.
Jeremy Coles: (10:10 AM)
back = fedback.
Duncan Rand: (10:12 AM)
Ewan: which 'dashboard' are you referring to?
Ewan Mac Mahon: (10:12 AM)
The one that makes the ROD people ticket sites when things break.
Bah, humbug.
Maybe CephFS and StoRM?
Clearly the way of the future.
Gareth Douglas Roy: (10:26 AM)
Yeah but this means anything you want to use has to provide POSIX......
Ewan Mac Mahon: (10:31 AM)
That's a wonderful use of the phrase 'well understood' :-)
Post a link to the list?
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.