I've been thinking about what kind of content would be useful here. Those of you who know me know that I'm happy to talk about almost anything. But not all things are useful, and there are more useful things to talk about than I have time to talk about. So, here are some of my thoughts from recent conversations with people and reading Russell Coker's HA blog postings[1], and a request[2] from Emily Ratliff's blog.
- Finish the series on Disaster Recovery
- Talk about the capabilities of the policies you can express in Linux-HA's[3] CIB
- How repeated failures accumulate in failure stickiness
- The importance of civility and friendliness in projects - how Linux-HA[3] has succeeded and failed in this way, and why
- Fencing (STONITH[4]) discussion
- Quorum and the relationship to STONITH (fencing)
- HA and virtualization - HA as virtualization, virtualization as HA[5].
- Linux-HA[3] capability tour
- HA/DR training and certification
- Single System Image - do you need it?
- Tracking Linux-HA installs - how many people use it and how do you figure it out?
Please chime in with your thoughts and suggestions on which of these (or other) topics interest you.
PS: Comments should be turned on for the blog now. Sorry they weren't on at first for my previous posting.
Links:
[1] http://etbe.coker.com.au/category/ha/
[2] http://www.ratliff.net/blog/index.php/2007/09/10/alan-robertsons-new-blog-on-managing-computers/
[3] http://linux-ha.org/
[4] http://linux-ha.org/STONITH
[5] http://www.byteandswitch.com/document.asp?doc_id=133643&WT.svl=news1_3
Hi Alan,
I'd like to register a request for you to blog about single system failure. Do you like any of the daemon restart programs - dwatch, monit, etc? Do you have a different recommended solution? Can heartbeat monitor daemons on a single system?
Thanks!
Emily
Posted by: Emily Ratliff | 14 September 2007 at 08:39