Distributed file system, which one would you consider?

To:	"LinuxVirtualServer.org users mailing list." <lvs-users@xxxxxxxxxxxxxxxxxxxxxx>
Subject:	Distributed file system, which one would you consider?
From:	Jan Klopper <janklopper@xxxxxxxxx>
Date:	Fri, 22 Apr 2005 14:05:18 +0200

Hi,

Im currently running a uber high usage LAMP cluster,

This cluster does a few thousand small reads per Sec, and thus i useReiserFS for local reads, since this makes sure the reads are as fast aspossible.

I use Unison to replicate all changes from the FTP server (not insidethe cluster) to all of the cluster nodes and back each 2 minutes, Thisgives me two problems and also some advantages.

Problems:

2 minute wait time, (more if the 3th server updates the FTP servertrough unison and the second server only gets the file the next round)If the server updates the files, or create new ones, (for example acache file) they won;t get propogated to the other servers for a fweminutes.Unison is pretty cpu and bandwidth hungry, hence the 2 minuteinterval decided upon.

Advantages:

All servers use the same mysql, and thus they will create the samecache files anyway.All servers use their own super speedy local ReiserFS storage. (asif they were not in any cluster at all)


Now i looked at coda, but it doens't support 2 way replication properly.

(And i want it to do multiple updates, to elliminate the center masternode. (eg,update to both of its neigbours))GFS, needs special storage hardware, and i don't have that. nor do ithink its the way forward for linux clusters.

Some others don't look production ready, or don't look designed for thiswork.


What i would see as the prefect solution would be something like this:

Place hooks in the filesystem, to run the cluster tool on write.

The cluster tool propogates the changes to both neighbouring servers,and also sends an unique ID.The tool stores the ID, as handled for a while. (100.000 max toensure dos attack is not possible?)

The other servers cluster tools listen and receive the ID, check to seeif they handled it previously, and since they didn't ask for the files.The servers write the files, without triggering their own updatemechanism, but trigger a propgation tool with the received ID.This would update files around the entire cluster, would haveconfigurable paths, and would not give any problems with serversupdating in an endless loop.

I know this doesn't handle file locks etc. But i think it handles mostsimple scenarios. (LAMP apps for example)


Any toughts on this?

greets
Jan

<Prev in Thread]	Current Thread	[Next in Thread>
Distributed file system, which one would you consider?, Jan Klopper <= Re: Distributed file system, which one would you consider?, Mack . Joseph Re: Distributed file system, which one would you consider?, Patrick Walsh Re: Distributed file system, which one would you consider?, Nate Carlson Re: Distributed file system, which one would you consider?, gan hawk

Previous by Date:	Re: [ANNOUNCE] Ultra Monkey 3, Horms
Next by Date:	Re: Distributed file system, which one would you consider?, Mack . Joseph
Previous by Thread:	One Real IIS Server & Multiple Web Sites, Brad Taylor
Next by Thread:	Re: Distributed file system, which one would you consider?, Mack . Joseph
Indexes:	[Date] [Thread] [Top] [All Lists]