James Westby james.westby@canonical.com writes:
On Mon, 02 Jul 2012 17:31:08 +1200, Michael Hudson-Doyle michael.hudson@canonical.com wrote:
Multi-machine data
Accessing postgres from another machine is a solved problem, to put it mildly :-)
I don't have a good idea on how to access media files across the network. In an ideal world, we would have a Django storage backend that talked to something like http://ceph.com/ceph-storage/object-storage/ or http://hadoop.apache.org/hdfs/ -- we don't need anything like full file system semantics -- but for now, maybe just mounting the media files over NFS might be the quickest way to get things going.
Is it possible to have the other machines explicitly fetch the files they need from the current primary machine?
All (or at least >1) nodes need to write.
That avoids having to set up infrastructure to share files in this manner at the cost of having the fetches going on (which should be reasonably quick)
Otherwise that would be a great idea :-)
We could do something like have all files be written locally (only one node will need to write to a given file) and served over http to other nodes but eh. I don't want to implement something myself here.
If implicit sharing is wanted/needed then ceph/hdfs seems to be a great way to go, or nfs to start with as you say.
Cheers, mwh