Have you considered mongodb? It's a no-sql that internally uses a binary form of json. It provides some sql-ish query functionality, and indexing. Json allows most of the benefits of XML (structured documents) in a much more compact format. It is scalable (sharding, replication), and has a map-reduce functionality built right into it.