Google Cloud Spanner

HDFS and Google Cloud Spanner Integration

About HDFS

HDFS is a Java-based file system that provides scalable and reliable data storage, and it was designed to span large clusters of commodity servers. HDFS has demonstrated production scalability of up to 200 PB of storage and a single cluster of 4500 servers, supporting close to a billion files and blocks.

About Google Cloud Spanner

Google Spanner is a Google Cloud-based database system that is ACID compliant, horizontally scalable, and global. Spanner is the database that underpins much of Google’s own data collection, and it has been designed to offer the consistency of a relational database along with the freedom of a non-relational one. To help facilitate high availability, Spanner includes automatic synchronous replication, which allows for your data to always be up-to-date while still being as consistent as it would be with a traditional relational database. Additionally, just like any relational database, Spanner can handle SQL queries and schemas, and it even supports an array of languages.
Since Google developed Spanner for its own global use, it has a built in global functionality, which allows spanner to scale to fit your needs. Your data can be stored on five nodes in one region or two hundred thousand nodes scattered around the world. This means your data is available whenever and wherever you need it. Spanner facilitates the dataflow for the likes of Adwords and Gmail, so it’s built to meet your business needs no matter how globally-reaching your company is.