GitHub and HDFS Integration
GitHub is how people build software. With a community of more than 10 million people, developers can discover, use, and contribute to over 26 million projects using a powerful collaborative development workflow.
HDFS is a Java-based file system that provides scalable and reliable data storage, and it was designed to span large clusters of commodity servers. HDFS has demonstrated production scalability of up to 200 PB of storage and a single cluster of 4500 servers, supporting close to a billion files and blocks.