Integrate HDFS with Amazon Redshift
HDFS (the Hadoop Distributed File System) is a Java-based file system that provides scalable, reliable data storage and is designed to span large clusters of commodity servers. HDFS has demonstrated production scalability of up to 200 PB of storage and a single cluster of 4,500 servers, supporting close to a billion files and blocks.
About Amazon Redshift
Amazon Redshift is a fast, fully managed data warehouse that makes it simple and cost-effective to analyze all your data using your existing business intelligence tools. But there’s one big issue: how do you get your data into Redshift in the first place?
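One common way to answer that question, offered here only as a rough sketch and not as the method this page goes on to describe, is to stage the HDFS files in Amazon S3 (for example with `hadoop distcp`) and then run Redshift's `COPY` command against the staged files. The helper below just assembles such a `COPY` statement as a string. The function name, table name, bucket, and IAM role ARN are all hypothetical placeholders, and the sketch assumes the files are already in S3 in CSV, Parquet, or ORC format.

```python
def build_copy_statement(table, s3_uri, iam_role_arn, data_format="CSV"):
    """Return a Redshift COPY statement that loads staged S3 files into a table.

    All arguments are illustrative; nothing here connects to AWS.
    """
    # Only formats that need no extra COPY options are allowed in this sketch.
    allowed_formats = {"CSV", "PARQUET", "ORC"}
    fmt = data_format.upper()
    if fmt not in allowed_formats:
        raise ValueError(f"unsupported format for this sketch: {data_format}")
    if not s3_uri.startswith("s3://"):
        raise ValueError("s3_uri must start with 's3://'")

    return (
        f"COPY {table} "
        f"FROM '{s3_uri}' "
        f"IAM_ROLE '{iam_role_arn}' "
        f"FORMAT AS {fmt};"
    )


if __name__ == "__main__":
    # Hypothetical values: replace with your own table, bucket, and role.
    print(
        build_copy_statement(
            table="sales",
            s3_uri="s3://example-bucket/staging/",
            iam_role_arn="arn:aws:iam::123456789012:role/ExampleRole",
            data_format="parquet",
        )
    )
```

The generated statement would then be run through any SQL client connected to the Redshift cluster. Building the string separately keeps the example free of credentials and network calls; a real pipeline would also need to handle the HDFS-to-S3 copy, schema creation, and error handling.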