Integrate HDFS with Amazon Kinesis Firehose
HDFS is a Java-based file system that provides scalable and reliable data storage, and it was designed to span large clusters of commodity servers. HDFS has demonstrated production scalability of up to 200 PB of storage and a single cluster of 4500 servers, supporting close to a billion files and blocks.
About Amazon Kinesis Firehose
Amazon Kinesis Firehose is a fully managed service for delivering real-time streaming data to destinations such as Amazon Simple Storage Service (Amazon S3) and Amazon Redshift.
With Firehose, you do not need to write any applications or manage any resources. You configure your data producers to send data to Firehose and it automatically delivers the data to the destination that you specified.
Amazon Kinesis Firehose is the easiest way to load streaming data into AWS.
Popular Use Cases
Bring all your Amazon Kinesis Firehose data to Amazon Redshift
Load your Amazon Kinesis Firehose data to Google BigQuery
ETL all your Amazon Kinesis Firehose data to Snowflake
Move your Amazon Kinesis Firehose data to MySQL