Welcome to Xplenty's Blog

All things data

Cloud vs. On-Prem Data Warehouse

Cloud vs. On-Prem Data Warehouse

Data warehouses (DWHs) are widely recognized as essential components of business intelligence and analytics operations. But the question of whether the optimal deployment route is in the cloud or on-premise...

Xplenty Asana Integration

Xplenty Asana Integration

Xplenty, swoops in as a superhero in the world of ETL, to save you time (which means money) getting all that data into your company’s analytic data store. Grab data...

Xplenty HipChat Integration

Xplenty HipChat Integration

Xplenty integration with HipChat is super fast and incredibly easy using Xplenty’s HipChat hooks. Xplenty’s HipChat hooks allow you to get real-time updates about your account activity into your HipChat...

Xplenty Salesforce Integration

Xplenty Salesforce Integration

Xplenty's integration with Salesforce is the easiest way to read and process your data from Salesforce and move it to an analytic data stores such as Amazon Redshift or Google...

Xplenty PagerDuty Integration

Xplenty PagerDuty Integration

We already integrate with many tools to reduce the bulk of time you likely spend on cleaning your data and preparing it for analytics. We want to continue to make...

Xplenty Slack Integration

Xplenty Slack Integration

At Xplenty, we aim to make moving and processing data as quick and seamless as possible.
We integrate with a ton of tools your teams already uses to make...

Who said ETL has to be hard?

Who said ETL has to be hard?

ETL is a critical, necessary process for almost all analytics projects. But the harsh reality is that ETL is complicated, with many challenges along the way, and implementation can be...

5 Platforms for Collecting Big Data

5 Platforms for Collecting Big Data

Everything comes as a service these days, and so does collecting Big Data. Various platforms on the web are happy to take data collection off your coding hands, making it...

Amazon Redshift Review 2015

Amazon Redshift Review 2015

Happy birthday to Redshift! Amazon’s data warehouse-as-a-service has just celebrated two years of data querying. Several reviews were written about Redshift at the time, but as far as we know,...

Spark vs. Tez: What's the Difference?

Spark vs. Tez: What's the Difference?

On paper, Spark and Tez have a lot in common: both possess in-memory capabilities, can run on top of Hadoop YARN and support all data types from any data sources....

Top 5 Big Data Events

Top 5 Big Data Events

Every year, dozens of Big Data conferences take place all over the world, from San Francisco to Shanghai. Now that 2015 is finally here, it’s time to open up your...

Xplenty’s Data Processing Survey

Xplenty’s Data Processing Survey

We want to know more about the Big Data community: what causes them headaches? What makes them happy? Which tools and technologies do they use? When we had our booth...

Top 7 Hadoop Blogs for 2014

Top 7 Hadoop Blogs for 2014

People talk a lot about Hadoop, and we like to keep up to date with the latest gossip by reading Hadoop blogs. If you'd also like to jump into the...

Top 3 Updates from AWS re:Invent 2014

Top 3 Updates from AWS re:Invent 2014

Amazon had some great news at AWS re:Invent 2014 in Vegas—and we were there to hear it. Most of them were geared toward enterprises and developers, but three stood out...

Xplenty Gmail Action Buttons

Xplenty Gmail Action Buttons

Xplenty notification emails now include “View Job” and “View Cluster” action buttons. The buttons are available at the inbox level, so you can easily interact with jobs and clusters in...

Spark vs. Hadoop MapReduce

Spark vs. Hadoop MapReduce

Apache Spark is setting the world of Big Data on fire. With a promise of speeds up to 100 times faster than Hadoop MapReduce and comfortable APIs, some think this...

5 Hadoop Security Projects

5 Hadoop Security Projects

Following our post about Hadoop security for the enterprise, or the lack thereof, one of the ways to make Hadoop more secure is by installing an additional platform. Five major...

Is Hadoop Secure for the Enterprise?

Is Hadoop Secure for the Enterprise?

While Hadoop has proved its power for scalable storage and processing of Big Data, it may not be enterprise-ready when it comes to security. Hortonworks, Cloudera and MapR address this...

Hadoop YARN Turns One: Hadoop Renaissance

Hadoop YARN Turns One: Hadoop Renaissance

These days, there is a renaissance of Hadoop-based Big Data projects: Impala, Spark, Storm, Flink and HBase as well as several SQL-on-Hadoop tools. Most of these projects are still in...

Transform Data from Amazon RDS with Xplenty

Transform Data from Amazon RDS with Xplenty

How do you integrate data from Amazon RDS (Relational Database Service) with data from other sources such as S3, Redshift, or even MongoDB? The answer is Xplenty. Our data integration...

Offload Redshift ETL to Xplenty

Offload Redshift ETL to Xplenty

Xplenty can read data from SQL Server, MongoDB, SAP HANA, and many more data stores. One of the many DBs Xplenty integrates with is Redshift.

Prepare Data for Analysis in Heroku

Prepare Data for Analysis in Heroku

Some developers need to process data. Maybe you work in a small startup where people take on several roles, or maybe in an enterprise company where you are asked to...

Processing JSON Data on the Cloud

Processing JSON Data on the Cloud

We’ve processed plenty of JSON data in our blog - from Tweets to GitHub commits - but we’ve never really discussed how to process JSON with Xplenty’s data integration on...

Mining Dark Data without Hadoop

Mining Dark Data without Hadoop

Hadoop is definitely a great solution for processing dark data, but what if you don’t know how to use it? Hadoop requires you to buy new hardware, provide expert maintenance,...

How to get Website Visitor Geolocations from IPs

How to get Website Visitor Geolocations from IPs

Although the Internet made the world flat, geography still matters. Knowing which countries your users live in could provide business opportunities to localize your services and increase profits. The only...

Parsing AWS CloudTrail Log Files

Parsing AWS CloudTrail Log Files

Amazon’s CloudTrail is a service that logs AWS activity. However, these logs need some preparation before they can be analyzed. In this post, we’ll see how to parse these log...

Parsing User Agent Strings in Big Data

Parsing User Agent Strings in Big Data

Big Data brother is watching - whenever users surf your website, their browser sends an HTTP header called ‘User Agent’. It tells your web server which browser they’re using, in...

8 Data Integration Best Practices

8 Data Integration Best Practices

You’ve spent hours tinkering and preparing the perfect dataflow to batch process zillions of web logs. Feeling satisfied, you run the job on one of the clusters and leave your...

Using Regular Expressions in Big Data

Using Regular Expressions in Big Data

A regular expression, AKA regex, is a powerful yet really confusing tool. Although regular expressions are the technology behind text replacement and natural language processing, they are hard to read,...

Processing Unstructured Data 101

Processing Unstructured Data 101

Unstructured data is big - according to IDC, about 90 percent of the storage in the world is used for unstructured data. It comes as no surprise considering the amount...

Hive vs. HBase

Hive vs. HBase

Comparing Hive with HBase is like comparing Google with Facebook - although they compete over the same turf (our private information), they don’t provide the same functionality. But things can...

Amazon EMR vs. Xplenty

Amazon EMR vs. Xplenty

Amazon launched Elastic Map Reduce (EMR) to make Hadoop easier, but there were still too many Hadoop hoops to jump through before processing Big Data. That’s why we founded Xplenty....

Hadoop ETL with Apache Pig

Hadoop ETL with Apache Pig

What does it mean to be a pig? Well, according to the philosophers behind the Apache Pig project pigs eat anything, live anywhere, and are domestic animals. They even claim...

Hadoop Data Integration 101

Hadoop Data Integration 101

Last year Cloudera published a blog post on Big Data’s new use cases: transformation, active archive, and exploration. There’s one more use case that isn’t explicitly mentioned - data integration....

Use Data on the Cloud to Measure KPIs

Use Data on the Cloud to Measure KPIs

Huge amounts of data are needed to calculate key performance indicators (KPIs), a luxury that only large enterprises were able to afford. This post series discusses how companies of all...

12 SQL-on-Hadoop Tools

12 SQL-on-Hadoop Tools

An overview of 12 open source and commercial SQL-on-Hadoop tools: Apache Hive, Apache Sqoop, Apache Phoenix, Impala, Presto, BigSQL, CitusDB, Hadapt, Jethro, Lingual, and HAWQ.

OpenSSL Security Update

OpenSSL Security Update

On Monday April 7th an OpenSSL security update was released for the "Heartbleed" vulnerability. Here at Xplenty we've immediately installed the security update to protect ourselves from this vulnerability and...

8 SQL-on-Hadoop Challenges

8 SQL-on-Hadoop Challenges

Introducing Apache Hadoop to the organization can be difficult - everyone is trained and experienced in the old ways of SQL and all the analytics tools integrate with SQL. Certain...

Fear of a Hadoop Planet

Fear of a Hadoop Planet

Despite the Hadoop hype machine crunching away, not everyone is fond of that little yellow elephant. In fact, some fear it. But why should the cute mammal and the innovative...

Hadoop-as-a-Service vs. On-Premise...FINISH HIM

Hadoop-as-a-Service vs. On-Premise...FINISH HIM

Mortal Kombat’s master of ice Sub-Zero and the living-dead fire breathing Scorpion are major archenemies. As the story goes, Sub-Zero and his clan of assassin ninjas slaughtered their rival clan,...

Updates for Spring 2014

Updates for Spring 2014

Hey all you out there in Xplentyland. We haven't forgotten about you. In fact, we've been thinking about you since the last update regarding new features, because we've been adding...you...

Hadoop in the Streets of London

Hadoop in the Streets of London

Last week I packed my suitcase and got on a plane to London. The agenda - presenting at the February Hadoop Users Group UK meetup. The meetup was supposed to...

4 Tips on Collecting Streaming Data

4 Tips on Collecting Streaming Data

Readers of our blog should know by now that Apache Hadoop is great for offline batch processing of Big Data. But what about online streaming data? What if you’re running...

Hadoop vs. Redshift

Hadoop vs. Redshift

Childhood dreams do come true - in 2015 "Batman vs. Superman" will bring the world’s biggest superheroes to battle on-screen, finally solving that eternal debate who will prevail (I put...

Xplenty is Now Available on Rackspace

Xplenty is Now Available on Rackspace

Xplenty is excited to announce we now support Rackspace’s public cloud offering. Rackspace users can now use Xplenty’s Data Integration-as-a-Service to process their data. Xplenty provides you with out of...

5 Funnel Analysis Technical Challenges

5 Funnel Analysis Technical Challenges

Funnel analysis is awesome. Whether your company has a checkout, a registration, or any kind of process on a website or even in real life, funnel analysis lets you see...

7 Tips to Improve ETL Performance

7 Tips to Improve ETL Performance

Consider for a moment, if you will, plastic patio furniture. Plastic Fantastic is a global manufacturer with several factories, warehouses, and plenty of stores. One can only imagine the sheer...

End of 2013 Updates

End of 2013 Updates

We've decided to end the year with some platform upgrades for you. Just a few more tweaks to make Xplenty Hadoop-as-a-Service that much more functional.

Xplenty Provides Datetime Data Type

Xplenty Provides Datetime Data Type

We’ve decided to end the year with some platform upgrades for you. Just a few more tweaks to make Xplenty Hadoop-as-a-Service that much more functional.

Xplenty Winter 2013 Updates and Improvements

Xplenty Winter 2013 Updates and Improvements

Happy holidays everyone! We've been working hard to make the Xplenty Hadoop-as-a-Service platform better and more user friendly for you. We hope you're enjoying your experience with us, and these...

Xplenty Provides Usability Improvements

Xplenty Provides Usability Improvements

Happy holidays everyone! We’ve been working hard to make the Xplenty Hadoop-as-a-Service platform better and more user friendly for you. We hope you’re enjoying your experience with us, and these...

Why Santa needs Big Data

Why Santa needs Big Data

Now, Hadoop! Now, SQL! Now, NOSQL, and Opensource! On, Cloud technology! On, Apps! On, SaaS, and Infrastructure!

Day one in the books, on to day two!

Day one in the books, on to day two!

The first full day of re:Invent has come and gone, and it was a great day. Thousands of people from all areas of technology connecting and learning, and most importantly...

Xplenty partners with Hortonworks!

Xplenty partners with Hortonworks!

We're excited to partner with Hortonworks, the makers of the Hortonworks Data Platform. We believe Hortonworks plays a significant role in the Hadoop ecosystem and powering Xplenty with HDP provides...

Xplenty Autumn Updates

Xplenty Autumn Updates

Let us just get this out of the way right now - it’s been a while since we've written. But we have a good reason, honest we do. We've been...

Big Data Today

Big Data Today

Everyone is always looking for the next big thing.  Big Data isn’t the next big thing, it’s already the big thing. Anyone in an industry that collects mass amounts of...

Big Data and Your Body

Big Data and Your Body

A couple of weeks ago we posted about the datafication of everything, with a few examples of what can be quantified, and what we can do with that quantified data....

Cooking with Big Data in the Food Industry

Cooking with Big Data in the Food Industry

When you think of Big Data, the types of companies using it are probably technology based, perhaps in the finance sector, marketing, websites, and anyone else that needs to process...

Huge Data?

Huge Data?

I remember one time on a status update thread, a friend mentioned something about being bad at finding her way around Tel Aviv.  My response was that probably in the...

ETL - Is it Still Relevant?

ETL - Is it Still Relevant?

Buzz about Big Data has been at fever pitch for over a year now. We hear a lot about how the insights we glean will propel businesses, about emerging technologies, and...

Prepare for launch!

Prepare for launch!

Here, at Xplenty we’re gearing up for the release of our Hadoop-as-a-Service platform in a few days’ time. We’re very excited after working really hard in the past year to...

Xplenty Has Launched a Public Release

Xplenty Has Launched a Public Release

Xplenty is pleased to announce we are launching a public release. Now anyone can sign up, get a free 14 day trial period in which they can build packages and...

Under Lock and Key

Under Lock and Key

With your big data comes the need to secure it.  How  awful would it be  if your competition got a hold of your data and used it against you, to...

Welcome to Xplenty!

Welcome to Xplenty!

We're very happy to launch the Xplenty site and to share our vision - and of course our product - with you.