Welcome to Xplenty's Blog

All things data

Xplenty has made it to StartUp Big 50 list! Thank you, StartUp50.com for including us

Today, we have great news to share: Xplenty was included in StartUp50.com’s Annual StartUp Big 50 List. To earn our well-deserved spot in the Big 50, we had to battle our way through three rounds of tough competition, with over 200 applicants vying for 50 spots.

Cloud vs. On-Prem Data Warehouse

Data warehouses (DWHs) are widely recognized as essential components of business intelligence and analytics operations. But the question of whether the optimal deployment route is in the cloud or on-premises remains hotly debated.

Like so many things, the truth is that there is no one-size-fits-all solution. Every business is different, and there are advantages and disadvantages to both approaches. On the one hand, the cloud offers scalability and a low entry cost. On the other, there’s the security and flexibility that only an on-prem solution can offer.

Confessions of a Data Scientist

Igor Elbert, Principal Data Scientist at Gilt.com, shares with Xplenty his thoughts on recent data projects, their challenges, and his approach to ETL.

Recap: Xplenty Data Panel → Big Data's 'Janitor' Problem - Is it Killing ROI?

May was a crazy month here at Xplenty. We started our event tour at Collision in Las Vegas. Then we headed over to NYC for the Data Summit, where our CEO, Yaniv Mor, had two speaking slots, both packed to capacity!

Cloud Data Integration - Myth vs. Reality

So you’re on the cloud or plan to move there very soon.

Top Big Data Sessions for Data Summit 2015

If you want to know what the big players are doing with Big Data, then the Data Summit is the place to be. This Big Data conference, which will take place at the New York Hilton Midtown on May 11-13, will be attended by giants like Amazon, eBay, and even Pfizer and Mastercard. Dozens of engaging workshops will take place during the conference, so here are our picks for the top 10 sessions you can’t miss at Data Summit 2015.

11 Recommended Sessions for Collision 2015

What do U2’s lead singer Bono, pro skater Tony Hawk, and Netflix founder Reed Hastings have in common? Collision. Not a literal crash, but the huge tech conference that will take place in downtown Las Vegas May 5-6, 2015. Collision is related to Europe’s Web Summit and aims to bring all kinds of technology professionals together. Since there will be over 500 speakers at this year’s conference, you can’t go to everything. To help you get the most out of the conference, here are our favorite picks.

5 Platforms for Collecting Big Data

Everything comes as a service these days, and so does collecting Big Data. Various platforms on the web are happy to take data collection off your coding hands, making it easy for you to collect data from various sources in one location. Some call this a data hub. The following five platforms will help meet your ever-increasing data collection needs.

5 Real-time Streaming Platforms for Big Data

There are quite a few real-time platforms out there. A lot of them are newcomers, and the differences between them aren’t clear at all. The least we can do is present the options for you to choose from, so here are five real-time streaming platforms for Big Data.

Amazon Redshift Review 2015

Happy birthday to Redshift! Amazon’s data warehouse-as-a-service has just celebrated two years of data querying. Several reviews were written about Redshift at the time, but as far as we know, no one has looked back to check on what’s happening with the red giant since then. So, we went ahead and did a little checkup. Here’s our up-to-date Redshift review.

Spark, Impala, Tez and Hive: Interview with David Gruzman

Big Data consultant David Gruzman answered some of our burning questions about which Big Data platform to use, whether streaming is a must or not, and what the biggest issues with the cloud are.

6 Lessons for Big Data Startups: Xplenty's 2014

Last year, we raised $3 million, got featured on TechCrunch, found new customers, hired more employees, attended conferences around the globe, spent thousands of hours on R&D, and invested a lot more effort in sales and marketing. Not everything was perfect, though. Now that 2014 is over and we have gained some perspective, here are six lessons that we learned as a Big Data startup.

4 Ways to Process Small Data with Hadoop

One of the greatest Big Data myths is that you need terabytes or even petabytes of data before you can use Hadoop. However, there are plenty of advantages to using Hadoop for small data. The only question that’s left is “how”.

Spark Talk: Interview with Big Data Specialist Uzy Hadad

Is Apache Spark as hot as you think it is? Although it shines so bright across the Big Data galaxy that some folks think that it may have killed the MapReduce star, Spark is still in its teenage years and has yet to fully mature as a platform. To find out the current state of Spark, we talked to Big Data specialist Uzy Hadad, founder of Inroid.

Top 5 Big Data Events

Every year, dozens of Big Data conferences take place all over the world, from San Francisco to Shanghai. Now that 2015 is finally here, it’s time to open up your smartphone calendar and mark in this year’s Big Data conferences. Here are our five favorite events.

Xplenty and Segment: Processing Your Customer Behavior Data

With Segment and Xplenty, you can sort, filter and aggregate your customer behavior data without writing any code, or installing or maintaining anything.

Xplenty’s Data Processing Survey

We want to know more about the Big Data community: what causes them headaches? What makes them happy? Which tools and technologies do they use? When we had our booth at AWS re:Invent 2014, we met as many people as possible and talked about their data needs. To get an even better picture, we conducted a little survey on the side.

How to Collect, Transform, and Visualize Your Data on the Cloud

Data collection, data processing, data visualization, and reporting are becoming more and more challenging as data volumes grow and data formats change. Given this and the increased use of public cloud infrastructure, developers and data professionals need to find new strategies and adopt new technologies to design and maintain scalable and cost-effective data architectures.

Top 3 Updates from AWS re:Invent 2014

Amazon had some great announcements at AWS re:Invent 2014 in Vegas, and we were there to hear them. Most were geared toward enterprises and developers, but three stood out for us more than the rest.

Our Top 5 Picks for AWS re:Invent Sessions on Big Data

Next week we’ll be heading down to Vegas for AWS re:Invent 2014. We’re eager to meet the Big Data community and attend AWS sessions. There are quite a few of them—227 to be exact—so here are our top five recommendations.

GitHub, You Got Issues: An Analysis of Issues on GitHub in 2013

Everybody has issues, and so do users and repositories on GitHub. That's why we decided to answer this year’s GitHub Data Challenge by heading where developers fear to tread and analyzing GitHub issues in 2013.