We’ve uploaded a brand new screencast which shows how to process web server logs with Xplenty.
Welcome to Xplenty's Blog
All things data
Although the Internet made the world flat, geography still matters. Knowing which countries your users live in could reveal business opportunities to localize your services and increase profits. The only question is how in the world to do it.
Big Data brother is watching: whenever users surf your website, their browser sends an HTTP header called ‘User-Agent’. It tells your web server which browser they’re using, which version, and on which operating system. The user agent string is logged by the web server and can later be analyzed to find out, for example, how many users still visit your website with old IE versions and whether you should keep supporting them.
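To make the idea concrete, here is a minimal Python sketch of counting old IE visits from raw log lines. It is an illustration under assumptions, not how Xplenty processes logs: it assumes the common Apache/Nginx "combined" log format, where the user agent is the last quoted field, and the helper names (`ie_major_version`, `count_old_ie`) and the "old means IE 8 or lower" threshold are made up for this example. IE 11 dropped the `MSIE` token from its user agent, so this sketch does not detect it.

```python
import re
from collections import Counter

# Assumption: "combined" log format, where the user agent is the
# last double-quoted field on the line.
UA_FIELD = re.compile(r'"([^"]*)"\s*$')
# Classic IE user agents (up to IE 10) contain a token like "MSIE 8.0".
MSIE_VERSION = re.compile(r'MSIE (\d+)\.')


def ie_major_version(log_line):
    """Return the IE major version found in a log line, or None."""
    ua_match = re.search(UA_FIELD, log_line)
    if ua_match is None:
        return None
    version_match = MSIE_VERSION.search(ua_match.group(1))
    return int(version_match.group(1)) if version_match else None


def count_old_ie(log_lines, max_old_version=8):
    """Count hits per IE major version at or below max_old_version."""
    counts = Counter()
    for line in log_lines:
        version = ie_major_version(line)
        if version is not None and version <= max_old_version:
            counts[version] += 1
    return counts


# Hypothetical sample lines, written for this example only.
SAMPLE_LINES = [
    '10.0.0.1 - - [10/Oct/2013:13:55:36 -0700] "GET / HTTP/1.1" 200 2326 '
    '"-" "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.0)"',
    '10.0.0.2 - - [10/Oct/2013:13:55:37 -0700] "GET / HTTP/1.1" 200 2326 '
    '"-" "Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 '
    '(KHTML, like Gecko) Chrome/30.0.1599.101 Safari/537.36"',
    '10.0.0.3 - - [10/Oct/2013:13:55:38 -0700] "GET / HTTP/1.1" 200 2326 '
    '"-" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0)"',
    '10.0.0.4 - - [10/Oct/2013:13:55:39 -0700] "GET / HTTP/1.1" 200 2326 '
    '"-" "Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.2; Trident/6.0)"',
]

if __name__ == "__main__":
    for version, hits in sorted(count_old_ie(SAMPLE_LINES).items()):
        print(f"IE {version}: {hits} hit(s)")
```

On a real site the same counting would run over millions of lines, which is where a batch platform earns its keep; the regular expressions stay the same.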
Papertrail is a cloud-hosted log management service. We use it internally for business insights and eat our own dog food by processing the logs on Xplenty, our data integration platform in the cloud. Let's see how easily you, too, can integrate logs collected by Papertrail with Xplenty for analysis and aggregation in the cloud.
Readers of our blog should know by now that Apache Hadoop is great for offline batch processing of Big Data. But what about online streaming data? What if you’re running a ticker for the stock exchange or a real-time analytics dashboard? You might think that collecting streaming data is only relevant for big enterprises, but you don’t have to be The New York Stock Exchange to collect real-time data. Before you jump into the stream, here are 4 tips to get you started.