Hadoop Best Practice – Set Up Your Cluster
Following on from the cluster checklist in the previous article, this post shows how we initially dimensioned and set up our cluster, providing a Hadoop stack in five main steps. Before we have a look...
View ArticleMonitoring Web Apps with Cucumber
I like cucumbers – delicious and almost no sugar and fat. But, hey, as a system operator I benefit from cucumber even at work. There are two aspects to keep in mind while thinking about how to monitor...
View ArticleCucumber goes Hadoop
In addition to my last post monitoring web apps with cucumber I’ll provide some more examples of how to use and extend the basic cucumber features. The installation of cucumber[-nagios] and creation of...
View ArticleHow to monitor HBase health with Nagios
Monitoring the availability of services and/or servers should be a basic request to any server environment providing various services. In addition to that, there’s often the need to display more...
View ArticleHadoop training by Cloudera
Last week I attended an admin training about Hadoop, held by Cloudera in a comfortable and well prepared location in London. This 3-day course covers several topics of the Hadoop ecosystem, all within...
View ArticleCase Study: Retail WiFi Log-file Analysis with Hadoop and Impala, Part 2
Following on from Jean-Pierre’s introduction to this experiment in part 1, I will now expand on the technical details of the data ingestion process using Flume. As you can see in figure 2 from the...
View ArticleReview of Berlin Buzzwords 2013
BerlinBuzzwords rocks ! It was a great two-day conference focussed on Search/Scalability/Big Data in the KulturBrauerei in Prenzlauer Berg – an awesome location. Ariel Waldman opened the conference...
View ArticleGeo-based tweet analysis to alert emergency services
Introduction The constantly rising number of Twitter tweets includes a massive amount of data – perfectly suited for analysis using algorithms and techniques of the Big Data and Machine Learning...
View ArticleHow to install Cloudera Manager and Cloudera Search with support from Ansible
The Cloudera Manager is a great tool to orchestrate your CDH based Hadoop cluster. You can use it from cluster installation, deploying configurations, restarting daemons to monitoring each cluster...
View ArticleDo you want to become a Data Analyst? YMC extends training portfolio
In addition to the Administrator for Apache Hadoop training, YMC now offers the Cloudera Data Analyst training course. This 3-day hands-on course is for anyone who wants to manage, manipulate, and...
View Article
More Pages to Explore .....