Quantcast
Channel: YMC » Gerd König
Browsing latest articles
Browse All 10 View Live

Hadoop Best Practice – Set Up Your Cluster

Following on from the cluster checklist in the previous article, this post shows how we initially dimensioned and set up our cluster, providing a Hadoop stack in five main steps. Before we have a look...

View Article



Monitoring Web Apps with Cucumber

I like cucumbers – delicious and almost no sugar and fat. But, hey, as a system operator I benefit from cucumber even at work. There are two aspects to keep in mind while thinking about how to monitor...

View Article

Cucumber goes Hadoop

In addition to my last post monitoring web apps with cucumber I’ll provide some more examples of how to use and extend the basic cucumber features. The installation of cucumber[-nagios] and creation of...

View Article

How to monitor HBase health with Nagios

Monitoring the availability of services and/or servers should be a basic request to any server environment providing various services. In addition to that, there’s often the need to display more...

View Article

Hadoop training by Cloudera

Last week I attended an admin training about Hadoop, held by Cloudera in a comfortable and well prepared location in London. This 3-day course covers several topics of the Hadoop ecosystem, all within...

View Article


Case Study: Retail WiFi Log-file Analysis with Hadoop and Impala, Part 2

Following on from Jean-Pierre’s introduction to this experiment in part 1, I will now expand on the technical details of the data ingestion process using Flume. As you can see in figure 2 from the...

View Article

Review of Berlin Buzzwords 2013

BerlinBuzzwords rocks ! It was a great two-day conference focussed on Search/Scalability/Big Data in the KulturBrauerei in Prenzlauer Berg – an awesome location. Ariel Waldman opened the conference...

View Article

Geo-based tweet analysis to alert emergency services

Introduction The constantly rising number of Twitter tweets includes a massive amount of data – perfectly suited for analysis using algorithms and techniques of the Big Data and Machine Learning...

View Article


How to install Cloudera Manager and Cloudera Search with support from Ansible

The Cloudera Manager is a great tool to orchestrate your CDH based Hadoop cluster. You can use it from cluster installation, deploying configurations, restarting daemons to monitoring each cluster...

View Article


Do you want to become a Data Analyst? YMC extends training portfolio

In addition to the Administrator for Apache Hadoop training, YMC now offers the Cloudera Data Analyst training course. This 3-day hands-on course is for anyone who wants to manage, manipulate, and...

View Article
Browsing latest articles
Browse All 10 View Live




Latest Images