Peter Dikant
Managing distributed Solr Servers
We use the open-source search server Solr for real-time search on data stored in a Hadoop cluster. For our terabyte-scale dataset, we had to...
Realtime Search for Hadoop
In the previous part of this article series we focused on the efficient storage of log data in Hadoop. We described how to store...
Storing log messages in Hadoop
In part 1 of this article series we described the various challenges of dealing with large amounts of logging data in a heavily distributed...
See You, SQL – Hello Hadoop
Our team has developed a system for storing and processing huge amounts of log data using Hadoop. The challenge was to handle Gigabytes of...