How Google Mesa Works (Short Summary)
Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google's Internet advertising business.
5 posts
Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google's Internet advertising business.
Key design considerations for storing time series data, including access pattern analysis, windowed storage strategies, and the trade-offs between granularity and performance.
Lessons learned from migrating a large Hadoop cluster over a single weekend -- planning, data migration with distcp, code migration of 300+ Oozie jobs, and HBase migration.
Notes from the book SBT in Action covering Maven concepts, SBT basics, tasks, settings, plugins, and build definitions.
A comparison of the new HDFS centralized cache management feature in Hadoop 2.3 with Spark RDDs, and why Spark still held the edge for in-memory processing.