Benford’s law in our data

A few months back our team came across an interesting statistical phenomenon known as Benford’s law. The law is an observation about the frequency distribution of leading significant digits in real-life sets of numerical data: small leading digits occur far more often than large ones, with 1 appearing as the first digit in roughly 30% of values.
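To make the statement concrete: under Benford’s law, the expected share of values whose leading digit is d (for d from 1 to 9) is log10(1 + 1/d). The snippet below is a minimal sketch of how the expected and observed shares could be computed. The function names are ours, chosen for illustration, and the powers of two are only a stand-in data set that is known to follow the law closely; it is not our production data.

```python
import math
from collections import Counter


def benford_expected():
    """Expected share of each leading digit 1..9 under Benford's law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def leading_digit(x):
    """First significant digit of a non-zero number.

    Assumes x fits in a float. Scientific notation always puts the
    first significant digit in the first character of the string.
    """
    x = abs(float(x))
    if x == 0:
        raise ValueError("zero has no leading significant digit")
    return int(f"{x:.16e}"[0])


def leading_digit_shares(values):
    """Observed share of each leading digit 1..9, ignoring zeros."""
    counts = Counter(leading_digit(v) for v in values if v != 0)
    total = sum(counts.values())
    return {d: counts.get(d, 0) / total for d in range(1, 10)}


if __name__ == "__main__":
    # Illustrative data only: the first 1000 powers of two.
    sample = [2 ** n for n in range(1000)]
    expected = benford_expected()
    observed = leading_digit_shares(sample)
    for d in range(1, 10):
        print(f"{d}: expected {expected[d]:.3f}, observed {observed[d]:.3f}")
```

Comparing the two columns digit by digit is usually enough for a first look; a formal goodness-of-fit test can follow if the shares look suspicious.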

Data Engineering at BravoSystems: Main Concepts

We serve approximately 1.2 million users daily, who between them produce 120 to 220 million events. Our cluster currently holds a bit over 870 TB. Our biggest event log generates around 120 GB of data daily (without replication).