Grier Johnson worked as a platform engineer and site reliability engineer at LinkedIn where he helped build out their metrics collection and alerting systems. Some key lessons he learned include: (1) don't reinvent the wheel and leverage existing open source solutions for metrics stores and alerting, (2) plan for redundancy and distribute stores close to data sources, and (3) make metrics discovery and customizing alerts and graphs easy for users.