At cafehound.com, we endeavor to locate the best coffee in the world. Over the last eight years we’ve happily watched as globally, the options available to the public have exponentially increased and the public’s general awareness of specialty coffee has deepened. Although we still believe that tracking down the best coffee in the world is central to our mission, we recently decided to dip our toes into the area of recommending specific coffee(s) to coffee lovers based on a mixture of qualitative and empirical analysis.
In two posts (1 and 2) from 2015, we took verbal reviews of specialty coffees from the site coffeereview.com, and we employed various clustering algorithms to discover groupings of coffee (based on words used to describe them and other factors). This served as our initial foray into using Data Science on expert coffee reviews to improve our understanding of specialty coffee.
Over the past month, we’ve set out to improve upon that original work in order to empower java lovers to discover the perfect brew. Our years of cupping coffee and talking with experts have shown that – after a certain point – what constitutes a “good cup of coffee” is subjective and specific to the palette of the beholder.
With that in mind, cafehound.com chose to use a large, multiyear list of coffee reviews from Kenneth David’s coffeereview.com site to explore the relationship between the descriptions used to rate coffee aroma, flavor, aftertaste, body, acidity and finish. We hypothesized that there are distinct groupings of coffee based on their roast profile, body, and flavors that are relevant to informing consumer preferences in the overall marketplace. To clarify, a market segmentation based on a representative sample of surveyed consumer preferences may be more useful to marketing professionals, but that is outside of the scope of this post. Instead, we’re using the structure inferred from math and reviews of specific coffees to estimate categories of the potential “coffee experience.” These categories may provide coffee consumers with guideposts for exploring new specialty coffees.
Our results led to six broad categories of coffee that we’ve ordered from lightest to darkest roast (based on average Agtron ratings). Agtron ratings are a numerical representation of the consistency of the roast color (lower numbers indicate a darker roast <45, higher numbers indicate a lighter roast 50+). More than the roast determines the flavor profile and overall body of the coffee, which is why some of these segments may appear similar.
Initially, we bring this content to you via occasionally updated web pages. Depending on demand, we may scale our service to provide daily or weekly recommendation updates.
For now, follow the link below to Find Your Coffee.
For code share: