For all the yelling Frank and Estelle Costanza did on Seinfeld, the ever-present 1990s sitcom, they weren’t the angriest characters on the show, according to an algorithmic analysis from CompassRed’s Data Lab project.
Data scientist Daniel Larson had an algorithm scour through the 53,629 lines of text that make up the show’s 177 original episodes and sort them by sentiment (similar to this analysis of President Donald Trump’s tweets.)
The results from the analysis are broken down in a blogpost penned by Larson and Technical.ly Delaware alum Joey Davidson. A trio of interactive infographics lets users explore the data by character, match up season against season or see which of the main four characters had more lines through the years.
“The algorithm in this analysis moved through every spoken line in the show and scored the words used in a host of categories,” the post reads. “Each line might include words that the algorithm deemed indicative of fear, joy, trust, surprise or sadness. The line would also have an overall positive or negative emotional score.”
The project leveraged something called the NRC emotion lexicon (basically a list of words with associated emotions and sentiments) and an R package called tidytext to quantify the sentiment in each line of dialogue.
Yada, yada, yada: So who was the angrier character in the show’s nine-season run? None other than outrageous, egregious, preposterous attorney Jackie Chiles.
See the studyThere might be a business application for the lil experiment, the duo said.
“Think of a data science team that moves through a dataset like, say, tweets about a given business or product with a similar algorithm in hand,” Larson and Davidson wrote. “Gleaning customer emotion from the huge dataset they create by using social media on a massive scale would offer intangible value to the decisions and movements of business.”
Before you go...
To keep our site paywall-free, we’re launching a campaign to raise $25,000 by the end of the year. We believe information about entrepreneurs and tech should be accessible to everyone and your support helps make that happen, because journalism costs money.
Can we count on you? Your contribution to the Technical.ly Journalism Fund is tax-deductible.
Join our growing Slack community
Join 5,000 tech professionals and entrepreneurs in our community Slack today!