Software Development

Who’s the angriest character on ‘Seinfeld’?

Data scientist Daniel Larson had an algorithm scour all 53,629 lines of dialogue on the '90s sitcom and sort them by sentiment. The results might surprise you.

Tom's Restaurant in Morningside Heights, New York, where many of the show's scenes take place. (Photo by Flickr user Steve Harris, used under a Creative Commons license)

For all the yelling Frank and Estelle Costanza did on Seinfeld, the ever-present 1990s sitcom, they weren’t the angriest characters on the show, according to an algorithmic analysis from CompassRed’s Data Lab project.

Data scientist Daniel Larson had an algorithm scour through the 53,629 lines of text that make up the show’s 177 original episodes and sort them by sentiment (similar to this analysis of President Donald Trump’s tweets.)

The results from the analysis are broken down in a blogpost penned by Larson and Technical.ly Delaware alum Joey Davidson. A trio of interactive infographics lets users explore the data by character, match up season against season or see which of the main four characters had more lines through the years.

“The algorithm in this analysis moved through every spoken line in the show and scored the words used in a host of categories,” the post reads. “Each line might include words that the algorithm deemed indicative of fear, joy, trust, surprise or sadness. The line would also have an overall positive or negative emotional score.”

The project leveraged something called the NRC emotion lexicon (basically a list of words with associated emotions and sentiments) and an R package called tidytext to quantify the sentiment in each line of dialogue.

Yada, yada, yada: So who was the angrier character in the show’s nine-season run? None other than outrageous, egregious, preposterous attorney Jackie Chiles.

(Image via Giphy)

(Image via Giphy)

See the study

There might be a business application for the lil experiment, the duo said.

“Think of a data science team that moves through a dataset like, say, tweets about a given business or product with a similar algorithm in hand,” Larson and Davidson wrote. “Gleaning customer emotion from the huge dataset they create by using social media on a massive scale would offer intangible value to the decisions and movements of business.”

Companies: CompassRed
34% to our goal! $25,000

Before you go...

To keep our site paywall-free, we’re launching a campaign to raise $25,000 by the end of the year. We believe information about entrepreneurs and tech should be accessible to everyone and your support helps make that happen, because journalism costs money.

Can we count on you? Your contribution to the Technical.ly Journalism Fund is tax-deductible.

Donate Today
Engagement

Join our growing Slack community

Join 5,000 tech professionals and entrepreneurs in our community Slack today!

Trending

The looming TikTok ban doesn’t strike financial fear into the hearts of creators — it’s community they’re worried about

Influencers are news distributors now: Inside Technical.ly’s Creator in Residence Program

Unlocking the US healthcare market: What global startups need to know

These fulltime VR creators show Horizon Worlds isn't just for kids

Technically Media