Show HN: See if a movie is happy or sad with sentiment analysis

etjossem · on May 5, 2017

One of the major problems with sentiment analysis is that it doesn't pick up on irony or subtext very well. Relying on word choice alone, one such analysis pinned "Fitter Happier" as one of the happiest Radiohead songs.

http://rcharlie.com/2017-02-16-fitteR-happieR/

Edit: sorry, I misremembered - it actually handled "True Love Waits" pretty well. It did pick up "Fitter Happier" as one of the happiest, that's what struck me as strange.

johnnydoe9 · on May 5, 2017

That says True Love Waits is the most depressing song though.. am I missing something?

>We have a winner! “True Love Waits” is officially the single most depressing Radiohead song to date. Rightly so, given that it tied for lowest valence (0.0378) and ranked fourth for highest percentage of sad words (24%). If the numbers still don’t convince you, just listen to it.

bberrry · on May 5, 2017

Everyone realizes there outliers difficult to get right. Point is you can get quite far with a relatively naive approach.

rullgrus · on May 5, 2017

Isn't "True Love Waits" detected as the most depressing? The happiest one is "15 steps".

btbuildem · on May 5, 2017

Interesting project, horrible presentation.

The colour gradient on the graph is confusing - does green mean happy, blue unhappy, and the movie changes tone over time? Axes of the graph are not labeled, what are we looking at? But, those concerns are secondary.

Bar graph seems a poor choice here, given the nature of the data. Given that there doesn't seem to be any correlation with time, the order of phrases doesn't seem important -- you could forgo the linear presentation, and display the distribution of the data instead.

For example, you could bucket the sentiment values (-3 to -2, -2 to -1, etc) and use a histogram to show the counts in each bucket. This would enable you to compare different movies (one histogram per movie).

eljs · on May 5, 2017

You're right about the graphs. The horizontal axis is the minutes of the movie though (on the second graph). Will get rid of the gradient and add labels!

fratlas · on May 6, 2017

I love the visualization! Is the website a template? What lib for the graphs?

vogt · on May 6, 2017

Just looks some Bootstrap and ChartJS.org.

aisofteng · on May 5, 2017

I'm always a little wary of the robustness of sentiment analysis; in my experience, if you take the time to check sentiment analysis results sentence by sentence, you will find a high error rate.

I haven't confirmed by looking at the source, but my suspicion is that either most sentiment analysis implementations are rule based or are not well tuned.

My go-to example is IBM Watson's sentiment analysis service rating "I hope you die" as very positive because the sentence is categorized as "hopeful". Which I suppose it is, technically, and perhaps this is a case where I'm expecting too much because recognizing this particular example as having a negative sentiment requires much human abstract reasoning and inference, but the example remains nonetheless because real-world language usage that isn't dry and technical is rife with these sorts of linguistic usages.

eljs · on May 5, 2017

OP here! I was watching the Stanford NLP classes a while back (https://www.youtube.com/playlist?list=PLiNErZ5Bus8qNxNsFZFkh...) and ended up trying the part about sentiment analysis on srt files. The way it works right is quite primitive but you can still see some "trends" on most movies. If there are any suggestions on how I could make this smarter I would love to hear them!

msapaydin · on May 5, 2017

Maybe you could use RNNs to do the sentiment analysis, either Recurrent or Recursive Neural Networks. Some sample code is available in the book "Tensorflow for machine intelligence" in github.

sawmurai · on May 5, 2017

Maybe if you get information about cut frequency you could identify parts that are more actionish / exciting than others.

sharemywin · on May 5, 2017

has anyone tried opensubtitles.org

just found it on google

curiousgal · on May 5, 2017

Try http://www.yifysubtitles.com/

vit05 · on May 5, 2017

Just sometimes, because is the only place that has subtitles for some old movies. But I personally prefer Subscene.com.

sharemywin · on May 5, 2017

never mind looks kinda scammy

icebraining · on May 5, 2017

Nah, it's fine, it tries to push you their downloader, but if you avoid the pitfalls the content is good. I've used it many times over the years.

gh1 · on May 5, 2017

I like it. What do you think of a recommendation engine which tries to recommend movies based on patterns of happiness and sadness?

snissn · on May 5, 2017

Why does the graph go from green to blue?

eljs · on May 5, 2017

Good question, I just thought it looked nice but maybe I shouldn't have

snissn · on May 5, 2017

Cool! I thought it was an indication of sentiment at first, I was like oh yeah la la land gets sad towards the end. Cool endeavor!

Curious if you investigated other libraries or algorithms. have you seen this - i have been playing with it, seems reliable: https://github.com/cjhutto/vaderSentiment

zoidb · on May 5, 2017

You should let people see what other people submitted although it's pretty easy by fiddling with the URL. I like seeing the word patterns but I really wonder how well it can detect true sentiment, not an easy thing to do. https://www.crealdo.com/story/movie-sentiment/movies/13

rodrigocoelho · on May 5, 2017

Also interesting: Character-to-Character Sentiment Analysis in Shakespeare’s Plays (PDF)

http://www.aclweb.org/anthology/P13-2085

AznHisoka · on May 5, 2017

What about a movie lile "Brazil"? would that be considered happy or sad?

eljs · on May 5, 2017

I don't know this movie but you can try it out by uploading the movies srt file.

cholantesh · on May 5, 2017

Brazil is a dystopian satire. I would wager that GP is asking about scenarios where there is significant ambiguity and/or situational/verbal irony.

eljs · on May 5, 2017

I just gave it a try: https://www.crealdo.com/story/movie-sentiment/movies/30 Never seen the movie so I can't judge myself.

blacksmith_tb · on May 5, 2017

That seems like a plausible analysis, the movie becomes darker as it proceeds, though there's a false happy ending (which in some cases of ham-fisted editing is the only ending - I saw it aired on network TV in the US this way) followed by a much more somber closing.

blackflame7000 · on May 5, 2017

It's broken. My sentiment of La La Land was nowhere near that cheerful lol. In all serious though, its pretty cool idea for data analytics.

huangc10 · on May 5, 2017

At the top, can you put a score or average of the analysis so we know if the overall movie is happy or sad? That number will be interesting :D

kingkawn · on May 5, 2017

Horrifying.

opticals · on May 5, 2017

Make it available in French!