[REF] ICS User Experience Graphs (sentiment analysis)

bedalus · Apr 17, 2012

I've decided to pull this project as it doesn't seem useful, or accurate! Feel free to pm if you have any ideas for me.

Where did the other benchmarks go?

All ICS ROM Benchmarks: this thread

Battery Drain Benchmarks: this thread

Kernel Features and Benchmarks: this thread

CPU Governors and I/O Schedulers: this thread

Power Saving Governors: this thread

Does SuperCharging work?: this thread

bedalus · Apr 17, 2012

Methodology

I wrote a program in C++ with several routines:

1) copy all the thread's html all into one file
2) throw away all the html code
3) throw away anything from a quote
4) throw away any one letter words
5) throw away all punctuation except apostrophes/exclamation marks/question marks/periods/full-stops (also add a full-stop if it was missing at the end of a post)

Then the entire thread is uploaded in chunks of 400 words to an API http://www.alchemyapi.com/api/sentiment/

This returns a sentiment score for each 400 word chunk. It can be either positive or negative. Since most users are polite when they have a criticism, the scores tend to range from slightly negative to very positive.

[Q] What is 'sentiment analysis'?
[A] Find out more here: http://en.wikipedia.org/wiki/Sentiment_analysis
- - - quote "Computers can perform automated sentiment analysis of digital texts, using elements from machine learning such as latent semantic analysis, support vector machines, "bag of words" and Semantic Orientation — Pointwise Mutual Information..." sourced from the above link.

Each 400 word block gets a score usually ranging somewhere between slightly negative and very positive. Each score forms the basis of my raw data.

I currently show this data as several graphs, but I may strip away some. The most useful graph (remember, my background is mathematics education...) is a combination of the entire thread's average score plus the average for the most recent 10%. This helps to highlight threads that have a history of good sentiment as well as continuing good sentiment.

Notes

Initial attempt (for posterity): http://xdaforums.com/showpost.php?p=24451874&postcount=712

bedalus · Apr 17, 2012

I couldn't have done this without the support of a few people in particular: tchaari (who first informed me of this marvellous field of sentiment analysis); harbb for joking around with me (which got me thinking about doing it for real); and glennkaonang for feedback and moral support. Special thanks to original21 who inspired me to provide the most recent data separately.

Thanks to the XDA community in general, the developers, and a special shout out to others who have been supportive in my previous works: CyberGR, simms22, morfic, krarvind, wildestpixel, kong, Oodie, steve.garon, brainmaster, mathkid95, DaXmax, AndroidUser00110001, hope I didn't I forget anyone

Thanks to anyone past, present and future who has any constructive criticism, or just hits my thanks button! It keeps me going! Thanks to the moderators for keeping me in check... and to google/samsung for the toys

rrohanjs · Apr 17, 2012

another bedalus special treat in store

DaXmax · Apr 17, 2012

I have been thinking, you should actually join the Recognized Contributor as you contributed alot...

Sent from my Nexus S using Tapatalk 2 Beta-5

tchaari · Apr 17, 2012

DaXmax said:
I have been thinking, you should actually join the Recognized Contributor as you contributed alot...

Sent from my Nexus S using Tapatalk 2 Beta-5

+1

Nice to see that bedalus benchmarks are back

Very interesting work bedalus. The results in the spreadsheet are not always significant but it's a very good start that deserves many encouragements. I am thinking about if taking two words before and two words after each term can improve the readability of the results...
I have also another UX idea: coding some program that can evaluate (approximately) if a post is a positive feedback, negative feedback, a simple question or a simple answer. Then, a final average score is computed.

bedalus · Apr 17, 2012

DaXmax said:
I have been thinking, you should actually join the Recognized Contributor as you contributed alot...

Thanks! I think I'm the most pleased with this one because it made me learn C++. I'm just wrapping my head around custom structs.

I've applied for the RC status. Don't know what the criteria are particularly, but I think I've produced some useful stuff. If I get it, I might order an XDA t-shirt, then my wife will be really concerned...

bedalus · Apr 17, 2012

tchaari said:
+1

Nice to see that bedalus benchmarks are back
Very interesting work bedalus. The results in the spreadsheet are not always significant but it's a very good start that deserves many encouragements. I am thinking about if taking two words before and two words after each term can improve the readability of the results...

I'm one step ahead of you this time tchaari, the code for two words before and after is halfway there...

tchaari said:
I have also another UX idea: coding some program that can evaluate (approximately) if a post is a positive feedback, negative feedback, a simple question or a simple answer. Then, a final average score is computed.

Isn't that what your brain is for?

EDIT: I'm just adding the link you PM'd me so I can find it more easily: http://stackoverflow.com/questions/...-how-positive-or-negative-a-statement-text-is

bedalus · Apr 17, 2012

UPDATE: Found the bug in my program that caused it to crash if there was only one page (with 50 posts per page) i.e. any young thread with less than 51 posts would fit on one page, and my program will only download the total number of pages -1 (it saves the last page for a new start page for when I update a thread). Fixed... now trying to get all the other threads.

tchaari · Apr 17, 2012

bedalus said:
I'm one step ahead of you this time tchaari, the code for two words before and after is halfway there...

Not really surprised. You are the top benchmark specialist here

bedalus said:
Isn't that what your brain is for

Lol, that's true but we will all feel better if the machine can handle a little more computing and analysis from what our brains are cooking every day

You should really take a look on what's going on in the "natural language processing (NLP)" domain. If your program can be connected to some existing tools like [1] and [2], the results can be so interesting

[1] http://khassanali-nlp-research.blogspot.com/2008/01/nltk.html
[2] http://kmandcomputing.blogspot.com/2008/06/opinion-mining-with-rapidminer-quick.html
These are other refs on sentiment analysis and opinion mining from the NLP domain if someone is interested:
- http://en.wikipedia.org/wiki/Sentiment_analysis
- http://eprints.qut.edu.au/29301/1/c29301.pdf
Bedalus, you made us addicted to your benchmarks. If you close one more of your thread, I'll go on a hunger strike with Oodie

Oodie · Apr 18, 2012

Wow ! This looks Promising

Let's see wht this gives us & yeah ! It was boring in NS forums without you . lol .

SA-07 · Apr 18, 2012

Welcome back sir

Sent from my Nexus S using XDA

jojoost · Apr 18, 2012

Good to be back? At least I'm happy you are back!

Greetzz, jojoost.

Sent from my Nexus S using Tapatalk 2

bedalus · Apr 18, 2012

Thanks for the kind words everyone! It's nice to be working on a thread again.

I got my program more stable. I found a more up to date version of wget for windows here by a guy called Oliver Krystal. It had the drivers built in.

I've removed the my program in the second post until I can locate the source of the instability. Last night I ran it and it managed to download most of the ROM threads, but crashed halfway through a long thread.

I'm going to tidy up the code and try and break it up into more manageable routines (most of the work is done in one long procedure at the moment, not very good practice...

) Then perhaps it'll be easier to debug.

When it's more stable I'll re-upload it.

@tchaari, maybe I do need to include some scoring algorithm, otherwise it's not really a benchmark is it! haha

glennkaonang · Apr 18, 2012

Glad to see you back, bedalus.
This time you bring more headache to us having no experience at all in information technology with your testing methodology

Regarding improvements it needed, I think it's good enough already.
But the idea of that algorithm thing would make it more interesting, although it would bring more headache to me I guess

Sent from my Nexus S using xda premium

bedalus · Apr 18, 2012

I changed a couple of things in the build, now it runs without crashing, so I re-uploaded the program to the second post (along with the spreadsheet).

The included 0threads.txt file currently includes all the 4.0.4 ROMs that were on my ROMs spreadsheet (didn't bother with 4.0.3s).

Any new ROMs I don't know about?

Does anyone want me to stick in any kernel threads? Theme threads? Threads from other phones?

STILL TO DO:
-Modify the program to update by starting to downloading threads at the point I last read
-See if I can implement a five word phrase
-See if I can find or create any language analysis program to score the phrases

DaXmax · Apr 18, 2012

bedalus said:
Thanks! I think I'm the most pleased with this one because it made me learn C++. I'm just wrapping my head around custom structs.

I've applied for the RC status. Don't know what the criteria are particularly, but I think I've produced some useful stuff. If I get it, I might order an XDA t-shirt, then my wife will be really concerned...

Lol. You contributed alot for the Nexus S development. Im sure the Senior Mod, will nominate you, if not, i will do it....

bedalus · Apr 18, 2012

DaXmax said:
Lol. You contributed alot for the Nexus S development. Im sure the Senior Mod, will nominate you, if not, i will do it....

Cool! Thanks!

bedalus · Apr 18, 2012

UPDATE: All 4.0.4 ROMs (that I currently know of) now in the spreadsheet. Just working on the index.

Jamalsid · Apr 18, 2012

Another quality thread in the making I think!

Good to see you back I was getting worried when all your threads were closing.

[REF] ICS User Experience Graphs (sentiment analysis)

Which threads are the most useful to analyse?

ROM threads

Kernel threads

Theme/App threads

General threads

Threads for other devices

bedalus

Guest

bedalus

Guest

bedalus

Guest

Senior Member

Senior Member

Senior Member

bedalus

Guest

bedalus

Guest

bedalus

Guest

Senior Member

Inactive Recognized Developer

Inactive Recognized Developer

Senior Member

bedalus

Guest

Senior Member

bedalus

Guest

Senior Member

bedalus

Guest

bedalus

Guest

Senior Member

Similar threads

Top Liked Posts