I tried some Hegel, who scored 0.18. Clearly broken.
Schopenhauer got it right, where BlaBlaMeter gets it wrong:
"If I were to say that the so-called philosophy of this fellow Hegel is a colossal piece of mystification which will yet provide posterity with an inexhaustible theme for laughter at our times, that it is a pseudo-philosophy paralyzing all mental powers, stifling all real thinking, and, by the most outrageous misuse of language, putting in its place the hollowest, most senseless, thoughtless, and, as is confirmed by its success, most stupefying verbiage, I should be quite right. Further, if I were to say that this summus philosophus ... scribbled nonsense quite unlike any mortal before him, so that whoever could read his most eulogized work, the so-called Phenomenology of the Mind, without feeling as if he were in a madhouse, would qualify as an inmate for Bedlam, I should be no less right."
I copied that text into the meter, got the .26 you saw. I then listened in to a conversation at the cube next to mine and wrote that in. "Using software as a service we can replicate and evergreen in the cloud." One simple sentence jumped it to a .35.
YES. Someone needs to make a voice-to-text app that has a "bullshit meter" needle, and that beeps rapidly (a la PKE meter) when a threshold is exceeded.
You're just bitter that that era of philosopher in general used an artificially complicated writing style, complete with invented terminology, that borders bullshit :)
> Bullshit Index :0.11 Your text shows only a few indications of 'bullshit'-English.
definitely broken
Or perhaps it's a feature and it can pick out philosophical / artistic 'bullshit' from PR 'bullshit'.
Quote for people not familiar with 'Finnegan's Wake'
> And aroud the lawn the rann it rann and this is the rann that
Hosty made. Spoken. Boyles and Cahills, Skerretts and Pritchards,
viersefied and piersified may the treeth we tale of live in stoney.
Here line the refrains of. Some vote him Vike, some mote him
Mike, some dub him Llyn and Phin while others hail him Lug
Bug Dan Lop, Lex, Lax, Gunne or Guinn. Some apt him Arth,
some bapt him Barth, Coll, Noll, Soll, Will, Weel, Wall but I
parse him Persse O'Reilly else he's called no name at all. To-
gether. Arrah, leave it to Hosty, frosty Hosty, leave it to Hosty
for he's the mann to rhyme the rann, the rann, the rann, the king
of all ranns. Have you here? (Some ha) Have we where? (Some
hant) Have you hered? (Others do) Have we whered (Others dont)
It's cumming, it's brumming! The clip, the clop! (All cla) Glass
crash. The(klikkaklakkaklaskaklopatzklatschabattacreppycrotty-
graddaghsemmihsammihnouithappluddyappladdypkonpkot!)
Try the following text, which I pulled from a brand management web site:
We help businesses increase profitability by helping develop a strategic market approach and communicate the right message.
Our approach is designed for those serious and committed to looking inward so they can connect outward for greater results in an expedient manner.
We analyze your business goals, assess your marketing needs to accomplish them, create a strategic plan and manage its implementation, leaving owners and managers to tend to other needs of their business.
Excerpt is cited as fair use, namely the educational value in evaluating a tool with three sentences of buzzword heavy english from a real site. Specific citation info omitted only so they won't be embarrassed, but may be found with Google.
The main reason I posted was in the hope of triggering discussion over the method used to analyse the text.
I've seen users getting a lower score simply by separating out a block of text into numbered paragraphs which would seem to point to quite a simplistic method.
If I were to make the software, the corpus of PR, licenses, etc. would be the way I go. But "they did it statistically" doesn't answer the question "what is the model?" There are many different statistical models one could use. My other post has a few things we've figured out.
But I'm starting to think a rule-based lexicon isn't out of the question, given these >1 scores on some texts.
I tried the german version of it and pasted a SAP article, which got a score of 1.34. They mostly talk bullshit, but the article wasn't bad to understand imho, so I think the algorithm needs some work.
The text was:
"Möchten Sie SAP-Software vor Ort installieren oder über die Cloud darauf zugreifen? Wir bieten in jedem Fall umfassende Services, zugeschnitten auf Ihre individuellen Anforderungen. Wir verfügen über eines der größten Expertenteams weltweit. Unsere qualifizierten Mitarbeiter beraten Sie gern bei der Konzeptionierung, Implementierung und Optimierung Ihrer Systemlandschaft. Profitieren Sie innerhalb kürzester Zeit von Ihrer SAP-Lösung."
I just pasted some text from a random TechCrunch article relating to Spotify and got this:
Your text: 847 characters, 145 words
Bullshit Index :0.56
Something's fishy. Obviously you want to sell something, or you're trying to impress somebody. Are you sure that you have a real message, and if so: who would understand it?
Lincoln (Gettysburg): 0.09
Lincoln (Second Inaugural): 0.09
MLK (I Have A Dream): 0.08 (Lowest of any Political text I tried)
Someone else said Churchill got a 0.08, but I didn't actually try the unaltered text for myself.
Here's a few things I've gleamed from experimenting with it:
- It uses a unigram language model. You can take the same text, randomly permute the words, and you get the same score. This means it also can't be using things like POS tagging, phrases, etc.
- It normalizes words by making all letters lowercase. The exact same text in all upper case has the same score.
- The score is eventually normalized by the length of the text. The same text copied multiple times gets the same score.
- It does not form a valid probability distribution, as someone's managed to get some 1.16's. This makes me believe it's not a Naive Bayes classifier giving you the P(Bullshit|Text). Though this is what I originally thought it would be.
It is that simple. Looks like they assign a BS level to words, and then take some sort of average bullshit level amongst all words. Word order doesn't matter as slashcom says.
For example, if you take the score from the Oracle Pricing blurb posted by BitMistro, and change 'strategies' to 'goals', you drop down to 0.8 or so. If you add an extra random 'strategy' somewhere, it bumps up to 1.4 or something.
I actually suspect a bug on their pair for strategies... probably a decimal error when building to BS level tables.
But similar things happen with other 'bullshity' words, just to a lesser degree.
It should also be noted that on about 400 short texts (~300 words each), it did not correlate with the Flesh-Kinaid readability measure at all. So it's not measuring something like average word length or syllable counts.
But it's not QUITE a true lexicon, as it handles Out-Of-Vocabulary words quite strangely. If you use as input text:
"PR-Experts, politicians, ad writers or scientists need to be strong here!
BlaBlaMeter unmasks without mercy how much bullshit hides in any text.
A useful tool for everyone involved in writing!
Simply copy your text into the white field and check your writing style. It works with english text up to 15.000 characters (overhead will be cut off). For a meaningful result we recommend a minimum length of 5 sentences."
Then you get 0.16. If you replace the last word 'sentences' with 'strategy' you go up to 0.44. However, if you change the last word to 'sentstrategyences' you get 0.47. Try it: you can basically insert 'strategy' inside ANY word and really raise your score. Actually, if you just insert "strateg" anywhere inside the text, it goes up massively.
So I actually think it's just doing string search counts over a lexicon.
"Politics are great, come buy our new, brand spanking awesome banana phone, apple, steve jobs, cripplingly epic banana phone. Just great phones, with bananas, no apples to be found here. Samsung can suck on our banana phone. Android is better than iOS."
"Your text: 251 characters, 43 words
Bullshit Index :0.03
Your text shows no or marginal indications of 'bullshit'-English."
You wrote a lot of bullshit but your text looks pretty normal from a vocabulary standard. looking a bit more to the website you'll see that what it calls bullshit english is that pattern often used in scientific articles or law texts (and president speeches) where they seem to be saying a lot of really wow stuff but all you're really left in the end as a big "?" cause you couldn't get half of what the person said.
I'd guess so.
Or maybe they compiled a corpus of bullshit text (scientific articles, PR and political texts), created a little statistical model and are using that to check the level of bullshit of your text.
The actual content parts hover around 0.16 (although I only tried segments without any equations). It makes me wonder what sort of results you'd get if you plotted this BS metric vs. page in a full length book.
1. "You probably want to sell something, or you're trying to
impress somebody. It still may be an acceptable result
for a scientific text."
I tried that too, along with Paul Ryan's speech (which got 0.14 - Your text shows only a few indications of 'bullshit'). Then, I tried the copy on my landing page and got a 0.25. It's shameful to realize that I'm a bigger bullshitter than two politicians.
If you brazenly lie using strong phrases without any weasel words, it'll come across as not being 'bullshit' grammatically. I don't think the software has a fact checker.
TechCrunch: 0.42 ("Amazon Wants Everyone To Know The Kindle Fire Is Sold Out")
Paul Ryan's RNC speech: 0.14
NYTimes: 0.11 ("Storm’s Winds Slow as It Exits Southern Louisiana")
Obviously its an apples and oranges comparison here but interesting (to me anyway) nonetheless
(which is the first writing of his that I ever read) appears to reach the maximum length (by character count) that the program will evaluate, and comes out like this:
"Your text: 15000 characters, 2698 words
Bullshit Index :0.09
Your text shows no or marginal indications of 'bullshit'-English."
Maybe Paul's procedure of having friends look over his essays and give suggestions helps cut out the bullshit.
> Bullshit Index: 0.31 – Your text shows indications of 'bullshit'-English. It's still ok for PR or advertising purposes, but more critical audiences may be skeptical.
> Bullshit Index: 0.4 – Something's getting a bit fishy. You probably want to sell something, or you're trying to impress somebody. It still may be an acceptable result for a scientific text.
The result is that non-German speakers are standing in the dark, wondering what methodology is behind that software.
It might be even difficult to translate the concept to English, as some terms and ideas like http://de.wikipedia.org/wiki/Nominalstil are missing in the (restricted) English language. German is so much more elaborated, its a language of philosophers ;-)
High-quality journalistic texts are usually in the range of 0.1 and 0.3
* Are there any texts with an indexof 0.0?
This is rare, but occurs. Too low of an index is rather suspicious though, and might also indicate stilistical deficits.
* What is the highest index value?
Thile the BlaBla Meter was designed for index values between 0 and 1, but in rare cases the scale can be exceeded. The highest measured values so far have been over 2.0 even.
* Does Google use a similar algorithm?
We don't know, but if we were Google, we'd probably try! Our samples for highly competitive search terms show that highly ranked sites often show good index values.
* How does BlaBla Meter work?
BlaBla meter checks texts for various linguistic traits, e.g. it checks for exceeding use of Nominalstil[1]. In addition, the text is checked for various phrases [buzzwords] with a certain weighting. We don't want to reveal the secrets though ;)
[1] Heavy use of "nominal style" is a German particularity. You can replace most verbs by nouns, adding suffixes like '-ion' or '-age', and transform the sentence into passive. This is often found in legal texts, where the author doesn't want to appear as a suject.
* Why does my scientific text have such a high index?
Nominalstil has crept into scientific language at large. This often serves to 'beat around the bush' - in this regard, there are analogies to typical PR tongue.
* My mindless text gets a good score, why?
BlaBla Meter can't really understand the topic, it looks after linguistic traits. While rainbow press can be attributed all kinds of substantial weaknesses, their language is usually short and concise.
* My text is wrongly devalued!
As every computer algorithm, BlaBla Meter might be at fault - in doubt, human beings must decide of course. Generally the hit rate is quite large, so you should usually tweak the wording upon receiving high indices.
* But humans read texts completely differently ...
When BlaBla Meter crys havoc, one can usually assume that a human reader is alerted too. People are trained every day to tell authentic messages from artifical advertising messages - it'd be naive to belive they couldn't!
* What happens with entered texts?
Texts are private matter of course, and are neither saved nor processed in any way or shape.
Yeah, personally I also score around .16 to .18 on my internet texts. Long papers get around 0.2, so I'm probably losing momentum after the fourth page :-/
"We use innovative cloud-based patent-pending technologies to leverage synergies between verticals. In order to facilitate the maximum return on investment, we hyperfocus on execution and delivery. We find our clients often fail to think outside the box, but with our proprietary technologies we help to identify action items that directly impact the bottom line."
Bullshit Index: 0.46
Something's getting a bit fishy. You probably want to sell something, or you're trying to impress somebody. It still may be an acceptable result for a scientific text.
Pretty good figuring it's under the 5 sentence recommendation.
I tried past cover letters for jobs and they all scored in the 0.5-0.6 range. This makes sense since I AM trying to sell myself in an honest but tailored way. Does this reveal a flaw in my writing style or in the purpose of cover letters in the hiring process? Is it even possible for a cover letter to score low on a bullshit scale?
Just tried to same with a recent cover letter, scoring 0.44. I wouldn't worry, although you (and me) can always try to bring it to a more down-to-earth level, making text easier and more pleasant to read.
Unfortunately, a job application isn't the right domain for some scientific analysis on correlation between bullshit-measurements and interview invitations, if you're really interested in the job :-)
Hilarious! I plugged in some paragraphs from press releases and it was spot on. I also appreciated the "or Scientist" part ... simply awesome!
"Bullshit Index :0.64
This reeks. I bet you're a PR-Expert, Politician, Consultant or Scientist. If there is a message, it's unlikely it will reach anyone."
Heh .. try putting in the first para from a scientific paper. I bet 10 karma points that this will have higher BS content than the rest. Similarly, the last para of the intro is also almost always content free.
Note: I'm a scientist too ... good to poke fun at ourselves, eh? :)
Joel on Software's "Software Inventory"[1] gets a 0.22 rating. I'm a big fan of Joel on Software and am curious as to how this model evaluates "bullshit".
I just tried some text from one of our corporate announcements. Seems accurate.
0.8
This reeks. We bet you're a PR-Expert, Politician, Consultant or Scientist. If there is a message, it's unlikely it will reach anyone. Maybe you should spend less effort on trying to impress somebody.
4. Small-Business tax breaks - [major bullshit] 0.5 [a]
1. Space Race - [PR level bullshit] 0.33
7. Money in Politics - [Some bullshit] 0.28
2. Internet Freedom - [some bullshit] 0.23
10. Work/Life Balance - [little bullshit] 0.16
9. Jobs / recent grads - [little bullshit] 0.16
6. Most Difficult Decision/afghanistan - [No Bullshit] 0.07
[Un-analyzable]
3. Favorite Basketball Player - too short
8. White House Beer Recipe - too short
5. First Activity on Nov 7 - too short
[a]"Are you sure that you have a real message, and if so:
who would understand it?"
I tried this on Paul Ryan's RNC address from last night, Barack Obama's innaugarel address, and a recent IEEE TKDE journal article. It gave bullshit measures of 0.15, 0.13 and 0.25, respectively. :-(
>Bullshit Index :0.26 Your text shows some indications of 'bullshit'
some indications ... I'd say this thing is broken.