Spotting the Fake Reviews: Deceptive Opinion Spam

October 09, 2017

Computer Science

Linguistic, Behavioral and Temporal Signals Provide Clues

Thinking about trying that new restaurant that opened just down the street? Eyeing that new phone that costs $1,000?

From picking a restaurant to buying the latest gadget to choosing a new doctor, people make a large number of their decisions based on reading reviews. Sometimes these reviews are an accurate reflection of what you’ll get. Other times, these reviews fall short.

Consumer Economy Driven by Opinion

“The consumer economy is driven by opinions,” said Arjun Mukherjee, assistant professor of computer science at the University of Houston, whose research is focused on detecting deceptive opinion spam, or fake reviews. “Veracity of opinions is of paramount importance.”

Spotting deceptive opinions, in the absence of contextual information about the reviewer’s background, can be quite tricky, a point that Mukherjee makes to his students every semester. One experiment that he often repeats is showing a group of students a pair of reviews, one that is real, one that is fake.

When asked which one is fake, their accuracy inevitably hovers around 50 percent, little better than sheer guesswork. Even after years of working in the field, Mukherjee’s success rate at spotting an individual fake review isn’t much better either.

“Just by looking at a review, it is very hard to tell which one is a fake,” Mukherjee said.

Filtering Fake Reviews Dependent on Contextual Clues

Mukherjee’s research group constructs models that use contextual signals to spot fake reviews.

“There are a lot of signals you have to take into account,” Mukherjee said.

These patterns are a little bit like a poker player’s tells: subtle signals that can help differentiate truth from deception but are never fully foolproof. With spotting fake reviews, this comes down to analyzing linguistic, behavioral and temporal signals.

Use of Language and Posting Patterns to Spot Deception

For example, if one reviewer posts a whole bunch of reviews all at once or a group of people review the same products in a short time frame, that’s a sign of deceptive opinions. Another signal, known as ‘buffering,’ would be if there is an influx of positive reviews when the overall approval rate of a product is dropping, a strategy often used by review spammers to maintain a product’s rating.
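As a rough illustration of the first temporal signal, the sketch below flags reviewers who post many reviews inside a short window. It is not the research group's model; the function name `bursty_reviewers`, the 24-hour window and the threshold of five reviews are all assumptions chosen for the example.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def bursty_reviewers(reviews, window=timedelta(hours=24), min_reviews=5):
    """Return the set of reviewer ids with at least `min_reviews` reviews
    posted inside any span of length `window`.

    `reviews` is an iterable of (reviewer_id, posted_at) pairs.
    The default window and threshold are illustrative assumptions.
    """
    by_reviewer = defaultdict(list)
    for reviewer_id, posted_at in reviews:
        by_reviewer[reviewer_id].append(posted_at)

    flagged = set()
    for reviewer_id, times in by_reviewer.items():
        times.sort()
        start = 0
        # Sliding window over the sorted timestamps.
        for end in range(len(times)):
            while times[end] - times[start] > window:
                start += 1
            if end - start + 1 >= min_reviews:
                flagged.add(reviewer_id)
                break
    return flagged
```

A burst on its own proves nothing about intent. In a sketch like this, the flag would only be one input among several.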

Another signal would be behavioral. If reviews deviate from the norm, giving positive feedback when the majority are negative, or if an author uses duplicated content or only gives extreme ratings, these are also indications of potentially deceptive opinions.
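The behavioral signals described above can be turned into simple per-reviewer numbers. The following is a minimal sketch under stated assumptions: a 1-to-5 star scale, exact-match duplicate detection after normalizing case and whitespace, and a hypothetical helper named `behavioral_features`. None of these details come from Mukherjee's models.

```python
from statistics import mean


def behavioral_features(reviewer_reviews, product_mean_rating):
    """Compute three illustrative behavioral signals for one reviewer.

    reviewer_reviews: non-empty list of (product_id, rating, text) tuples.
    product_mean_rating: dict mapping product_id to its average rating.
    Assumes a 1-to-5 star scale (an assumption for this example).
    """
    # How far, on average, the reviewer's ratings sit from the product norm.
    deviation = mean(
        abs(rating - product_mean_rating[product_id])
        for product_id, rating, _ in reviewer_reviews
    )
    # Share of ratings at either end of the scale.
    extreme = sum(1 for _, rating, _ in reviewer_reviews if rating in (1, 5))
    extreme_ratio = extreme / len(reviewer_reviews)
    # Share of reviews whose normalized text repeats an earlier one.
    texts = [" ".join(text.lower().split()) for _, _, text in reviewer_reviews]
    duplicate_ratio = 1 - len(set(texts)) / len(texts)
    return {
        "rating_deviation": deviation,
        "extreme_ratio": extreme_ratio,
        "duplicate_ratio": duplicate_ratio,
    }
```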

Another clue is language. If a review’s descriptions are generic, without offering many specific details, that’s yet another sign of deception.

With Mukherjee’s models, all of these signals are taken into account, to spot patterns that suggest deceptive opinions.

“It’s a holistic model,” Mukherjee said.
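One simple way to picture a holistic model is a score that combines several signals at once. The sketch below uses a logistic function over a weighted sum. The signal names, weights and bias are hypothetical placeholders rather than values from the research, and the actual models may be structured quite differently.

```python
import math

# Hypothetical weights, chosen only for illustration.
HYPOTHETICAL_WEIGHTS = {
    "burstiness": 1.5,
    "rating_deviation": 1.0,
    "duplicate_ratio": 2.0,
    "genericness": 0.8,
}


def suspicion_score(signals, weights=HYPOTHETICAL_WEIGHTS, bias=-2.0):
    """Map signal values (each roughly in [0, 1]) to a score in (0, 1).

    Missing signals default to 0. A higher score means more suspicious,
    not proven fake.
    """
    z = bias + sum(w * signals.get(name, 0.0) for name, w in weights.items())
    return 1.0 / (1.0 + math.exp(-z))
```

Because the output is a score rather than a verdict, whoever uses it still has to pick a threshold, and any threshold will misclassify some reviews.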

Suspicious Patterns Do Not Always Indicate Deception

But as with any problem this complex, there are no guarantees, even though the analysis can be sophisticated enough to filter out fake reviews that seem legitimate at first glance. There will always be genuine reviews that get flagged as fake, and there will always be fake reviews that escape detection.

“All suspicious patterns may not indicate deception,” Mukherjee said. “After all, we are individuals, we have different personalities. How one person evaluates a particular entity might look fishy or suspicious to me, but might not look that way to you.”

This research is funded by the National Science Foundation. Results from this research have been presented at the Association for the Advancement of Artificial Intelligence Conference on Web and Social Media, the Association for Computing Machinery World Wide Web Conference, the Institute of Electrical and Electronics Engineers’ International Conference on Data Mining, as well as the International Conference on Intelligent Text Processing and Computational Linguistics.

- Rachel Fairbank, College of Natural Sciences and Mathematics
