Most Self-Described Savvy Internet Users Cannot Spot AI Bots on Social Media

Nearly half of participants in a structured experiment failed to correctly identify AI-generated social media bots more often than they misidentified real humans - a finding that challenges the widespread assumption that digital literacy provides meaningful protection against automated deception. The study, conducted by cybersecurity company Surfshark in collaboration with master's students at Malmö University, tested 710 participants on their ability to distinguish between AI bots and genuine human accounts. Only 53 percent cleared the basic threshold of correctly flagging bots more frequently than they wrongly flagged people.

What the Numbers Actually Reveal

The raw figure - 53 percent performing above the benchmark - sounds like a slim majority getting it right. But the framing matters. That number means 47 percent of participants, a group that presumably included many self-confident internet users, performed at or below a level that would be expected by chance or simple guesswork. In a controlled environment with no time pressure and no social stakes, nearly half still could not consistently tell automated accounts from human ones.

The significance deepens when you consider the population being tested. These were not random members of the general public with little exposure to online platforms. Participants were drawn from a university context, and many likely regarded themselves as digitally competent. That self-perception - a sense of being too informed to be fooled - is precisely what makes the results sobering. Overconfidence in one's ability to detect manipulation is itself a vulnerability.

Why AI Bots Have Become So Difficult to Detect

The credibility gap between human and bot behavior on social media has narrowed substantially as large language models have grown more capable. Early bots were easy to identify: repetitive phrasing, implausible posting schedules, no coherent personality across posts, accounts with minimal history. Current AI systems can generate contextually relevant responses, mimic conversational tone, reference trending topics in real time, and maintain the appearance of an ongoing, evolving online identity.

Profile construction has also grown more sophisticated. Bots can now be seeded with fabricated biographical details, profile images generated by AI, plausible follower networks, and posting histories that simulate organic engagement over time. On platforms where verification is limited and volume is high, the cognitive load required to scrutinize each account is simply too great for most users to sustain - even those who know to be skeptical.

There is also an asymmetry of effort. A single coordinated bot network can produce thousands of interactions per day across multiple platforms. The human trying to assess what is real must evaluate those interactions one by one, in real time, while also processing content, forming opinions, and engaging with others. The architecture of social media is not built to help users slow down and audit what they see.

The Broader Security and Policy Stakes

The inability to reliably detect AI bots is not merely a curiosity about human perception. It has direct consequences for how information spreads, how opinions form, and how platforms are used as tools of influence. Bot networks have been documented in the context of election interference, public health misinformation, and coordinated harassment campaigns. If even digitally literate users cannot spot automated accounts, the effectiveness of awareness campaigns and media literacy education as standalone defenses comes into serious question.

From a cybersecurity standpoint, bots on social media represent a social engineering vector as much as a technical one. They can be used to build false consensus around a position, to amplify fringe content until it appears mainstream, or to direct users toward phishing links embedded in seemingly authentic interactions. The threat is not abstract: it operates through the same trust mechanisms that make social platforms feel meaningful in the first place.

Regulatory responses have so far lagged the pace of the problem. Some jurisdictions have introduced disclosure requirements for automated accounts, but enforcement is inconsistent and definitions remain contested. Platform-level detection systems exist but face the same challenge that human users do - sophisticated bots are designed specifically to evade them. The Surfshark experiment adds to a growing body of evidence suggesting that technical and policy solutions will need to do the work that individual vigilance demonstrably cannot.

Rethinking Digital Literacy in an AI-Saturated Environment

The experiment raises a pointed question about what digital literacy is actually supposed to accomplish. Teaching users to look for warning signs - an unusual posting pattern, an account with few followers, generic profile language - was always a partial solution. Those heuristics were calibrated to older, cruder bots. Against current AI-generated accounts, they offer diminishing returns.

A more honest framing of digital literacy in this environment might start with acknowledging that individual identification is an unreliable defense. Users would be better served by understanding the structural conditions that make bots effective - platform incentives that reward engagement volume over authenticity, verification systems that remain optional or easily circumvented, and the speed at which content moves relative to the time required to assess it critically. Knowing that you probably cannot tell the difference, and adjusting your trust accordingly, may be more protective than believing you can.

Surfshark's experiment is a small window into a large and accelerating problem. The 710 participants who took part were not unusual in their failure. They were representative of what happens when human judgment encounters a system engineered specifically to defeat it.