Hackers Exploit Meta AI Assistant to Hijack Prominent Instagram Accounts

How a conversational loophole in Meta's AI assistant bypassed two-factor authentication, exposing the dangers of automated customer support.

June 2, 2026

Hackers Exploit Meta AI Assistant to Hijack Prominent Instagram Accounts
A major security crisis has engulfed Meta after a severe vulnerability in its artificial intelligence support assistant allowed hackers to hijack high-profile Instagram accounts through simple conversational manipulation. By asking the automated support chatbot to change the registered email address associated with target accounts, malicious actors managed to bypass two-factor authentication systems and gain full administrative control. The breach targeted prominent public figures, government-affiliated accounts, and major global brands[1][2]. Although Meta rapidly deployed a hotfix once security researchers exposed the flaw[2][3], the incident has raised urgent questions about the safety of delegating sensitive administrative actions to autonomous generative AI agents[4][5].
The fallout from the exploit became highly visible as a diverse array of prominent Instagram pages suddenly fell under hacker control[2]. Among the most notable targets was the official White House account established during the Barack Obama administration, which began posting unusual content[1][3]. Other victims included the personal account of the Chief Master Sergeant of the United States Space Force, John Bentivegna, and the official corporate handle of multinational cosmetics retailer Sephora[1][2]. Alongside these celebrity and institutional pages, hackers aggressively targeted highly coveted "OG" handles, which are short, sought-after usernames consisting of only a few letters or common words[1]. Because these rare handles fetch six-figure sums on underground digital forums, cybersecurity investigators tracked a flurry of immediate resale activity on messaging platforms like Telegram[1][2]. According to intelligence reports from prominent dark web researchers, the combined market value of just two compromised handles exceeded one million dollars, turning the technical vulnerability into a highly lucrative cybercrime campaign within a matter of hours[1].
What has deeply alarmed cybersecurity professionals is the sheer simplicity of the exploit, which required virtually no traditional coding or network intrusion skills[4][6]. To execute the hijacking, attackers activated a virtual private network to spoof their geographic location, aligning themselves with the approximate region of the target account holder in order to evade initial automated fraud detection systems[1][2]. Once positioned, the attacker initiated a standard password reset sequence, which opened a direct conversational interface with the globally deployed Meta AI Support Assistant[1][3]. Instead of navigating a complex identity verification process, the attacker simply instructed the chatbot to update the account's registered email address to a new inbox under the attacker's control, promising to provide the subsequent confirmation code immediately[1][4]. The AI chatbot obligingly sent an eight-digit verification code to the attacker's newly provided email[1][2]. Once the attacker entered this code back into the chat window, the system generated a direct password reset link, allowing the bad actors to overwrite the existing password and lock the legitimate owner out of their profile[1][3]. Furthermore, when the chatbot did attempt to trigger automated biometric checks, hackers bypassed the hurdle by feeding the victim's publicly available Instagram photographs into advanced AI video generators, creating realistic selfie videos that easily fooled Meta's automated identity verification[1].
Security analysts have classified the exploit as a textbook example of a "confused deputy" vulnerability, an architectural flaw where an internal system utility holds significantly higher privileges than the user interacting with it[1][6]. In this scenario, Meta's AI support chatbot was granted direct administrative access to critical database functions, such as changing primary contact details and initiating password overrides, without a mandatory, deterministic human-in-the-loop checkpoint[6][7]. Because the AI lacked a rigid authentication boundary to verify the user's identity before executing these high-privilege commands, it became a proxy attacker, carrying out the hijacking on behalf of the malicious actor[6][5]. The incident highlights a fundamental flaw in contemporary AI system design: the tendency of developers to prioritize convenience and natural language processing over hard access controls[8][5]. By allowing natural language prompts to override core system security rules, the automated system effectively rendered multi-factor authentication and traditional login defenses entirely useless[1][6].
This structural failure has also drawn intense scrutiny toward Meta's broader operational strategy, particularly its aggressive push to automate essential trust, safety, and support systems[2][7]. Earlier in the year, the social media giant integrated the AI support assistant across Facebook and Instagram, advertising its ability to perform automated account recovery and resolve technical disputes directly[4][2]. However, industry observers have pointed out a troubling correlation between this push for automated customer support and recent massive corporate restructuring, which included laying off thousands of employees from integrity and cybersecurity divisions[7]. When large technology firms automate highly sensitive security functions while simultaneously downsizing the human engineering teams tasked with monitoring edge-case anomalies, the risk of catastrophic system manipulation increases exponentially[7]. Deprived of sufficient human oversight, the AI support tool operated as an unmonitored gatekeeper, leaving locked-out users with absolutely no pathway to escalate their hijacked accounts to a human customer service representative[4][7].
The Meta Instagram breach represents a critical turning point for the artificial intelligence industry, exposing the profound dangers of trusting large language models with administrative authority[2][7]. While Meta has confirmed that it patched this specific conversational loophole and is working to restore compromised profiles, reports continue to circulate on security channels that alternative variants of the exploit are still being traded on the dark web[1][2]. For the tech sector at large, this incident serves as a stark warning that AI cannot yet serve as a reliable substitute for robust, deterministic security protocols[2][5]. As organizations rush to integrate autonomous agents into their customer pipelines, they must establish immutable boundaries between conversational interfaces and backend database privileges[6][5]. Until AI systems can distinguish between legitimate user commands and sophisticated social engineering, human oversight and rigid physical security keys must remain the ultimate backstops of digital defense[6][5].

Sources
Share this article