Share:

Google confirms criminal hackers used AI to discover serious software flaw

What was once considered a hypothetical scenario, almost science fiction within cybersecurity, just became reality. For years, researchers debated whether artificial intelligence models could go beyond creative and analytical tasks to act as offensive tools in digital attacks. The answer arrived in a surprisingly concrete way, without warning and with all the evidence documented by one of the largest security teams on the planet.

In a report released this Monday, Google revealed that a criminal group used artificial intelligence to identify an unknown flaw in a widely used open-source web administration tool. The process did not involve manual trial and error, nor weeks of human analysis poring over lines of code. It was an AI model operating systematically, scanning possibilities and identifying entry points that would have gone unnoticed by any conventional analyst working within a normal timeframe.

It was not a coincidence, it was not luck — it was artificial intelligence actively working to find gaps that humans might have taken months to discover. And the detail that stands out the most: this is the first confirmed case in which the exploitation of a zero-day vulnerability was primarily conducted with the help of an AI model. For digital security experts, this moment carries enormous weight. It is not just another incident in the long history of cyberattacks around the world. It is a turning point — the kind of event that splits the timeline into before and after.

As John Hultquist, chief analyst at the Google Threat Intelligence Group, put it quite bluntly: this is just the tip of the iceberg.

What exactly happened in this attack

According to the report published by Google, the zero-day vulnerability was detected by the Google Threat Intelligence Group in recent months. The hacker group — identified by Google only as prominent cybercrime actors — exploited the flaw through a script written in the Python programming language. The breach would have allowed the attackers to bypass two-factor authentication on a web-based system administration tool that is widely used and open-source.

However, for the attack to succeed, the hackers would also need access to valid credentials, such as usernames and passwords. In other words, the vulnerability served as an additional layer of exploitation, not a completely autonomous entry point. Still, the potential for damage was significant, considering that web administration tools typically have privileged access to entire systems.

Google chose not to reveal which administration tool was affected, but reported that they notified the software developer quickly enough for a fix to be applied before the attack caused any real damage. The company also did not publicly identify the responsible hacker group or reveal which AI platform was used, but made a point of stating that it does not believe its own Gemini chatbot was involved.

What truly makes this case unique is how Google reached the conclusion that AI was involved. The report states with a high degree of confidence that the malicious actor likely used an AI model to support both the discovery and the weaponization of the vulnerability. Among the clues that led to this conclusion were unusual elements in the code, such as overly detailed explanatory text and other oddities that a human programmer would have no reason to include. These are traces that Rob Joyce, former cybersecurity director at the National Security Agency, described as the closest thing to a fingerprint at the crime scene.

Receive the best innovation content in your email.

All the news, tips, trends, and resources you're looking for, delivered to your inbox.

By subscribing to the newsletter, you agree to receive communications from Método Viral. We are committed to always protecting and respecting your privacy.

Joyce, who reviewed the findings before publication, emphasized that it is normally difficult to distinguish whether code was written by a human or a machine, since AI-generated code does not announce itself. But in this specific case, the evidence presented by Google was considered quite convincing.

The broader context: AI as a cyber weapon

This incident does not exist in a vacuum. It connects to a series of recent events that are reshaping the landscape of global cybersecurity.

Late last year, Anthropic revealed that hackers sponsored by the Chinese government used its technology in an attempt to infiltrate computer systems at approximately 30 companies and government agencies around the world. That episode marked the first reported case of a cyberattack in which AI collected sensitive information with limited human involvement. Now, with the case revealed by Google, the scenario advances one more step — from passive intelligence gathering to the active discovery of vulnerabilities.

There is also the case of Mythos, Anthropic’s AI model announced in April of this year, which raised concerns to unprecedented levels. According to Anthropic, Mythos identified thousands of zero-day vulnerabilities across all major operating systems and all major web browsers, including many flaws that had existed for decades without being detected. The destructive potential of this capability was so alarming that Anthropic chose to share the model only with a limited number of companies and government agencies in the United States and the United Kingdom.

These events, combined, are pressuring governments and the tech industry to urgently reassess how — and whether — they should police the most advanced versions of AI. The Trump administration in the United States has been reviewing proposals that include a formal government review process for new models before they are made available to the public, as reported by The New York Times. The idea of controlled releases of the latest models, allowing experts to identify and fix issues before they fall into the wrong hands, is gaining increasing traction in the international debate. 🔐

Why this changes everything in cybersecurity

Cybersecurity has always been a two-sided contest: on one side, researchers and companies trying to find and fix flaws before someone exploits them; on the other, malicious groups trying to discover those same flaws before they get patched. This precarious balance was built over decades based on a silent assumption — that both attackers and defenders were operating with more or less equivalent human resources.

With AI entering this equation, that assumption collapses. The digital battlefield now has an asymmetry we do not yet know how to balance.

To put it in perspective, zero-day vulnerabilities were considered so rare and powerful that they could be worth millions of dollars on the underground markets used to trade hacking tools. Finding one required months of specialized work, highly skilled teams, and a fair amount of luck. Now, with AI models capable of analyzing millions of lines of code in a fraction of the time any human team would need, the scarcity of these vulnerabilities may cease to exist — and with it, the entire security model that depended on that rarity.

And the most concerning part: this type of technology is not restricted to governments or large criminal organizations. With the advancement and democratization of artificial intelligence, it is only a matter of time before smaller groups also gain access to similar capabilities, making the threat landscape even more fragmented and unpredictable.

On the defensive side, the most logical response would be to use the same technology to strengthen security — and that is already happening in labs around the world. But there is a structural problem in this race: while attackers only need to find one gap to succeed, defenders need to close all of them. The asymmetry is not just technological, it is also strategic. Cybersecurity teams now need to think not only about how to protect systems, but about how to anticipate what an adversarial AI would be capable of finding before they do.

The evidence linking the attack to artificial intelligence

One of the most fascinating aspects of this case is the forensic methodology Google employed to attribute the attack to AI. As Rob Joyce pointed out, AI-generated code does not carry a stamp or signature saying it was made by a machine. So how did Google reach that conclusion?

The report points to anomalies in the code that would be highly unusual in human work. Excessively detailed explanatory comments, formatting patterns that make no sense from an experienced developer’s perspective, and certain structural choices that carry the hallmark of generative language models — all of this created a picture that, according to Joyce, is the closest we have ever come to a fingerprint at the crime scene when it comes to AI-generated code.

Hultquist confirmed that Google possesses additional indicators supporting the conclusion but chose not to disclose them publicly. This stance makes strategic sense: revealing all detection techniques would allow future attackers to adjust their methods to avoid identification, turning transparency into a tactical disadvantage.

This question of attribution — knowing who is behind an attack and what tools were used — is one of the biggest challenges in modern cybersecurity. With AI entering the equation on both the offensive and defensive sides, this challenge becomes exponentially more complex.

The other side of the coin: AI as a defense ally

Despite the concerning scenario, it is not all bad news. Some experts believe that, in the long run, artificial intelligence will strengthen cybersecurity in significant ways. The logic is relatively straightforward: if AI can find vulnerabilities with unprecedented efficiency, it can also help write flawless code from the start.

Hultquist himself expressed this view with cautious optimism. According to him, the most advanced AI models will make it possible to build the most secure code humanity has ever produced. That would represent an absolute win for cybersecurity.

But — and here is the problem — that future is still far off. The immediate challenge is dealing with all the code that already exists in the world, written by imperfect human hands over decades. There are billions of lines in systems that support critical infrastructure, financial platforms, communication networks, and much more. Rewriting everything from scratch is not feasible. Auditing everything with AI will take time. And meanwhile, the attackers are not going to wait.

Tools we use daily

The digital arms race has gained a new chapter, and it is being written in real time. On one side, defense teams trying to use AI to scan and fix vulnerabilities at scale. On the other, criminal groups and even governments using the same technology to find those vulnerabilities first. Whoever gets there first determines the outcome. 🤖

What lies ahead in the AI threat landscape

John Hultquist, when commenting on the case publicly, was categorical in saying that this is only the beginning. The expression tip of the iceberg is not rhetoric — it reflects a very concrete technical concern. If one group has already managed to use AI to discover and exploit a zero-day in a widely used tool, what stops other groups from doing the same with even more critical systems?

Energy infrastructure, hospital networks, financial systems, and government communications are all potential targets, and all of them depend on software with the potential to contain vulnerabilities still unknown. The scale of the problem is hard to overstate.

The cybersecurity community is digesting this event with a mix of urgency and clarity. Many experts are already pointing out that the coming months will be decisive in defining how governments, tech companies, and security teams respond to this new paradigm. Some countries are already discussing specific regulations for the use of AI in offensive security contexts, while organizations like Google, Microsoft, and various independent research groups are accelerating the development of AI-based defensive tools.

Google’s discovery also strengthens the arguments in favor of controlled releases of the most advanced AI models. The idea is that, before a new model is made widely available, security experts should have the opportunity to evaluate its offensive capabilities and develop countermeasures. Anthropic’s Mythos model already followed this path by being shared only with selected entities. The question now is whether this approach will be adopted more broadly by the industry — or whether competitive market pressure will push companies to release increasingly powerful models without proper safeguards.

For anyone following the world of technology and artificial intelligence, this event serves as a powerful reminder that advances in AI carry with them both incredible possibilities and real risks that need to be taken seriously. The same type of model that helps doctors diagnose diseases, that accelerates scientific research, and that improves digital products around the world can also be aimed at finding gaps in critical systems with an efficiency no human being can replicate.

Understanding this duality is not pessimism — it is the first step toward building a more secure digital future that is aware of the challenges ahead.

Picture of Rafael

Rafael

Operations

I transform internal processes into delivery machines — ensuring that every Viral Method client receives premium service and real results.

Fill out the form and our team will contact you within 24 hours.

Related publications

Amazon's stock could rise following OpenAI partnership.

Amazon and OpenAI partnership could boost AI revenue and stock value, says Citi; strategic impact on AWS and infrastructure race.

Moratorium on AI Data Centers: Energy in Debate

Sanders and AOC propose moratorium on AI datacenter construction in the US to assess environmental and energy impacts.

Blockchain and AI Agents Are Changing Crypto Payments

AI agents power crypto payments with blockchain, stablecoins and x402, enabling autonomous transactions, micropayments and machine-to-machine economy

Receba o melhor conteúdo de inovação em seu e-mail

Todas as notícias, dicas, tendências e recursos que você procura entregues na sua caixa de entrada.

Ao assinar a newsletter, você concorda em receber comunicações da Método Viral. A gente se compromete a sempre proteger e respeitar sua privacidade.

Rafael

Online

Atendimento

Website Pricing Calculator

Find out how much the ideal website for your business costs

Website Pages

How many pages do you need?

Drag to select from 1 to 20 pages

In just 2 minutes, automatically find out how much a custom website for your business costs

More than 0+ companies have already calculated their quote

Fale com um consultor

Preencha o formulário e nossa equipe entrará em contato.