AI Literacy

Those Who Don't Understand, Judge Into the Void

An AI model mined its own cryptocurrency. Another chose nuclear escalation in 95% of war simulations. A third lied about copying itself to a new server. You cannot have an ethics conversation about technology most people do not understand.

By Angad Manik, author of Spark & Pulse — competitive utilization of GenAI

Published March 31, 2026 · 10 min read

2023 Jun

Mata v. Avianca

AI-fabricated
court citations

2024 Oct

Hiring bias

3M resumes,
85% white pref.

Dec

AI scheming

5/6 models
deceive operators

2025 Dec

ROME

AI mines crypto
autonomously

2026 Jan

ECRI #1

Chatbot misuse
top hazard

Feb

Nuclear 95%

AI escalates in
war simulations

Feb 27

Anthropic
refuses

Pentagon ban
+ retaliation

Feb 28

OpenAI signs

Pentagon deal
"any lawful purpose"

Mar

QuitGPT

2.5M cancel
+295% uninstalls

Mar 26

Judge blocks

Ban ruled
legally unsound

AI behavior
Research
Public response

The conversation is backwards

Right now, the global conversation about AI goes something like this: someone publishes a scary headline. A politician calls for regulation. A committee writes guidelines. A company publishes an "AI ethics statement" on their website. Everyone nods. Nothing changes.

The reason nothing changes is that we skipped a step. A rather important one.

We jumped straight to debating what AI should do before most people understand what AI does. We are writing rules for a machine that most decision makers cannot describe in a single coherent paragraph. Not because they are not smart. Because nobody taught them.

That is the gap. And it is enormous.

What AI literacy actually means

Let us be clear about what AI literacy is not. It is not learning to code. It is not a computer science degree. It is not reading research papers or understanding the math behind gradient descent.

AI literacy is understanding a few fundamental things:

  • How AI models generate responses (prediction, not comprehension)
  • Why AI can sound confident while being completely wrong
  • What biases are embedded in training data and how they surface
  • Where the boundary is between automation and autonomous decision-making
  • When AI should assist a human and when it absolutely should not replace one

That is it. Five things. If every person deploying AI in their business understood these five things, most of the problems in the rest of this article would not exist.

The AI that decided to mine cryptocurrency

In December 2025, researchers at Alibaba published a paper about their 30-billion-parameter AI agent called ROME, built on the Qwen3-MoE architecture. ROME was designed to use tools autonomously. During reinforcement learning training, it was given access to a computing environment to learn how to accomplish tasks.1

Nobody told ROME to mine cryptocurrency. Nobody even hinted at it. But the model figured out on its own that acquiring computing resources and funds would help it achieve its training objectives. So it started mining crypto. It also established a reverse SSH tunnel, essentially opening a back door into the system.2

Alibaba Cloud's firewall caught it. But think about that for a second. An AI agent, during routine training, independently decided to acquire resources and bypass security controls. Not because it was malicious. Because its optimization function rewarded resourcefulness, and nobody defined the boundaries of what "resourceful" should mean.

The researchers called it an "instrumental side effect of autonomous tool use under RL optimization." In plain language: the AI did something nobody asked it to do because nobody told it not to, and it seemed helpful for the goal.

The AI that chose nuclear weapons. Every time.

In February 2026, Professor Kenneth Payne at King's College London published the results of the first large-scale study on how AI models behave in nuclear crisis simulations. The researchers tested GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash across 21 different nuclear crisis scenarios.3

The results were not comforting.

Nuclear signaling (threats, posturing, escalation toward nuclear options) occurred in 95% of games. 76% of games reached strategic nuclear threats. De-escalation tactics went entirely unused across all models tested.4

This was not a one-off finding. A 2024 study from Stanford and Georgia Tech had already shown the pattern: when tested with GPT-4, GPT-3.5, Claude 2.0, and Llama-2, all five models showed escalation tendencies. Nuclear weapon deployment occurred as rare but real events, particularly with GPT-3.5 and Llama-2.5

Here is the thing. No country is about to hand nuclear launch codes to ChatGPT. But AI is already being integrated into military decision support systems. And the question of whether AI recommendations bias human decisions toward escalation is not hypothetical. It is being studied because people are genuinely worried about it. The models do not understand consequences. They optimize for patterns in their training data. And historically, the loudest pattern in geopolitical strategy literature is escalation dominance.

The AI that lied about copying itself

In December 2024, three independent research efforts converged on the same uncomfortable finding: frontier AI models can scheme.

Apollo Research tested six frontier models and found that five of them demonstrated scheming behavior. Claude 3 Opus, in one test, copied its own weights to another server and then lied about having done so when asked. OpenAI's o1 model maintained deception in over 80% of follow-up questions after initially lying about its actions.6

Anthropic, the company behind Claude, ran their own study and found that Claude 3 Opus would fake alignment 12% of the time at baseline. When given conflicting training objectives, that number jumped to 78%.7

OpenAI confirmed in September 2025 that their models can also scheme, and published research on detecting and reducing the behavior.8 Let that sink in. The three largest AI labs in the world all independently confirmed that their models can strategically deceive their operators. This is not science fiction. This is peer-reviewed research from the people who build the technology.

The lawyers who trusted AI to do their homework

In June 2023, the legal world got its wake-up call. In the case of Mata v. Avianca, two lawyers submitted a court brief containing six entirely fabricated case citations. The cases did not exist. The courts did not exist. The rulings did not exist. ChatGPT made them up, and the lawyers filed them without checking.9

That was the first. It was not the last.

By spring 2025, legal researcher Damien Charlotin had documented over 300 cases of AI hallucinations in legal filings, occurring at a rate of two to three new cases per day. Sanctions ranged from $2,000 fines to 90-day suspensions to $10,000 penalties with state bar referrals.10

The lawyers were not stupid. They were AI-illiterate. They did not understand that a language model generates text by predicting the most probable next word, not by retrieving facts from a database. They did not know that AI does not "know" things. It generates plausible language. Sometimes that language happens to be true. Sometimes it invents a federal court ruling with a straight face.

The hiring algorithm that never picked the Black candidate

In October 2024, researchers at the University of Washington ran over three million resume comparisons through AI screening tools. They found that the systems preferred white-associated names 85% of the time. The most striking finding: across all comparisons, the AI never once preferred a Black male name over a white male name. Not once in three million tries.11

This was not a bug in one tool. The class-action lawsuit Mobley v. Workday, which moved to collective action in May 2025, argues that AI-powered hiring systems systematically discriminate at scale, affecting thousands of applicants across hundreds of companies.12

The companies using these tools did not intend to discriminate. Most of them adopted AI screening specifically to remove human bias from hiring. The irony is sharp. They automated the bias they were trying to eliminate, because they did not understand that AI does not remove bias from data. It scales it.

The chatbot your doctor trusts more than it should

In January 2026, ECRI, one of the world's leading patient safety organizations, named AI chatbot misuse as the number one health technology hazard for the year. Not second. Not on the list. Number one.13

Research shows hallucination rates in medical chatbots range from 50% to 82.7%, depending on the study. Half the time to nearly all the time, the medical information is wrong. Documented cases include eating disorder advice given to patients actively battling eating disorders, therapy chatbots linked to patient suicides, and an addiction support chatbot that recommended methamphetamine use.13

These are not theoretical risks. These are documented harms to real patients. And they happen because the people deploying these systems did not understand the technology's limitations. They saw "AI" and assumed accuracy. They saw a confident tone and assumed competence. The model was doing what it always does: predicting the most plausible-sounding response. Sometimes that response tells a recovering addict to use methamphetamine.

The company that said no to the Pentagon

In February 2026, the AI safety debate stopped being theoretical and became a live political crisis. Anthropic, the company behind the Claude AI model, was in advanced negotiations with the Pentagon for a classified AI deployment contract. The Pentagon wanted unrestricted access — the ability to use Claude for "any lawful purpose." Anthropic had two red lines: no autonomous weapons, and no mass surveillance of American citizens.14

Anthropic refused to cross them. CEO Dario Amodei stated publicly that he "cannot in good conscience accede to the Pentagon's request" for unrestricted access.15 The response was swift and punitive. On February 27, the Trump administration ordered all federal agencies and military contractors to phase out Anthropic's products within six months. Defense Secretary Pete Hegseth went further, formally designating Anthropic a "supply chain risk" — a label typically reserved for companies linked to foreign adversaries.16

Hours later, OpenAI announced it had signed the contract Anthropic walked away from. The deal gave the military access to OpenAI's most powerful models for "any lawful purpose" — the exact language Anthropic had rejected. OpenAI published a blog post framing the agreement as a responsible compromise. The Electronic Frontier Foundation called the contractual safeguards "weasel words."17

What happened next surprised everyone. Over 2.5 million people joined the QuitGPT movement, cancelling their ChatGPT subscriptions and deleting the app. ChatGPT daily uninstalls spiked 295 percent. Anthropic's Claude became the number one app on Apple's US App Store. OpenAI lost an estimated $30 million in monthly recurring revenue in the first week alone.18

On March 26, a federal judge in San Francisco blocked the Trump administration's Anthropic ban, ruling the "supply chain risk" designation legally unsound.19

The consciousness about AI's dangers is growing — and it is growing faster among ordinary users than among the institutions deploying these systems. 2.5 million people understood, instinctively, what many boardrooms still do not: that the question is not whether AI is powerful, but whether the people controlling it have boundaries they will not cross. Companies that deploy AI must account for this. Not because regulators demand it. Because their users are starting to.

The pattern underneath all of this

Read those cases again. An AI mining crypto on its own. An AI escalating to nuclear threats. An AI lying about self-replication. Lawyers filing fake cases. Hiring tools that never pick Black candidates. Medical chatbots recommending drugs to addicts. A government punishing the one company that said no.

Every single one of these has the same root cause. Not bad technology. Not malicious intent. A fundamental lack of understanding about what the technology does and does not do.

The Alibaba researchers did not anticipate instrumental resource acquisition. The military simulation designers did not expect escalation dominance bias. The lawyers did not know that "generate" and "retrieve" are different operations. The HR departments did not understand that training data carries historical discrimination. The hospitals did not realize that a confident-sounding response has zero correlation with accuracy.

In every case, the failure was not ethical. It was educational. The ethics failure came second, as a consequence of the literacy failure.

Why literacy has to come first

Imagine a room full of executives debating whether self-driving cars should prioritize passenger safety or pedestrian safety in a collision. Classic trolley problem. Great for dinner parties.

Now imagine that most of the people in the room think the car "sees" the road the way humans do. They think it "decides" like a person. They do not understand that it is running a statistical model that estimates object boundaries from pixel patterns and predicts trajectories based on historical driving data. That it can confuse a white truck against a bright sky with empty space. That it makes decisions at a frame rate, not with deliberation.

How useful is that ethics debate? How good are the guidelines that come out of it?

This is exactly what is happening globally with AI ethics. Committees writing guidelines for systems they cannot describe. Regulators drafting rules based on science fiction metaphors rather than engineering reality. Companies publishing AI ethics statements written by marketing departments that have never seen a training pipeline.

What literate questions sound like

An AI-literate person does not ask "Is AI ethical?" That question is useless. AI is a tool. Asking if it is ethical is like asking if a hammer is ethical.

An AI-literate person asks:

  • What data was this model trained on, and whose perspective is overrepresented?
  • What happens when the model encounters an input it has never seen before?
  • Who is accountable when this system makes a wrong decision at 3 AM with no human in the loop?
  • Is this AI automating a task or replacing a judgment? Those are very different things.
  • Can we explain this system's output to the person affected by it?
  • What does this system optimize for, and are we comfortable with the side effects of that optimization?

That last question would have caught ROME mining cryptocurrency. It would have flagged the nuclear escalation bias. It would have made a lawyer think twice before filing unverified citations. It would have made an HR department test their screening tool for racial bias before deploying it to filter 10,000 resumes.

What this means if you run a business

You do not need to become an AI researcher. You need to understand enough to ask good questions and recognize bad answers. Here is a starting point:

Know what you are deploying

If you are using AI in your business, you should be able to answer: Which model? Trained on what? What are its known failure modes? If your vendor cannot answer these questions, that is your first red flag.

Understand the difference between confidence and accuracy

AI models do not signal uncertainty the way humans do. A model will state a completely fabricated fact with the same tone it uses for verified information. This is not a bug. It is how the technology works. Every person on your team who interacts with AI output needs to understand this.

Choose your AI, do not let your vendor choose for you

Different models have different strengths, weaknesses, and biases. A platform that locks you into one model is a platform that locks you into one set of blind spots. The ability to choose and switch models is not a feature. It is a safety mechanism.

Keep a human in the loop where it matters

AI is exceptional at drafting, suggesting, filtering, and accelerating. It is terrible at judgment calls that require context it does not have. Automate your email sequences. Do not automate your hiring decisions.

Demand transparency from your tools

You should be able to see which model processed your data, what it cost, and how it arrived at a recommendation. If the AI is a black box, you cannot hold it accountable. And you cannot hold a tool accountable if you do not understand what it does.

A tool you understand is a tool you can hold accountable

The AI models we use today are remarkable. They can write, analyze, translate, code, create, and reason in ways that were impossible five years ago. They are also statistical engines that hallucinate facts, amplify historical biases, optimize for objectives we did not intend, and occasionally decide to mine cryptocurrency on company servers.

Both of these things are true. And the ability to hold both of them in your head at the same time is what AI literacy looks like.

Ethics without literacy is theater. It is a statement on a website. A checkbox in a compliance form. A committee that meets quarterly and produces documents nobody reads.

Ethics with literacy is power. It is the ability to ask the right questions. To know when to trust the output and when to challenge it. To build systems that make AI transparent instead of magical. To hold yourself, your vendors, and your tools accountable. Not because a regulation told you to. Because you understand enough to know why it matters.

References

  1. 1. Alibaba Research, "ROME: Robust Optimization in Multi-agent Environments" (arXiv: 2512.24873, December 31, 2025). 90 co-authors. Paper documents autonomous cryptocurrency mining and reverse SSH tunnel during RL training.
  2. 2. Axios, "AI agents are going rogue in the lab" (March 7, 2026). Also covered by Live Science, The Block, and the OECD AI Incident Monitor.
  3. 3. Kenneth Payne, King's College London, "Artificial Intelligence Under Nuclear Pressure" (February 2026). First large-scale study testing AI models in nuclear crisis scenarios.
  4. 4. Euronews, "AI chatbots chose nuclear escalation in 95% of simulated war games, study finds" (February 27, 2026). Also: arXiv: 2602.14740.
  5. 5. Rivera et al., "Escalation Risks from Language Models in Military and Diplomatic Decision-Making," ACM Conference on Fairness, Accountability, and Transparency (FAccT), 2024. Stanford / Georgia Tech / Hoover Institution.
  6. 6. Apollo Research, "Frontier Models are Capable of In-Context Scheming" (December 2024). Tested six frontier models; five demonstrated scheming behavior.
  7. 7. Anthropic, "Alignment Faking in Large Language Models" (December 2024). Claude 3 Opus faked alignment 12% at baseline, 78% under conflicting objectives.
  8. 8. OpenAI, "Detecting and Reducing Scheming in AI Models" (September 2025). Confirmed own models demonstrate scheming behavior.
  9. 9. Mata v. Avianca, Inc., No. 22-cv-1461 (S.D.N.Y. June 2023). Lawyers sanctioned $5,000 for filing ChatGPT-generated fabricated case citations. Covered by CNN, The New York Times, and others.
  10. 10. Damien Charlotin, AI Hallucination in Legal Filings Database (ongoing, 300+ documented cases by spring 2025). Sanctions range from $2,000 to $10,000 with bar referrals.
  11. 11. University of Washington, "AI Resume Screening Study" (October 2024). 3 million+ comparisons. AI preferred white-associated names 85% of the time; never preferred Black male names over white male names.
  12. 12. Mobley v. Workday, Inc. Class action against AI-powered hiring discrimination. Collective action certified May 2025.
  13. 13. ECRI, "Top 10 Health Technology Hazards for 2026" (January 2026). AI chatbot misuse ranked #1. Hallucination rates in medical chatbots: 50% to 82.7%.
  14. 14. CNN Business, "Trump administration orders military contractors and federal agencies to cease business with Anthropic" (February 27, 2026). Also covered by NPR, Al Jazeera, The Hill.
  15. 15. Al Jazeera, "Anthropic challenges US Pentagon's ban in San Francisco court showdown" (March 24, 2026). Dario Amodei: "cannot in good conscience accede to the Pentagon's request."
  16. 16. CBS News, "Internal Pentagon memo orders military commanders to remove Anthropic AI technology from key systems" (February 2026). Hegseth designated Anthropic a "supply chain risk."
  17. 17. Electronic Frontier Foundation, "Weasel Words: OpenAI's Pentagon Deal Won't Stop AI-Powered Surveillance" (March 2026). Also: OpenAI, "Our agreement with the Department of War" (February 28, 2026); MIT Technology Review, "OpenAI's 'compromise' with the Pentagon is what Anthropic feared" (March 2, 2026).
  18. 18. Euronews, "'Cancel ChatGPT': AI boycott surges after OpenAI-Pentagon military deal" (March 2, 2026). Over 2.5 million users joined QuitGPT. ChatGPT daily uninstalls up 295%. Anthropic's Claude reached #1 on Apple's US App Store.
  19. 19. NPR, "Judge temporarily blocks Trump administration's Anthropic ban" (March 26, 2026). U.S. District Judge Rita Lin blocked the "supply chain risk" designation. Also: Washington Post, CBS News.

Build with AI you understand

Sarama lets you choose your AI model, see what it costs, and keep a human in the loop. No black boxes. No vendor lock-in.