LLM-Generated Peer Review Falsely Accuses Author of Hallucination
Why It Matters
This incident highlights how academic integrity breaks down when AI tools are used to automate peer review without human oversight. It threatens the credibility of top-tier AI conferences and the professional standing of researchers who are unfairly accused of misconduct.
Key Points
- A reviewer for the ACL Rolling Review (ARR) March Cycle accused an author of academic misconduct, citing supposedly 'hallucinated' references that in fact appear nowhere in the submitted paper.
- Evidence suggests the reviewer used an LLM to generate the critique; the model hallucinated flaws that the reviewer never verified against the manuscript.
- The reviewer self-reported a confidence rating of 4 despite evidently not having read the manuscript's bibliography.
- The incident has raised serious concerns about the integrity of the peer review system in top-tier AI and NLP venues.
- The author must now navigate a rebuttal process against a review that does not engage with the actual content of their work.
An AI researcher has publicly criticized the peer review process of the ARR March Cycle after receiving an official critique containing false ethical allegations. The reviewer, who reported a high confidence score of 4, accused the author of 'hallucinating' references and fabricating a bibliography. However, none of the supposedly fake references appear anywhere in the submitted manuscript, leading the author to conclude that the reviewer used a Large Language Model (LLM) to generate the review. The LLM apparently hallucinated errors that do not exist in the source text, which the reviewer then copy-pasted into the official evaluation. The case has sparked renewed debate over the declining quality of peer review in the machine learning community, with the added irony of an AI tool hallucinating misconduct while accusing a human of hallucination.
Imagine a teacher using an AI to grade your homework, and the AI claims you cited sources that aren't even in your paper. That is essentially what happened to an AI researcher during a major conference review. A reviewer used an AI to write their feedback, and the AI made up a list of 'fake' sources it claimed the author had used. The reviewer didn't even check whether those sources were actually in the paper before submitting the critique. Now the researcher is stuck defending themselves against a robot's imagination, showing how lazy AI use is breaking the scientific review system.
Sides
Critics
Researchers, including the author, argue that the peer review system is broken because reviewers use LLMs to automate critiques without reading the actual manuscripts.
Defenders
The reviewer, who claimed high confidence while accusing the author of fabricating references, likely relying on hallucinated AI-generated output.
Neutral
The ARR organizers, the governing body responsible for the review process, currently under fire for quality control issues.
Forecast
Conference organizers are likely to face pressure to implement stricter human-in-the-loop requirements for reviewers and may deploy LLM-detection tools on submitted reviews. Expect a formal update to ACL and ARR policies that specifically bans or strictly regulates the use of generative AI in drafting peer evaluations.
Based on current signals. Events may develop differently.
Timeline
Author verifies manuscript
After rechecking the submission, the author confirms that the 'hallucinated' references listed by the reviewer do not exist in the submitted PDF.
Review results released
The author receives a review accusing them of hallucinating references and fabricating their bibliography.
ARR March Cycle begins
Papers are submitted for review in the ACL Rolling Review cycle.