About SCAND.Ai
Independent AI Industry Watchdog
Real-time monitoring of AI controversies across 6 sources, 5-layer dedup, AI-powered analysis.
What We Do
SCAND.Ai monitors the AI industry for emerging controversies, heated debates, and significant disputes in real-time. We track social media, news sources, and community platforms to surface stories as they develop.
SCAND.Ai is an approved Google News publisher, providing real-time AI industry coverage to news aggregators and search engines worldwide.
Data Sources
Content is crawled from multiple source types:
- Twitter/X โ timelines of key AI industry figures and trending search terms
- Reddit โ 29 AI-focused subreddit RSS feeds
- Hacker News โ front page and new stories
- Bluesky โ AI community feeds
- News RSS โ 26+ curated publications including ArXiv, Google News, and major AI outlets
Analysis Pipeline
Every piece of content goes through a multi-stage pipeline:
- Ingestion โ content is deduplicated (SHA-256 + SimHash near-duplicate detection) and matched against known AI figures
- Spike Detection โ a keyword co-occurrence buffer clusters related content into topics
- Classification โ Gemini Flash determines if content represents a genuine controversy
- Full Analysis โ Gemini Flash produces structured analysis: summaries, key points, party positions, timeline, and forecast
Editorial Process
SCAND.Ai combines AI-powered analysis with human editorial oversight. When the system identifies a potential controversy, it generates a candidate analysis and sends it to our editorial team via Telegram for review.
Each candidate is reviewed by a human editor who can approve, modify, or reject it. Only approved topics are published on the site. This ensures every published analysis meets our editorial standards for accuracy, relevance, and fairness.
High-confidence detections (genuine multi-party debates with strong signals) may be auto-approved, but are still subject to post-publication review.
Noise Score
Each controversy receives a noise score from 0 to 100, computed from seven weighted factors:
- Reach (20%) โ total audience exposed
- Engagement (20%) โ interaction velocity (posts per hour)
- Star Power (15%) โ involvement of high-profile industry figures
- Cross-Platform (15%) โ spread across multiple sources
- Duration (10%) โ how long the story has been active
- Polarity (10%) โ how divided opinions are
- Industry Impact (10%) โ potential lasting effect on the AI field
A 7-day half-life decay ensures scores naturally decrease as controversies lose momentum.
State Machine
Topics progress through five states based on signal thresholds:
- Emerging โ initial detection of a potential controversy
- Growing โ content velocity exceeds 15 items/hour
- Debated โ spread across 2+ platforms
- Major Controversy โ noise score reaches 75+
- Resolved โ activity drops below threshold after cooldown period
Each transition has a cooldown (12โ48 hours) before decay can step the topic back to a lower state.
AI Disclosure
Summaries, key points, forecasts, and party analysis are generated by Gemini (Google). All AI-generated content is based on sourced material from the data sources listed above. Forecasts are probabilistic assessments based on current signals โ events may develop differently.
We are transparent about our use of AI: every analysis includes the original source material it was derived from, and our methodology is fully documented on this page.
Corrections Policy
We strive for accuracy in all our reporting. If you spot an error in our analysis, factual claims, or party attribution, please contact us at contact@scand.ai. We investigate all reports and publish corrections promptly.
Contact
For editorial inquiries, corrections, or partnership requests: contact@scand.ai
Subscribe to AI Watch: scand.ai/newsletter
Telegram: @scand_ai