Home Services Process Work Open Source Blog es Book a call
Blog

Ideas, guides, and
production lessons.

What we learn building AI agents, RAG systems, and Claude integrations for real clients.

The Citation Crisis: When AI Gets It Right But Points Wrong
AI May 2026 3 min

The Citation Crisis: When AI Gets It Right But Points Wrong

A new benchmark reveals that even top AI models frequently support correct answers with fabricated citations—a flaw called 'attribution hallucination' that undermines trust in regulated industries.

Read article
AI Agents Can Now Write Browser Exploits—And That Changes Everything
AI May 2026 5 min

AI Agents Can Now Write Browser Exploits—And That Changes Everything

New benchmark reveals Claude Mythos and GPT-5.5 can autonomously develop real browser exploits, raising urgent questions about AI safety and the future of cybersecurity.

Read article
The End of Scaling Lies: Ernie 5.1 Cuts 94% of Pre-Training Costs Without Sacrificing Performance
AI May 2026 4 min

The End of Scaling Lies: Ernie 5.1 Cuts 94% of Pre-Training Costs Without Sacrificing Performance

Baidu's Once-For-All training method achieves frontier-level performance using only 6% of the compute cost, proving that massive spending isn't the only path to top-tier AI.

Read article
Why Bigger Language Models Actually Work: The Geometry Behind Scaling Laws
AI May 2026 4 min

Why Bigger Language Models Actually Work: The Geometry Behind Scaling Laws

MIT researchers trace scaling laws to superposition—a geometric property where LLMs pack more concepts into limited dimensions than theoretically possible.

Read article
Your Job Isn't Vanishing – It's Expanding: Why AI Agents Make Engineers More Essential
AI April 2026 4 min

Your Job Isn't Vanishing – It's Expanding: Why AI Agents Make Engineers More Essential

A new study argues AI agents don't replace software engineers but expand the discipline into strategy, governance, and societal fit. The real risk is clinging to code.

Read article
Small But Mighty: Qwen3.6-27B Shatters the Bigger Is Better Myth
AI April 2026 3 min

Small But Mighty: Qwen3.6-27B Shatters the Bigger Is Better Myth

Alibaba's new 27B dense open-source model outperforms its 397B MoE predecessor on coding benchmarks, signaling a shift toward efficiency over brute-force scale.

Read article
Free Resource

Get the AI Implementation Checklist

10 questions every team should answer before building AI systems. Avoid the most common mistakes we see in production projects.

Check your inbox!

We've sent you the AI Implementation Checklist.

No spam. Unsubscribe anytime.