2026 / Week 2

www.pentestpartners.com

https://www.pentestpartners.com/security-blog/eurostar-ai-vulnerability-when-a-chatbot-goes-off-the-rails/

https://tonsky.me/blog/tahoe-icons/

www.together.ai

https://www.together.ai/blog/evaluate-and-benchmark-llms

www.youtube.com

https://www.youtube.com/watch?v=ULszsXDyjMY

https://llmindex.net/benchmarks

https://llm-stats.com/

https://llm-stats.com/

https://www.inc.com/jessica-stillman/google-co-founder-sergey-brins-unretirement-is-a-lesson-for-the-rest-of-us/91280208

artificialanalysis.ai

https://artificialanalysis.ai/methodology/intelligence-benchmarking

https://namangarg.in/this-has-never-felt-new-to-me/

www.youtube.com

https://www.youtube.com/watch?v=xRh2sVcNXQ8

https://manus.im/blog/manus-100m-arr

https://manus.im/blog/Context-Engineering-for-AI-Agents-Lessons-from-Building-Manus

https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/

www.incompleteideas.net

http://www.incompleteideas.net/IncIdeas/BitterLesson.html

zhengdongwang.com

https://zhengdongwang.com/2025/12/30/2025-letter.html

www.youtube.com

https://www.youtube.com/watch?v=vih5tkdSGHk

blog.getmocha.com

https://blog.getmocha.com/no-escape-hatch-engineering-behind-mocha/

www.anthropic.com

https://www.anthropic.com/engineering/effective-harnesses-for-long-running-agents

www.anthropic.com

https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents