News

AI / agent 工程资讯聚合,采用 linkblog / reading log 方案

来源

OpenAI News

news

OpenAI 产品、工程和公司发布

访问来源 →

OpenAI Cookbook

cookbook

实操、范例和 prompt / agent 工程文章

访问来源 →

OpenAI Codex Best Practices

docs长期有效

Codex 使用文档和最佳实践

访问来源 →

Anthropic Engineering

news

context engineering、agent 方法论、eval、tool use 和 coding agent 实践

访问来源 →

Simon Willison Weblog

news

高频独立观察来源

访问来源 →

Agentic Engineering Patterns

guide长期有效

长期有效的专题指南

访问来源 →

长期有效资源

guidesimonwillison-weblog

Agentic Engineering Patterns

Simon Willison 总结的 agent 工程模式,涵盖任务分解、工具使用和错误处理等核心概念。

agent工程模式任务分解工具使用

最新资讯

newsopenai-news

Previewing GPT-5.6 Sol: a next-generation model

OpenAI previews GPT-5.6 Sol, a next-generation model with stronger capabilities in coding, science, and cybersecurity, paired with its most advanced safety stack.

GPTOpenAICodingSafety
newssimonwillison-weblog

Quoting Dean W. Ball

This is a bad state of affairs. Consider, in particular, some industry dynamics: Frontier models are trained at an enormous cost, and a significant fraction of that cost is recouped in the few post...

Quoting
newssimonwillison-weblog

Quoting Timothy B. Lee

This is like saying there's no learning curve to being a manager because your employees will just do whatever you tell them to do. — Timothy B. Lee, on the idea that LLMs take no skill and ha...

Quoting
newssimonwillison-weblog

Incident Report: CVE-2026-LGTM

Incident Report: CVE-2026-LGTM Spectacular hypothetical incident report by Andrew Nesbitt. Day 2, 16:00 UTC --- Two AI review agents from competing vendors, both attached to a downstream pull reque...

Incident
newssimonwillison-weblog

Quoting OpenAI

We're beginning a limited preview of the GPT‑5.6 series: Sol, our flagship model; Terra, a balanced model for everyday work; and Luna, a fast and affordable model. Terra has competitive performance...

OpenAI
newsopenai-news

How agents are transforming work

A new OpenAI research paper shows how AI agents are transforming work, enabling longer, more complex tasks and expanding productivity across roles.

OpenAI
newssimonwillison-weblog

AI and Liability

AI and Liability Bruce Schneier on the recent German ruling that Google be held liable for errors introduced in their AI overviews: AI agents are agents of the person or organization that deploys t...

Liability
newssimonwillison-weblog

datasette-export-database 0.3a2

Release: datasette-export-database 0.3a2 An embarrassingly tiny release. The pyproject.toml had pinned to datasette==1.0a27, inadvertently making this plugin incompatible with all other Datasette v...

DatasetteDatabase
newssimonwillison-weblog

simonw/browser-compat-db

simonw/browser-compat-db Inspired by Mozilla's new MDN MCP service - source code here - I decided to try converting their comprehensive mdn/browser-compat-data repository full of browser compatibil...

Simonw/browser-compat-db
newssimonwillison-weblog

Quoting Tom MacWright

In the last few months, I've started to see [job applications] that were clearly cowritten by an LLM, link to an LLM-generated portfolio site, which then links to LLM-generated GitHub projects, wit...

Quoting
newsopenai-news

Helping build shared standards for advanced AI

OpenAI helps build shared standards for advanced AI, supporting evaluation frameworks, safety practices, and global cooperation through the Appia Foundation.

OpenAISafety
newssimonwillison-weblog

datasette 1.0a35

Release: datasette 1.0a35 I'll write more about this one soon, but it's a big release. Three highlights from the release notes: New "Create table" interface in the database actions menu, backed by ...

Datasette
newssimonwillison-weblog

OPFS + Pyodide test harness

Tool: OPFS + Pyodide test harness I've been pondering if Datasette Lite - the Python Datasette application run entirely in the browser using Pyodide and WebAssembly - might be able to edit persiste...

Testing
newsopenai-news

Codex-maxxing for long-running work

Learn how Jason Liu uses Codex to preserve context, manage complex projects, and help work continue beyond a single prompt.

CodingPrompt
newssimonwillison-weblog

Prompt Injection as Role Confusion

Prompt Injection as Role Confusion First, I absolutely love this: This is a blog-style writeup of the paper. I wish every paper would come with one of these. Academic writing is pretty dry - the im...

Prompt
newssimonwillison-weblog

sqlite-utils 4.0rc1 adds migrations and nested transactions

sqlite-utils is my combined Python library and CLI tool for working with SQLite databases. It provides an extensive set of higher-level operations on top of Python's default sqlite3 package, includ...

SQLite
newssimonwillison-weblog

sqlite-utils 4.0rc1

Release: sqlite-utils 4.0rc1 See sqlite-utils 4.0rc1 adds migrations and nested transactions. Tags: sqlite-utils

SQLite
newssimonwillison-weblog

Temporary Cloudflare Accounts for AI agents

Temporary Cloudflare Accounts for AI agents The announcement says this is "for AI agents" but (as is pretty common these days) the AI hook isn't really necessary, this is an interesting feature for...

Temporary
newssimonwillison-weblog

Quoting Sean Lynch

The real valuable capability MCP offers over skills/CLI is isolating the auth flow outside of the agent’s context window, and potentially out of the harness completely. [...] Maybe the idealized fo...

Quoting
newsopenai-news

Improving health intelligence in ChatGPT

Learn how GPT-5.5 Instant improves ChatGPT’s health and wellness responses with stronger reasoning, better context, clearer communication, and physician-informed evaluations.

GPTReasoningChatGPT
newssimonwillison-weblog

datasette-acl 0.6a0

Release: datasette-acl 0.6a0 This release expands datasette-acl from table-only permissions toward a general resource-sharing system. Alex Garcia did most of the work for this release - we're flesh...

Datasette
newsopenai-news

Introducing LifeSciBench

Introducing LifeSciBench, an expert-authored, expert-reviewed benchmark for evaluating how AI systems handle real-world life science research tasks and decisions.

Benchmark
newsopenai-news

Introducing the OpenAI Partner Network

OpenAI launches the Partner Network, investing $150M to help global partners accelerate enterprise AI adoption, deployment, and transformation.

OpenAI
newsopenai-news

Introducing GPT-4.1

OpenAI 发布 GPT-4.1,带来更强的编码能力和更长的上下文窗口。

GPT-4.1模型发布编码能力