Steam, Steel, and Infinite Minds——Ivan Zhao

2025 年 12 月 30 日·27 min read·5335 字

Steam, Steel, and Infinite Minds——Ivan Zhao

蒸汽、钢铁、还有无限头脑——Ivan Zhao

Master the Material Define the Era.第一部分：历史的材料与卡内基的时代

Every era is shaped by its miracle material. Steel forged the Gilded Age. Semiconductors switched on the Digital Age. Now AI has arrived as infinite minds. If history teaches us anything, those who master the material define the era.

每个时代都由其“奇迹材料”所塑造。钢铁锻造了镀金时代，半导体开启了数字时代。现在，AI 作为“无限心智”已经到来。如果历史能教会我们什么，那就是：谁掌握了材料，谁就定义了时代。

Left: teenage Andrew Carnegie and his younger brother.左图：青少年时期的安德鲁·卡内基和他的弟弟。
Right: Pittsburgh steel factories during the Gilded Age.右图：镀金时代的匹兹堡钢铁厂。

Left: teenage Andrew Carnegie and his younger brother.左图：青少年时期的安德鲁·卡内基和他的弟弟。 Right: Pittsburgh steel factories during the Gilded Age.右图：镀金时代的匹兹堡钢铁厂。

In the 1850s, Andrew Carnegie ran through muddy Pittsburgh streets as a telegraph boy. Six in ten Americans were farmers. Within two generations, Carnegie and his peers forged the modern world. Horses gave way to railroads, candlelight to electricity, iron to steel.

1850 年代，安德鲁·卡内基（Andrew Carnegie）还是个在匹兹堡泥泞街道上奔跑的电报员。当时六成美国人是农民。不到两代人的时间，卡内基和他的同辈们就锻造了现代世界：马车让位于铁路，烛光让位于电力，生铁进化为钢铁。

The Rearview Window 第二部分：后视镜理论

Since then, work shifted from factories to offices. Today I run a software company in San Francisco, building tools for millions of knowledge workers. In this industry town, everyone is talking about AGI, but most of the two billion desk workers have yet to feel it. What will knowledge work look like soon? What happens when the org chart absorbs minds that never sleep?

从那时起，工作重心从工厂转移到了办公室。今天，我在旧金山经营着一家软件公司，为数百万知识工作者构建工具。在这个工业小镇上，每个人都在谈论 AGI（通用人工智能），但全球 20 亿办公族中的大多数人尚未真正感受到它。当组织架构吸纳了那些永不眠的心智时，会发生什么？

Early movies often looked like stage plays, with one camera focused on the stage.早期的电影通常看起来像舞台剧，镜头始终固定在舞台上。

This future is often difficult to predict because it always disguises itself as the past. Early phone calls were concise like telegrams. Early movies looked like filmed plays. (This is what Marshall McLuhan called "driving to the future via the rearview window.")

未来往往难以预测，因为它总是伪装成过去。早期的电话像电报一样简洁；早期的电影看起来像拍摄出来的舞台剧。（这就是马歇尔·麦克卢汉所说的“通过后视镜驶向未来”。）

The most popular form of AI today looks like Google search of the past. To quote Marshall McLuhan: "we are always driving into the future via the rearview window."今天最流行的 AI 形式看起来就像过去的 Google 搜索。引用麦克卢汉的话：“我们总是通过后视镜驶入未来。”

Today, we see this as AI chatbots which mimic Google search boxes. We're now deep in that uncomfortable transition phase which happens with every new technology shift.

I don't have all the answers on what comes next. But I like to play with a few historical metaphors to think about how AI can work at different scales, from individuals to organizations to whole economies.

今天，我们将 AI 看作模仿搜索框的聊天机器人。我们正处于每项新技术转型中都会经历的那段令人不安的过渡期。

我并不知道接下来的全部答案。但我喜欢用几个历史隐喻来思考 AI 如何在不同维度发挥作用——从个人，到组织，再到整个经济体。

Individuals: from bicycles to cars 第三部分：个人层面——从自行车到汽车

The first glimpses can be found with the high priests of knowledge work: programmers.

My co-founder Simon was what we call a 10× programmer, but he rarely writes code these days. Walk by his desk and you'll see him orchestrating three or four AI coding agents at once, and they don't just type faster, they think, which together makes him a 30–40× engineer. He queues tasks before lunch or bed, letting them work while he's away. He's become a manager of infinite minds.

最初的征兆可以从知识工作的“高级祭司”——程序员身上看到。

我的联合创始人 Simon 曾是所谓的“10倍程序员”，但他现在很少写代码了。他在办公桌前同时指挥三四个 AI 编程代理（Agents），它们不仅打字快，而且会思考，这使他成为了“30-40倍工程师”。他成了“无限心智”的管理者。

A 1970s Scientific American study on locomotion efficiency inspired Steve Jobs's famous "bicycle for the mind" metaphor. Except we've been pedaling on the Information Superhighway for decades since.1970 年代《科学美国人》关于运动效率的一项研究，激发了史蒂夫·乔布斯著名的“大脑的自行车”这一比喻。只是此后的几十年里，我们一直在“信息高速公路”上蹬车。

In the 1980s, Steve Jobs called personal computers "bicycles for the mind." A decade later, we paved the "information superhighway" that is the internet. But today, most knowledge work is still human-powered. It's like we've been pedaling bicycles on the autobahn.

With AI agents, someone like Simon has graduated from riding a bicycle to driving a car.

When will other types of knowledge workers get cars? Two problems must be solved.

1980 年代，史蒂夫·乔布斯将个人电脑比作“大脑的自行车”。但今天，大多数知识工作仍是人力驱动的。这就像我们在高速公路上蹬自行车。有了 AI 代理，我们正从骑自行车进化到开车。

有了 AI 代理，像 Simon（Notion 联合创始人）这样的人已经从骑自行车进化到了开车。

其他类型的知识工作者何时能开上车？必须解决两个问题。

Compared with coding agents, why is it more difficult for AI to help with knowledge work? Because knowledge work is more fragmented and less verifiable.与编程代理相比，为什么 AI 辅助知识工作更难？因为知识工作更加碎片化，且更难验证。

First, context fragmentation. For coding, tools and context tend to live in one place: the IDE, the repo, the terminal. But general knowledge work is scattered across dozens of tools. Imagine an AI agent trying to draft a product brief: it needs to pull from Slack threads, a strategy doc, last quarter's metrics in a dashboard, and institutional memory that lives only in someone's head. Today, humans are the glue, stitching all that together with copy-paste and switching between browser tabs. Until that context is consolidated, agents will stay stuck in narrow use cases.

首先是“上下文碎片化”。对于编程，工具和背景信息通常存在于一个地方：IDE（集成开发环境）、仓库、终端。但通用的知识工作散落在几十个工具中。想象一个 AI 代理试图起草一份产品简介：它需要从 Slack 的讨论串、一份战略文档、仪表盘上的上季度指标，以及只存在于某人脑海中的机构记忆中提取信息。今天，人类充当了信息的粘合剂，通过复制粘贴和切换浏览器标签将一切缝合在一起。在上下文被整合之前，代理将始终停留在狭窄的应用场景中。

The second missing ingredient is verifiability. Code has a magical property: you can verify it with tests and errors. Model makers use this to train AI to get better at coding (e.g., reinforcement learning). But how do you verify if a project is managed well, or if a strategy memo is any good? We haven't yet found ways to improve models for general knowledge work. So humans still need to be in the loop to supervise, guide, and show what good looks like.

第二个缺失的要素是“可验证性”。代码拥有一种神奇的属性：代码拥有一种神奇的属性：你可以通过测试和错误反馈来验证它。模型制造者利用这一点来训练 AI，使其更擅长编程（例如强化学习）。但你如何验证一个项目是否管理得当，或者一份战略备忘录是否优秀？我们尚未找到改进通用知识工作模型的方法。因此，人类仍需留在环路中进行监督、引导，并定义什么是“优秀”。

The Red Flag Act of 1865 required a flag bearer to walk ahead of the vehicle while it drove down the street (repealed in 1896). An example of undesirable "human in the loop."1865 年的《红旗法案》要求在车辆行驶时，必须有一名旗手在车前步行引导（该法案于 1896 年废除）。这就是一个“不理想的人在环路中”的例子。

Programming agents this year taught us that having a "human-in-the-loop" isn't always desirable. It's like having someone personally inspect every bolt on a factory line, or walk in front of a car to clear the road (see: the Red Flag Act of 1865). We want humans to supervise the loops from a leveraged point, not be in them. Once context is consolidated and work is verifiable, billions of workers will go from pedaling to driving, and then from driving to self-driving.

今年的编程代理告诉我们，让“人在环路中”并不总是理想的。这就像让某人亲自检查工厂流水线上的每一颗螺栓，或者走在汽车前面清理道路。我们希望人类从一个具有杠杆作用的高点来监督环路，而不是置身其中。一旦上下文得到整合且工作可验证，数十亿工作者将从蹬车进化为开车，再从开车进化为自动驾驶。

Organizations: steel and steam 第四部分：组织层面：钢铁与蒸汽

Companies are a recent invention. They degrade as they scale and reach their limit.

公司是近代才出现的发明。它们会随着规模扩大而退化，并达到极限。

Organizational chart for the New York and Erie Railroad, 1855. The modern corporation and org chart evolved with the railroad companies, which were the first enterprises that needed to coordinate thousands of people across great distances.1855 年纽约和伊利铁路公司的组织架构图。现代公司和组织架构随铁路公司演变而来，它们是第一批需要在广阔地域协调数千人的企业。

A few hundred years ago, most companies were workshops of a dozen people. Now we have multinationals with hundreds of thousands. The communication infrastructure (human brains connected by meetings and messages) buckles under exponential load. We try to solve this with hierarchy, process, and documentation. But we've been solving an industrial-scale problem with human-scale tools, like building a skyscraper with wood.

几百年前，大多数公司只是十几个人的作坊。现在我们有了拥有数十万员工的跨国公司。这种通信基础设施（由会议和消息连接的人脑）在指数级的负载下发生崩塌。我们试图通过层级、流程和文档来解决这个问题。但我们一直在用人类尺度的工具解决工业尺度的问题，就像用木头盖摩天大楼。

Two historical metaphors show how future organizations can look differently with new miracle materials.

两个历史隐喻展示了在新奇迹材料的助力下，未来的组织会有何不同。

A wonder of steel: the Woolworth building was the tallest building in the world upon completion in NYC, 1913.

The first is steel. Before steel, buildings in the 19th century had a limit of six or seven floors. Iron was strong but brittle and heavy; add more floors, and the structure collapsed under its own weight. Steel changed everything. It's strong yet malleable. Frames could be lighter, walls thinner, and suddenly buildings could rise dozens of stories. New kinds of buildings became possible.

第一个是钢铁。在钢铁出现前，19 世纪的建筑受限于六七层楼。铁虽然坚固但易碎且沉重；如果增加层数，结构就会在自身重量下崩塌。钢铁改变了一切。它既坚固又具延展性。框架可以更轻，墙体可以更薄，突然之间，建筑可以拔高到几十层。新型建筑成为了可能。

AI is steel for organizations. It has the potential to maintain context across workflows and surface decisions when needed without the noise. Human communication no longer has to be the load-bearing wall. The weekly two-hour alignment meeting becomes a five-minute async review. The executive decision that required three levels of approval might soon happen in minutes. Companies can scale, truly scale, without the degradation we've accepted as inevitable.

AI 是组织的钢铁。它有潜力在跨流程中维持上下文，并在需要时排除噪音、浮现决策。人类的沟通不再必须充当承重墙。每周两小时的对齐会议变成了五分钟的异步审查。过去需要三级审批的高管决策，可能很快在几分钟内完成。公司可以实现真正的规模化，而不会出现我们以前认为不可避免的那种退化。

A mill with a water wheel to power its operations. Water was powerful but unreliable and restricted mills to a few locations and seasonality.

The second story is about the steam engine. At the beginning of the Industrial Revolution, early textile factories sat next to rivers and streams and were powered by waterwheels. When the steam engine arrived, factory owners initially swapped waterwheels for steam engines and kept everything else the same. Productivity gains were modest.

第二个故事关于蒸汽机。工业革命初期，早期的纺织厂坐落在河流溪水边，依靠水轮提供动力。当蒸汽机出现时，工厂主起初只是用蒸汽机换掉了水轮，其他一切照旧。效率提升很微小。

The real breakthrough came when factory owners realized they could decouple from water entirely. They built larger mills closer to workers, ports, and raw materials. And they redesigned their factories around steam engines. (Later, when electricity came online, owners further decentralized away from a central power shaft and placed smaller engines around the factory for different machines.) Productivity exploded, and the Second Industrial Revolution really took off.

真正的突破发生于工厂主意识到他们可以完全脱离水源。他们在靠近工人、港口和原材料的地方建造了更大的工厂。并围绕蒸汽机重新设计了工厂。后来，当电力普及，工厂主进一步去中心化，取消了中央传动轴，在不同机器旁放置了小型电机。生产力随之爆炸，第二次工业革命真正起飞。

This 1835 engraving by Thomas Allom depicts a textile factory in Lancashire, UK. It was powered by steam engines.

We're still in the "swap out the waterwheel" phase. AI chatbots bolted onto existing tools. We haven't reimagined what organizations look like when the old constraints dissolve and your company can run on infinite minds that work while you sleep.

我们现在还处于“换掉水轮”的阶段。只是把 AI 聊天机器人挂载在现有工具上。我们还没有重新构想，当旧的限制溶解、你的公司可以依靠那些在你睡觉时仍在工作的“无限心智”运行时，组织会是什么样子。

At my company Notion, we have been experimenting. Alongside our 1,000 employees, more than 700 agents now handle repetitive work. They take meeting notes and answer questions to synthesize tribal knowledge. They field IT requests and log customer feedback. They help new hires onboard with employee benefits. They write weekly status reports so people don't have to copy-paste. And this is just baby steps. The real gains are limited only by our imagination and inertia.

在我的公司 Notion，我们一直在做实验。除了我们的 1000 名员工外，现在还有 700 多个代理在处理重复性工作。它们负责记录会议纪要并回答问题，以合成团队内部知识。它们处理 IT 请求并记录客户反馈。它们协助新员工入职并了解员工福利。它们撰写每周状态报告，让人类不再需要复制粘贴。而这仅仅是起步。真正的收益只受限于我们的想象力和惯性。

Economies: from Florence to megacities 第五部分：经济——从佛罗伦萨到超级城市

Steel and steam didn't just change buildings and factories. They changed cities.

钢铁和蒸汽不仅改变了建筑和工厂，它们改变了城市。

Until a few hundred years ago, cities were human-scaled. You could walk across Florence in forty minutes. The rhythm of life was set by how far a person could walk, how loud a voice could carry.

Then steel frames made skyscrapers possible. Steam engines powered railways that connected city centers to hinterlands. Elevators, subways, highways followed. Cities exploded in scale and density. Tokyo. Chongqing. Dallas.

直到几百年前，城市还是“人类尺度”的。你可以在 40 分钟内横穿佛罗伦萨。生活的节奏由一个人的步行距离和嗓门大小决定。随后，钢架让摩天大楼成为可能。蒸汽机驱动的铁路将市中心与腹地连接起来。电梯、地铁、高速公路紧随其后。城市的规模和密度爆炸式增长。东京、重庆、达拉斯。

These aren't just bigger versions of Florence. They're different ways of living. Megacities are disorienting, anonymous, harder to navigate. That illegibility is the price of scale. But they also offer more opportunity, more freedom. More people doing more things in more combinations than a human-scaled Renaissance city could support.

这些城市不仅仅是放大版的佛罗伦萨，它们是完全不同的生存方式。超级城市让人迷失、匿名、难以导航。这种“不可读性”是规模化带来的代价。但它们也提供了更多的机会和自由。

I think the knowledge economy is about to undergo the same transformation.

我认为知识经济也即将经历同样的转型。

Today, knowledge work represents nearly half of America's GDP. Most of it still operates at human scale: teams of dozens, workflows paced by meetings and email, organizations that buckle past a few hundred people. We've built Florence's with stone and wood.

今天，知识工作占美国 GDP 的近一半。其中大部分仍以人类尺度运行：几十人的团队、由会议和邮件设定的工作节奏、一旦超过几百人就会崩塌的组织。

When AI agents come online at scale, we'll be building Tokyo. Organizations that span thousands of agents and humans. Workflows that run continuously, across time zones, without waiting for someone to wake up. Decisions synthesized with just the right amount of human in the loop.

当 AI 代理大规模上线时，我们将建造知识经济领域的“东京”。组织将跨越数以千计的代理和人类。工作流将跨越时区持续运行，无需等待某人醒来。

It will feel different. Faster, more leveraged, but also more disorienting at first. The rhythms of the weekly meeting, the quarterly planning cycle, and the annual review may stop making sense. New rhythms emerge. We lose some legibility. We gain scale and speed.

这感觉会很不一样：速度更快，杠杆力更强，但起初也会让人感到更迷茫。周会、季度计划和年度评审的节奏可能不再有意义。新的节奏将会出现。我们失去了一些可读性。我们赢得了规模和速度。

Beyond the waterwheels 第六部分：超越水轮

Every miracle material required people to stop seeing the world via the rearview mirror and start imagining the new one. Carnegie looked at steel and saw city skylines. Lancashire mill owners looked at steam engines and saw factory floors free from rivers.

每一种奇迹材料都要求人们停止通过后视镜看世界，并开始构想新世界。卡内基通过钢铁看到了城市天际线。兰开夏郡的工厂主通过蒸汽机看到了脱离河流限制的工厂。

We are still in the waterwheel phase of AI, bolting chatbots onto workflows designed for humans. We need to stop asking AI to be merely our copilots. We need to imagine what knowledge work could look like when human organizations are reinforced with steel, when busywork is delegated to minds that never sleep.

Steel. Steam. Infinite minds. The next skyline is there, waiting for us to build it.

我们仍处于 AI 的“水轮阶段”，将聊天机器人强加于为人类设计的流程中。我们需要停止要求 AI 仅仅作为我们的“副驾驶”。我们需要想象，当人类组织由钢铁加固、繁杂工作被交给那些永不眠的心智时，知识工作会是什么样。钢铁、蒸汽、无限心智。下一个天际线就在那里，等待我们去构建。