orchestration
32 articles tagged with "orchestration"
Tech Feeds
Google Workspace CLI brings Gmail, Docs, Sheets and more into a common interface for AI agents
What's old is new: the command line — the original, clunky non-graphical interface for interacting with and controlling PCs, where the user just typed in raw commands in code — has become one of the m...
Google PM open-sources Always On Memory Agent, ditching vector databases for LLM-driven persistent memory
Google senior AI product manager Shubham Saboo has turned one of the thorniest problems in agent design into an open-source engineering exercise: persistent memory. This week, he published an open-sou...
LangChain's CEO argues that better models alone won't get your AI agent to production
As models get smarter and more capable, the /'harnesses/' around them must also evolve. This /'harness engineering/' is an extension of context engineering, says LangChain co-founder and CEO Harrison C...
New KV cache compaction technique cuts LLM memory 50x without accuracy loss
Enterprise AI applications that handle large documents or long-horizon tasks face a severe memory bottleneck. As the context grows longer, so does the KV cache, the area where the model’s working memo...
Enterprise MCP adoption is outpacing security controls
AI agents now carry more access and more connections to enterprise systems than any other software in the environment. That makes them a bigger attack surface than anything security teams have had to...
OpenAI's big investment from AWS comes with something else: new 'stateful' architecture for enterprise agents
The landscape of enterprise artificial intelligence shifted fundamentally today as OpenAI announced $110 billion in new funding from three of tech's largest firms: $30 billion from SoftBank, $30 billi...
8 billion tokens a day forced AT&T to rethink AI orchestration — and cut costs by 90%
When your average daily token usage is 8 billion a day, you have a massive scale problem. This was the case at AT&T, and chief data officer Andy Markus and his team recognized that it simply wasn’t...
Anthropic just released a mobile version of Claude Code called Remote Control
Claude Code has become increasingly popular in the first year since its launch, and especially in recent months, as developers and non-technical users alike flock to AI unicorn Anthropic's hit coding...
One engineer made a production SaaS product in an hour: here's the governance system that made it possible
Every engineering leader watching the agentic coding wave is eventually going to face the same question: if AI can generate production-quality code faster than any team, what does governance look like...
Shadow mode, drift alerts and audit logs: Inside the modern audit loop
Traditional software governance often uses static compliance checklists, quarterly audits and after-the-fact reviews. But this method can't keep up with AI systems that change in real time. A machine...
Runlayer is now offering secure OpenClaw agentic capabilities for large enterprises
OpenClaw, the open source AI agent that excels at autonomous tasks on computers and which users can communicate with through popular messaging apps, has undoubtedly become a phenomena since its launch...
New agent framework matches human-engineered AI systems — and adds zero inference cost to deploy
Agents built on top of today's models often break with simple changes — a new library, a workflow modification — and require a human engineer to fix it. That's one of the most persistent challenges in...
Nvidia, Groq and the limestone race to real-time AI: Why enterprises win or lose here
From miles away across the desert, the Great Pyramid looks like a perfect, smooth geometry — a sleek triangle pointing to the stars. Stand at the base, however, and the illusion of smoothness vanishe...
AI agents turned Super Bowl viewers into one high-IQ team — now imagine this in the enterprise
The average Fortune 1000 company has more than 30,000 employees and engineering, sales and marketing teams with hundreds of members. Equally large teams exist in government, science and defense organi...
Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy
Researchers at Nvidia have developed a technique that can reduce the memory costs of large language model reasoning by up to eight times. Their technique, called dynamic memory sparsification (DMS), c...
Anthropic’s Claude Cowork finally lands on Windows — and it wants to automate your workday
Anthropic released its Claude Cowork AI agent software for Windows on Monday, bringing the file management and task automation tool to roughly 70 percent of the desktop computing market and intensifyi...
MIT's new fine-tuning method lets LLMs learn new skills without losing old ones
When enterprises fine-tune LLMs for new tasks, they risk breaking everything the models already know. This forces companies to maintain separate models for every skill. Researchers at MIT, the Improba...
OpenAI upgrades its Responses API to support agent skills and a complete terminal shell
Until recently, the practice of building AI agents has been a bit like training a long-distance runner with a thirty-second memory. Yes, you could give your AI models tools and instructions, but afte...
What the OpenClaw moment means for enterprises: 5 big takeaways
The /'OpenClaw moment/' represents the first time autonomous AI agents have successfully /'escaped the lab/' and moved into the hands of the general workforce. Originally developed by Austrian engineer P...
Kilo CLI 1.0 brings open source vibe coding to your terminal with support for 500+ models
Remote-first AI coding startup Kilo doesn't think software developers should have to pledge their undying allegiance to any one development environment — and certainly not any one model or harness. T...
Mistral drops Voxtral Transcribe 2, an open-source speech model that runs on-device for pennies
Mistral AI, the Paris-based startup positioning itself as Europe's answer to OpenAI, released a pair of speech-to-text models on Wednesday that the company says can transcribe audio faster, more accur...
The ‘brownie recipe problem’: why LLMs must have fine-grained context to deliver real-time results
Today’s LLMs excel at reasoning, but can still struggle with context. This is particularly true in real-time ordering systems like Instacart. Instacart CTO Anirban Kundu calls it the /'brownie recipe...
Shared memory is the missing layer in AI orchestration
The key to successful AI agents within an enterprise? Shared memory and context. This, according to Asana CPO Arnab Bose, provides detailed history and direct access from the get-go — with guardrail...
Enterprises are measuring the wrong part of RAG
Enterprises have moved quickly to adopt RAG to ground LLMs in proprietary data. In practice, however, many organizations are discovering that retrieval is no longer a feature bolted onto model inferen...
Most RAG systems don’t understand sophisticated documents — they shred them
By now, many enterprises have deployed some form of RAG. The promise is seductive: index your PDFs, connect an LLM and instantly democratize your corporate knowledge. But for industries dependent on h...
AI models that simulate internal debate dramatically improve accuracy on complex tasks
A new study by Google suggests that advanced reasoning models achieve high performance by simulating multi-agent-like debates involving diverse perspectives, personality traits, and domain expertise....
Claude Code's 'Tasks' update lets agents work longer and coordinate across sessions
One of the biggest constraints currently facing AI builders who want to deploy agents in service of their individual or enterprise goals is the /'working memory/' required to manage complex, multi-stage...
The era of agentic AI demands a data constitution, not better prompts
The industry consensus is that 2026 will be the year of /'agentic AI./' We are rapidly moving past chatbots that simply summarize text. We are entering the era of autonomous agents that execute tasks. W...
Everything in voice AI just changed: how enterprise AI builders can benefit
Despite lots of hype, /'voice AI/' has so far largely been a euphemism for a request-response loop. You speak, a cloud server transcribes your words, a language model thinks, and a robotic voice reads t...
Claude Cowork turns Claude from a chat tool into shared AI infrastructure
Claude Cowork is now available to more Claude users, alongside new updates aimed at team workflows. Anthropic made Claude Cowork accessible to users on Team and Enterprise plans, and it brings the pla...
MIT’s new ‘recursive’ framework lets LLMs process 10 million tokens without context rot
Recursive language models (RLMs) are an inference technique developed by researchers at MIT CSAIL that treat long prompts as an external environment to the model. Instead of forcing the entire prompt...
Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025)
Every year, NeurIPS produces hundreds of impressive papers, and a handful that subtly reset how practitioners think about scaling, evaluation and system design. In 2025, the most consequential works w...