🏷️

orchestration

32 articles tagged with "orchestration"

Tech Feeds

other Mar 6, 2026

Google Workspace CLI brings Gmail, Docs, Sheets and more into a common interface for AI agents

What's old is new: the command line — the original, clunky non-graphical interface for interacting with and controlling PCs, where the user just typed in raw commands in code — has become one of the m...

building-ai-products Mar 6, 2026

Google PM open-sources Always On Memory Agent, ditching vector databases for LLM-driven persistent memory

Google senior AI product manager Shubham Saboo has turned one of the thorniest problems in agent design into an open-source engineering exercise: persistent memory. This week, he published an open-sou...

ai-agents-tool-use Mar 6, 2026

LangChain's CEO argues that better models alone won't get your AI agent to production

As models get smarter and more capable, the /'harnesses/' around them must also evolve. This /'harness engineering/' is an extension of context engineering, says LangChain co-founder and CEO Harrison C...

building-ai-products Mar 6, 2026

New KV cache compaction technique cuts LLM memory 50x without accuracy loss

Enterprise AI applications that handle large documents or long-horizon tasks face a severe memory bottleneck. As the context grows longer, so does the KV cache, the area where the model’s working memo...

other Feb 27, 2026

Enterprise MCP adoption is outpacing security controls

AI agents now carry more access and more connections to enterprise systems than any other software in the environment. That makes them a bigger attack surface than anything security teams have had to...

building-ai-products Feb 27, 2026

OpenAI's big investment from AWS comes with something else: new 'stateful' architecture for enterprise agents

The landscape of enterprise artificial intelligence shifted fundamentally today as OpenAI announced $110 billion in new funding from three of tech's largest firms: $30 billion from SoftBank, $30 billi...

building-ai-products Feb 26, 2026

8 billion tokens a day forced AT&T to rethink AI orchestration — and cut costs by 90%

When your average daily token usage is 8 billion a day, you have a massive scale problem. This was the case at AT&T, and chief data officer Andy Markus and his team recognized that it simply wasn’t...

building-ai-products Feb 25, 2026

Anthropic just released a mobile version of Claude Code called Remote Control

Claude Code has become increasingly popular in the first year since its launch, and especially in recent months, as developers and non-technical users alike flock to AI unicorn Anthropic's hit coding...

building-ai-products Feb 23, 2026

One engineer made a production SaaS product in an hour: here's the governance system that made it possible

Every engineering leader watching the agentic coding wave is eventually going to face the same question: if AI can generate production-quality code faster than any team, what does governance look like...

other Feb 22, 2026

Shadow mode, drift alerts and audit logs: Inside the modern audit loop

Traditional software governance often uses static compliance checklists, quarterly audits and after-the-fact reviews. But this method can't keep up with AI systems that change in real time. A machine...

ai-agents-tool-use Feb 20, 2026

Runlayer is now offering secure OpenClaw agentic capabilities for large enterprises

OpenClaw, the open source AI agent that excels at autonomous tasks on computers and which users can communicate with through popular messaging apps, has undoubtedly become a phenomena since its launch...

ai-agents-tool-use Feb 18, 2026

New agent framework matches human-engineered AI systems — and adds zero inference cost to deploy

Agents built on top of today's models often break with simple changes — a new library, a workflow modification — and require a human engineer to fix it. That's one of the most persistent challenges in...

other Feb 15, 2026

Nvidia, Groq and the limestone race to real-time AI: Why enterprises win or lose here

From miles away across the desert, the Great Pyramid looks like a perfect, smooth geometry — a sleek triangle pointing to the stars. Stand at the base, however, and the illusion of smoothness vanishe...

ai-agents-tool-use Feb 13, 2026

AI agents turned Super Bowl viewers into one high-IQ team — now imagine this in the enterprise

The average Fortune 1000 company has more than 30,000 employees and engineering, sales and marketing teams with hundreds of members. Equally large teams exist in government, science and defense organi...

building-ai-products Feb 12, 2026

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

Researchers at Nvidia have developed a technique that can reduce the memory costs of large language model reasoning by up to eight times. Their technique, called dynamic memory sparsification (DMS), c...

building-ai-products Feb 11, 2026

Anthropic’s Claude Cowork finally lands on Windows — and it wants to automate your workday

Anthropic released its Claude Cowork AI agent software for Windows on Monday, bringing the file management and task automation tool to roughly 70 percent of the desktop computing market and intensifyi...

building-ai-products Feb 11, 2026

MIT's new fine-tuning method lets LLMs learn new skills without losing old ones

When enterprises fine-tune LLMs for new tasks, they risk breaking everything the models already know. This forces companies to maintain separate models for every skill. Researchers at MIT, the Improba...

ai-agents-tool-use Feb 10, 2026

OpenAI upgrades its Responses API to support agent skills and a complete terminal shell

Until recently, the practice of building AI agents has been a bit like training a long-distance runner with a thirty-second memory. Yes, you could give your AI models tools and instructions, but afte...

building-ai-products Feb 6, 2026

What the OpenClaw moment means for enterprises: 5 big takeaways

The /'OpenClaw moment/' represents the first time autonomous AI agents have successfully /'escaped the lab/' and moved into the hands of the general workforce. Originally developed by Austrian engineer P...

ai-assisted-coding Feb 4, 2026

Kilo CLI 1.0 brings open source vibe coding to your terminal with support for 500+ models

Remote-first AI coding startup Kilo doesn't think software developers should have to pledge their undying allegiance to any one development environment — and certainly not any one model or harness. T...

building-ai-products Feb 4, 2026

Mistral drops Voxtral Transcribe 2, an open-source speech model that runs on-device for pennies

Mistral AI, the Paris-based startup positioning itself as Europe's answer to OpenAI, released a pair of speech-to-text models on Wednesday that the company says can transcribe audio faster, more accur...

building-ai-products Feb 4, 2026

The ‘brownie recipe problem’: why LLMs must have fine-grained context to deliver real-time results

Today’s LLMs excel at reasoning, but can still struggle with context. This is particularly true in real-time ordering systems like Instacart. Instacart CTO Anirban Kundu calls it the /'brownie recipe...

building-ai-products Feb 2, 2026

Shared memory is the missing layer in AI orchestration

The key to successful AI agents within an enterprise? Shared memory and context. This, according to Asana CPO Arnab Bose, provides detailed history and direct access from the get-go — with guardrail...

dev-tooling-dx Feb 1, 2026

Enterprises are measuring the wrong part of RAG

Enterprises have moved quickly to adopt RAG to ground LLMs in proprietary data. In practice, however, many organizations are discovering that retrieval is no longer a feature bolted onto model inferen...

building-ai-products Jan 31, 2026

Most RAG systems don’t understand sophisticated documents — they shred them

By now, many enterprises have deployed some form of RAG. The promise is seductive: index your PDFs, connect an LLM and instantly democratize your corporate knowledge. But for industries dependent on h...

ai-agents-tool-use Jan 30, 2026

AI models that simulate internal debate dramatically improve accuracy on complex tasks

A new study by Google suggests that advanced reasoning models achieve high performance by simulating multi-agent-like debates involving diverse perspectives, personality traits, and domain expertise....

ai-agents-tool-use Jan 26, 2026

Claude Code's 'Tasks' update lets agents work longer and coordinate across sessions

One of the biggest constraints currently facing AI builders who want to deploy agents in service of their individual or enterprise goals is the /'working memory/' required to manage complex, multi-stage...

ai-agents-tool-use Jan 25, 2026

The era of agentic AI demands a data constitution, not better prompts

The industry consensus is that 2026 will be the year of /'agentic AI./' We are rapidly moving past chatbots that simply summarize text. We are entering the era of autonomous agents that execute tasks. W...

building-ai-products Jan 23, 2026

Everything in voice AI just changed: how enterprise AI builders can benefit

Despite lots of hype, /'voice AI/' has so far largely been a euphemism for a request-response loop. You speak, a cloud server transcribes your words, a language model thinks, and a robotic voice reads t...

building-ai-products Jan 23, 2026

Claude Cowork turns Claude from a chat tool into shared AI infrastructure

Claude Cowork is now available to more Claude users, alongside new updates aimed at team workflows. Anthropic made Claude Cowork accessible to users on Team and Enterprise plans, and it brings the pla...

building-ai-products Jan 20, 2026

MIT’s new ‘recursive’ framework lets LLMs process 10 million tokens without context rot

Recursive language models (RLMs) are an inference technique developed by researchers at MIT CSAIL that treat long prompts as an external environment to the model. Instead of forcing the entire prompt...

prompt-engineering-evals Jan 17, 2026

Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025)

Every year, NeurIPS produces hundreds of impressive papers, and a handful that subtly reset how practitioners think about scaling, evaluation and system design. In 2025, the most consequential works w...