🏷️

infrastructure

16 articles tagged with "infrastructure"

Tech Feeds

ai-agents-tool-use

How to make your e-commerce product visible to AI agents? Use this new system trusted by L’Oréal, Unilever, Mars & Beiersdorf

For future-focused e-commerce brands, the primary customer is rapidly changing from a person behind a screen to the AI agents that said human customer deploys on their behalf to research and, if proje...

other

Shadow mode, drift alerts and audit logs: Inside the modern audit loop

Traditional software governance often uses static compliance checklists, quarterly audits and after-the-fact reviews. But this method can't keep up with AI systems that change in real time. A machine...

other

Nvidia, Groq and the limestone race to real-time AI: Why enterprises win or lose here

From miles away across the desert, the Great Pyramid looks like a perfect, smooth geometry — a sleek triangle pointing to the stars. Stand at the base, however, and the illusion of smoothness vanishe...

ai-agents-tool-use

AI agents turned Super Bowl viewers into one high-IQ team — now imagine this in the enterprise

The average Fortune 1000 company has more than 30,000 employees and engineering, sales and marketing teams with hundreds of members. Equally large teams exist in government, science and defense organi...

building-ai-products

TTT-Discover optimizes GPU kernels 2x faster than human experts — by training during inference

Researchers from Stanford, Nvidia, and Together AI have developed a new technique that can discover new solutions to very complex problems. For example, they managed to optimize a critical GPU kernel...

building-ai-products

The ‘brownie recipe problem’: why LLMs must have fine-grained context to deliver real-time results

Today’s LLMs excel at reasoning, but can still struggle with context. This is particularly true in real-time ordering systems like Instacart. Instacart CTO Anirban Kundu calls it the /'brownie recipe...

building-ai-products

Shared memory is the missing layer in AI orchestration

The key to successful AI agents within an enterprise? Shared memory and context. This, according to Asana CPO Arnab Bose, provides detailed history and direct access from the get-go — with guardrail...

dev-tooling-dx

Enterprises are measuring the wrong part of RAG

Enterprises have moved quickly to adopt RAG to ground LLMs in proprietary data. In practice, however, many organizations are discovering that retrieval is no longer a feature bolted onto model inferen...

building-ai-products

Most RAG systems don’t understand sophisticated documents — they shred them

By now, many enterprises have deployed some form of RAG. The promise is seductive: index your PDFs, connect an LLM and instantly democratize your corporate knowledge. But for industries dependent on h...

open-source-drops

This tree search framework hits 98.7% on documents where vector search fails

A new open-source framework called PageIndex solves one of the old problems of retrieval-augmented generation (RAG): handling very long documents. The classic RAG workflow (chunk documents, calculate...

ai-agents-tool-use

The era of agentic AI demands a data constitution, not better prompts

The industry consensus is that 2026 will be the year of /'agentic AI./' We are rapidly moving past chatbots that simply summarize text. We are entering the era of autonomous agents that execute tasks. W...

prompt-engineering-evals

Why LinkedIn says prompting was a non-starter — and small models was the breakthrough

LinkedIn is a leader in AI recommender systems, having developed them over the last 15-plus years. But getting to a next-gen recommendation stack for the job-seekers of tomorrow required a whole new t...

dev-tooling-dx

How infrastructure outages in 2025 changed how businesses think about servers

In 2025, many companies learned a practical lesson about infrastructure reliability. What stood out was not that failures happened — outages have always existed — but how broadly and deeply their impa...

building-ai-products

Stop calling it 'The AI bubble': It's actually multiple bubbles, each with a different expiration date

It’s the question on everyone’s minds and lips: Are we in an AI bubble? It's the wrong question. The real question is: Which AI bubble are we in, and when will each one burst? The debate over whether...

prompt-engineering-evals

Why reinforcement learning plateaus without representation depth (and other key takeaways from NeurIPS 2025)

Every year, NeurIPS produces hundreds of impressive papers, and a handful that subtly reset how practitioners think about scaling, evaluation and system design. In 2025, the most consequential works w...

building-ai-products

When protections outlive their purpose: A lesson on managing defense systems at scale

User feedback led us to clean up outdated mitigations. See why observability and lifecycle management are critical for defense systems. The post When protections outlive their purpose: A lesson on man...