architecture
26 articles tagged with "architecture"
Tech Feeds
Your AI Agent Doesn't Need More Memory Tools. It Needs to Learn to Introspect.
The pattern I keep seeing in agent systems: The main agent has 12 memory-related tool calls available to it. It uses them inconsistently. Sometimes it searches, sometimes it forgets to. Sometimes it w...
How I Rebuilt My AI Decision Tool From a Summarizer Into a Constraint-Driven Arbitrator
A few weeks ago, I shipped a tool called Arbiter that takes a business decision, runs it through GPT-4o, and returns a structured analysis. The output looked impressive. Recommendation, confidence sco...
How We Finally Solved Test Discovery
How We Finally Solved Test Discovery Yesterday I wrote about why test file discovery is still unsolved. Three approaches (stem matching, content grepping, hybrid), each failing differently. The hybr...
I stopped trusting AI agents to βdo the right thingβ - so I built a governance system
I got tired of trusting AI agents. Every demo looks impressive. The agent completes tasks, calls tools, writes code and makes decisions. But under the surface thereβs an uncomfortable truth. You donβt...
Micro Frontends & The Hidden Code Sharing Problem
How teams go from copy-paste chaos β internal npm packages β a monorepo that finally makes sense. β‘ TL;DR β Micro frontends give teams autonomy, but create a hidden code sharing problem. Internal npm...
Node.js Event-Driven Architecture in Production: EventEmitter, Custom Buses, and Event Sourcing
Node.js Event-Driven Architecture in Production: EventEmitter, Custom Buses, and Event Sourcing Event-driven architecture isn't a trend β it's how Node.js was designed to work. The event loop, strea...
Full-Link Accountability for AI Agents
Core Event Primitives Four standard event types (J, D, V, T) cover the full accountability lifecycle: J: Judge β Create and initiate a judgment/decision D: Delegate β Transfer authority or assign...
How to Use HL7 FHIR API: Complete Healthcare Integration Guide (2026)
TL;DR HL7 FHIR (Fast Healthcare Interoperability Resources) is the modern standard for healthcare data exchange, using RESTful APIs with JSON/XML responses. It provides standardized resources for pa...
Seeing the problem: An Introduction to Separation of Concerns
Separation of concerns is one of the first topics that comes up when we want to move from writing code that simply works to writing code that is structured well. At first, I thought it only meant spli...
production-style infrastructure: ALB module infra-modules + infra-live separation environment-based deployment
ποΈ FINAL STRUCTURE terraform-production-lab/ β βββ infra-modules/ β βββ vpc/ β βββ security-group/ β βββ ec2/ β βββ alb/ β βββ infra-live/ βββ dev/ βββ prod/ infra-modules β reu...
Salesforce Email Sync: A Technical Guide for Developers and Admins
Salesforce Email Sync is one of those features that looks straightforward on the surface but has enough technical depth to trip up even experienced admins. If you're building on top of Salesforce or c...
The Death of Transient Memory: Engineering a Zero-Cost B2B LLM Edge Cluster
A functional local inference node is merely a prototype. An observable, stateful inference node is enterprise infrastructure. The current standard of wrapping quantized LLMs in basic FastAPI endpoints...
An Analogy to Help Understand Mixture of Experts
If you're having a hard time understanding MoE strength vs dense models, and roughly where they might land when comparing them, think about this super oversimplified analogy. I'm hoping it makes sense...
My Django Rapid Architecture short overview
The other day, I was browsing Reddit, and found an Architecture proposal for Django projects called βDjango Rapid Architectureβ. Itβs a small document with a few guidelines or principles. Iβm fond of...
6,000 AWS accounts, three people, one platform: Lessons learned
This post describes why ProGlove chose a account-per-tenant approach for our serverless SaaS architecture and how it changes the operational model. It covers the challenges you need to anticipate arou...
Understanding n8n from a System Design Perspective
Over the past few months, while researching how to build low-cost AI-driven systems as a solo developer, I started looking at n8n not as a no-code tool, but as an architectural component. This article...
Manticore Search on Microsoft Azure: DX1's Story
TL;DR: - DX1 uses Manticore Search for customer and parts search with a fast typeahead UX - Chosen for open-source licensing and speed - Deployed on Azure VMs running Ubuntu, aligned with DX1βs...
Announcing the AWS Digital Sovereignty Well-Architected Lens
As organizations accelerate cloud adoption, meeting digital sovereignty requirements has become essential to build trust with customers and regulators worldwide. The challenge isnβt whether to adopt t...
Apache Data Lakehouse Weekly: January 15β22, 2026
Get Data Lakehouse Books: Apache Iceberg: The Definitive Guide Apache Polaris: The Defintive Guide Architecting an Apache Iceberg Lakehouse The Apache Iceberg Digest: Vol. 1 Lakehouse Community: Join...
Decoupling the AI Stack: How to Architect a Production-Grade Local LLM System
From /'Localhost/' to /'On-Premise/': An open-source blueprint for building a privacy-first, scalable AI infrastructure with vLLM and LiteLLM. We are currently living in the /'Golden Age/' of Local AI. Tool...
[AWS] 5. High Availability and Scalability, Elastic Load Balancer (ELB), Auto Scaling Groups (ASG)
Scalability & High Availability Scalability means that an application / system can handle greater loads by adapting There are two kinds of scalability: Vertical Scalability Horizontal Scalability...
From Startup to Unicorn: A Blueprint for Secure Enterprise Architecture
This article presents a Modular Hybrid Architecture for secure enterprise applications, balancing speed and stability for startups. It emphasizes a Reactive Security Flow using Spring Boot, Redis, and JWT, while advocating for a multi-schema database strategy to optimize costs and maintain security.
Serverless MCP: Stateless Execution for Enterprise AI Tools
In the first two posts of this series, we explored why enterprise MCP needs compositional architecture and how to design skills that abstract complexity from the AI agent. But there's a question we ha...
How strong fundamentals + AI helped me build a data pipeline platform in 6 months (solo)
The article outlines how a solid foundation in software engineering principles, combined with AI tools like Claude, enabled the author to rapidly develop a data pipeline platform. Key insights include the importance of Infrastructure as Code, event-driven architecture, and continuous refactoring to maintain code quality and leverage AI effectively.