Architecture document

Byron on the GAI Platform

How Byron fits inside Northern Trust's GAI (Generative AI) platform — from identity at the bottom to mini-apps at the top — and how it unifies digital coworkers, business-unit agents, and ASTAC Copilot Studio agents into one governed surface.

0. Document purpose

This document is the architectural reference for the Byron@NT instance. It is intended for the Emerging Technologies team, Cloud Architecture, Information Security, and the Architecture Review Board. Where it overlaps with existing GAI platform standards (identity, secrets, observability, model gateway), GAI standards take precedence and Byron inherits them.

1. GAI is the substrate

The GAI platform is Northern Trust's enterprise generative-AI substrate. It provides: a model gateway with private routing to Anthropic, OpenAI (Azure), Google (Vertex), and NT-hosted open-weight models; an identity layer rooted in Entra ID and federated via Okta; a secrets vault; a shared eval harness; and an observability pipeline that lands in the enterprise SIEM. Byron consumes GAI rather than reimplementing any of these primitives.

Practically: Byron does not hold model keys directly. Every model call routes through the GAI gateway with the partner's identity, the workspace ID, and a budget tag attached. This is what lets Risk & Compliance get a single pane of attribution for AI usage across Byron 3.0, ASTAC, and any future intelligence initiative.

2. Layered architecture

From top to bottom:

Experience       Byron chat · Mini-apps · Embeds (Outlook, Teams, Salesforce, ServiceNow)
Orchestration    Intent router · Agent registry · Tool selection · Guardrails
Agents           ARTY · FARA · LARA · CORA · BU agents · ASTAC bridge
Models (BYOM)    Claude 4.6 · GPT-4.1 · Gemini 2.5 · NT-hosted Llama 3.1 70B
Retrieval        Hybrid RAG · Vector store · Workspace-scoped re-ranking
Data layer       MCP servers · REST/GraphQL clients · JDBC · SFTP gateways
GAI platform     Identity · Secrets · Model gateway · Telemetry · Eval · Cost
Systems          Salesforce FSC · GFAS · Eagle · Aladdin · ServiceNow · SharePoint · Workday

3. Data integration model

3.1 MCP servers

MCP (Model Context Protocol) servers are the preferred integration shape for any tool that an agent should be able to call on demand. We use MCP for Bloomberg and FactSet today, with Aladdin and Eagle on the near-term roadmap. Each MCP server is deployed in the NT tenancy, fronted by the GAI gateway, and emits structured tool-call telemetry.

3.2 REST & GraphQL APIs

Transactional systems (Salesforce FSC, ServiceNow, Workday) integrate via their first-party APIs through a typed client layer. Agents see these as named tools, not raw endpoints. Tool descriptions are reviewed by the owning BU before publication.

3.3 Snowflake (GAI data lake)

Certified data products in Snowflake are the source of truth for analytical queries. Byron issues parameterized SQL through a JDBC tool with row-level security enforced by Snowflake roles mapped to Entra groups. Free-form SQL is disabled by default.

3.4 Unstructured knowledge

SharePoint, Confluence, and OneDrive are ingested into a workspace-scoped vector store. Embeddings are computed inside the NT tenancy. Re-ranking is hybrid (BM25 + dense). Retrieval results are filtered by the requesting partner's ACL.

3.5 User connectors

Partners can wire personal connectors (their own Outlook, OneDrive, GitHub) through OAuth flows that scope tokens to the individual user. These never leak across workspaces.

4. Agents & digital coworkers

Three classes of agents share the same runtime:

Digital coworkers — ARTY, FARA, LARA, CORA — enterprise-known personas that any partner can converse with directly.
Business-unit agents — Wealth Management, Asset Management, Asset Servicing, C&IS, Risk, COO, HR — each scoped to that BU's tools, data, and personas.
Platform agents — the ASTAC bridge that federates agents authored in Microsoft Copilot Studio so they appear natively in Byron.

5. Orchestration & intent routing

The chat home is the single entry point. When a partner sends a message, the orchestrator runs intent classification across (a) registered mini-apps, (b) registered agents, and (c) the default general assistant. If a mini-app matches with high confidence, Byron offers a one-click handoff that pre-fills the mini-app with extracted parameters. Otherwise it routes to the best agent for the message, or answers directly.

The orchestrator is transparent. Every routing decision shows its reasoning in the conversation so partners learn the system, and so Risk & Compliance can audit why a given agent was chosen.

6. BYOM & model routing

Each workspace pins a default model. Each agent can override it. Each conversation can override that. The model gateway in GAI enforces region (EU prompts stay in EU regions, etc.), egress policy (no prompts to providers outside the approved list), and cost ceiling. Sensitive-classification prompts route to NT-hosted Llama by default.

7. Governance, audit & cost

Every prompt, every tool invocation, every document retrieved, every model response, and every cost-bearing token is captured with the partner identity, workspace, agent, and correlation ID. The stream lands in the enterprise SIEM and in a Snowflake audit schema available to Risk & Compliance. Per-user, per-workspace, and per-BU budgets are enforced in real time by the GAI gateway.

8. Rollout roadmap

Q3 2026   GA for Asset Servicing (FARA + Fund Accounting mini-app), Risk (LARA)
Q4 2026   Wealth Management (CORA + Review Book), C&IS (RFP assistant), ASTAC bridge GA
Q1 2027   Asset Management agent, Aladdin MCP server, NT-hosted Llama default for restricted classifications
Q2 2027   Embedded surfaces in Outlook + Salesforce, partner-built mini-apps via Cowork

Ready to see it?

Open the demo environment and try the orchestrator.

Enter demo →