Fleet command
runs the mission
- The Conductor
conductor - The Librarian
librarian
A fleet of 23 specialized agents documents your legacy estate, designs the target, then builds, tests and operates it on Fabric or Databricks. The loop runs until the tests pass. Your experts sign off at every gate.
Modernizing a Microsoft data estate still mostly happens by hand. Someone reads through hundreds of pipelines and stored procedures, then rebuilds them on Fabric or Databricks. It's slow, and most of what makes it work lives in a few people's heads. Worse, the hardest components tend to surface right before the deadline, exactly when they cost the most to fix.
The fleet inventories your assets, scores their complexity and documents what they actually do, then builds, tests and runs the result on the platform you choose. Same migration, re-engineered: a loop that corrects itself instead of a backlog that keeps growing.
The payoff: you start faster, plan with real numbers, spend less, and your team doesn't fall apart when one expert leaves. On Fabric or Databricks.
Document → Architect → Build → Test → Operate. Agents iterate autonomously inside the build-test-run loop; your experts approve the three gates that matter.
Documenters reconstruct the functional behavior of every legacy asset into the knowledge base; the Surveyor scores complexity and prioritizes the backlog.
The Architect turns the knowledge base into a Target Design Spec, applying your naming, error-handling and orchestration standards.
Builders generate the assets for the chosen platform and flavor.
Test agents and the Reconciler verify the generated assets against the documented legacy behavior.
Operators deploy and run in DEV; failures go back to the builders and the loop repeats until the suite is green.
This is the headline feature. The Operator deploys to DEV and runs the pipelines for real. The Test agents and the Reconciler flag what doesn't match the documented legacy behavior. The builders patch it, and the loop runs again. It repeats autonomously until the suite is green, and every iteration is logged in the knowledge base.
Green on iteration 3: patched by the builder, verified by the Reconciler
Narrow agents beat one general model: each has one job, one input contract, one output contract. The Conductor sequences them; the knowledge base is their shared memory.
Every Documenter writes structured functional specs into one shared, versioned knowledge base, and everyone downstream reads from it: the Architect, the builders, the Test agents, the Operators. So before a single asset moves, you already have something most estates never had: honest, current documentation of what your pipelines actually do. That alone kills key-person risk.
Control flow, data flow, run semantics and SQL logic, reconstructed from the native definition.
Source-to-target lineage across staging hops, SCD patterns and surrogate keys.
Script logic, dynamic SQL and unsupported connectors flagged with a weighted score.
The Librarian curates and versions every entry. The knowledge base is the customer's.
# kb/assets/dwh_load_dimcustomer.yaml asset: DWH_Load_DimCustomer source: tool: ssis package: DimCustomer.dtsx documented_by: ssis-documenter behavior: pattern: scd_type_2 control_flow: - { task: TRN Staging, type: execute_sql } - { task: DFT Load, type: data_flow, components: 7 } - { task: UPS Dimension, type: execute_sql } run_semantics: incremental, watermark ModifiedDate lineage: [SRC_CRM.Customer, STG.Customer, DWH.DimCustomer] risk: score: 0.62 drivers: [script_task, dynamic_sql] targets: fabric-dbt: models/marts/dim_customer.sql status: stage: test iteration: 3 last_fix: "row-count delta in SCD backfill, patched by fabric-dbt-builder" gates: assessment: approved design: approved promotion: pending
A lift-and-shift hands you working pipelines and not much else. The fleet hands you the platform plus a living layer around it: source-to-target lineage, a data dictionary, navigable transformations and the tests that guard them. It's generated as the build happens, so it's accurate on day one, and it's yours to keep.
A sample: every model traced from source system to final mart, the whole estate as one graph.
The fleet documents once (the knowledge base doesn't care where you're going) and builds for the platform and flavor you choose.
dbt For teams standardizing on dbt, the open-source standard that's free to run. The builders generate a full dbt project, your layers, your macros, your tests.
your framework For teams that built their own way of working over the years. The builders generate into your existing patterns, naming, error handling and orchestration.
Migrate to Fabric →dbt For teams standardizing on dbt, the open-source standard that's free to run. The builders generate a full dbt project, your layers, your macros, your tests.
your framework For teams that built their own way of working over the years. The builders generate into your existing patterns, naming, error handling and orchestration.
Migrate to Databricks →Not sure which? Compare Microsoft Fabric and Databricks →
dbt or your own framework? See when to choose which →
Nothing is a black box. The fleet emits structured, inspectable outputs at each control point, from estate scoring to a validation report with its iteration log. The three artifacts below are illustrative samples, not one customer's data.
Agents iterate autonomously inside the build-test-run loop in DEV. That's the whole point of the Operators. But three gates stay mandatory and human: this is where your experts hold the migration accountable.
Your experts sign off the knowledge base + complexity scorecard, then the Architect designs the target.
Your experts sign off the Target Design Spec, then the builders generate the assets.
Your experts sign off the validation report + iteration log, then promotion through your existing DevOps process.
Automated documentation replaces weeks of manual inventory, and you keep the docs.
Consistent, agent-scored complexity turns guesswork into a data-driven roadmap.
The build-test-run loop catches and fixes defects automatically, before UAT.
dbt or your framework: your conventions are the spec, not a casualty of the move.
Inspectable artifacts at every step: knowledge base, Target Design Spec, validation reports.
Codified intelligence removes key-person risk and keeps knowledge in the fleet.
Inside the build-test-run loop, yes. Operator agents deploy to DEV, run the pipelines, diagnose failures and route fixes back to the builders, then re-run, iterating until the tests pass, without waiting on a human. Around the loop, no. Your experts approve three gates: the assessment after documentation, the target design, and promotion toward production. Autonomous in the loop; accountable at the gates.
Seven legacy workloads: SSIS, Azure Data Factory, T-SQL (stored procedures, views and functions), and all four Azure Synapse workloads: pipelines, notebooks, dedicated SQL pools and serverless SQL. Each is read by a specialist Documenter that reconstructs what the asset actually does into the knowledge base.
Both are first-class targets, and the right answer depends on your estate and your strategy. Because the Document stage is target-agnostic, you can document first and decide with evidence. The Surveyor's complexity scorecard turns the question into numbers for your estate. We stay neutral until your data isn't. Compare Microsoft Fabric and Databricks →
Then the builders generate dbt: your layers, your macros, your tests, on Microsoft Fabric or Databricks. The dbt flavor is for teams standardizing on dbt, and the generated project is meant to look like one your team would have written.
It survives. The Architect encodes your conventions (naming, error handling, orchestration) into the Target Design Spec, and the builders generate into your existing patterns. That is the whole point of the your-framework flavor: the years you spent building your way of working are an asset to keep, not a casualty of the migration.
Structured functional specs of every documented asset: its behavior, lineage, risk score and migration status. It is yours. And it doubles as the documentation your estate never had: complete, current documentation of what your pipelines actually do, delivered even before a single asset moves.
That is what the loop is for. Test agents and the Reconciler compare generated assets against documented legacy behavior, Operators surface runtime failures, and the builders patch and re-run until the suite is green. Nothing passes the final gate without a validation report (including its iteration log) that your experts sign.
Through your existing DEV → ACC → PRD DevOps process. Agents iterate to a green build in DEV; the validation report then supports business sign-off before the assets promote through your own pipeline. The fleet fits your release process; it doesn't replace it.
It is part of the Target Design Spec, not an afterthought. Source permissions and identities are mapped to the target's native model: Unity Catalog catalogs, grants and lineage on Databricks, or workspace roles and OneLake security on Microsoft Fabric, with access governed through Microsoft Entra ID. The Architect records the intended grants at the design gate, so your security owners approve them before anything is built.
No. Your packages, pipelines and data are analyzed for your migration only. They are never used to train models, and your IP stays yours.
No procurement marathon, no big upfront commitment. We start small, prove the approach on your own estate, and only then scope the full migration.
A short conversation about your SSIS, ADF, Synapse or T-SQL landscape and what a successful move to Fabric or Databricks looks like for you.
The Surveyor documents a representative slice of your estate and hands back a complexity scorecard and a clear risk map.
A prioritized, data-driven roadmap with effort, risk and a predictable, gated path to your chosen platform.
Your packages and pipelines are analyzed for your migration, never used to train models.
Human sign-off is mandatory at assessment, design and promotion.
Assets promote through your existing DEV → ACC → PRD DevOps pipeline.
Tell us about your SSIS, ADF, Synapse or T-SQL estate and we'll share our approach, success stories, and how the agent fleet would apply to you.
Plan my migrationA short form, no spam. We usually reply within one business day.