I helped a fintech company turn their multi-agent AI system from an architectural design into a working product. The platform is a portfolio management system where users describe what they need in plain language, and AI agents build and execute the entire data workflow automatically, like Zapier for financial data, but powered by LLMs.
Client: Fintech (PMS)
Portfolio managers and analysts working with digital assets need to monitor positions across multiple exchanges, track performance metrics like time-weighted returns, detect risk signals early, and get notified when something needs attention. Traditionally, each of these workflows requires a data engineer to build and maintain custom pipelines.
The client, a fintech company, had built a portfolio management system (PMS) and wanted to let users create these workflows through natural language. Not just ask questions about their data, but actually generate executable pipelines: scheduled alerts, interactive dashboards, automated reports, conditional notifications. Think Zapier or n8n, but for financial portfolio data, where the workflows are built by AI agents from a single conversational prompt.
The team had initially considered migrating to n8n as the workflow engine, but decided to build a custom platform instead because the domain requirements (financial entity hierarchies, typed data operations, real-time portfolio computations) were too specific for a generic workflow tool.
There's also a constraint that sets financial platforms apart from most AI applications: auditability. In finance, every action the platform takes must be reproducible and traceable for compliance and legal reasons. This is why the system generates deterministic workflows rather than answering questions on the fly. Once AI agents build a workflow, it runs on fixed rails through Dagster, with every step logged, every input and output recorded. This approach has a double advantage: it makes everything auditable, and it minimizes hallucination risk by grounding each execution step on structured, validated data rather than free-form LLM output.
The platform had been designed and partially implemented when I joined, but it was not yet functional. The architecture was in place, the agent roles were defined, but the system couldn't reliably produce working workflows. Prompts broke with unexpected inputs. Agents picked the wrong tools or generated code that didn't match the data structures coming from upstream steps. My job was to make it actually work and get it to production.
I came in as an AI consultant to take the system from designed-but-not-working to production. My work focused on the AI layer: making agents produce correct outputs, coordinate effectively, and handle real-world inputs reliably.