The DreamFactory AI Academy teaches you to connect ChatGPT, Claude, Cursor, or any AI agent to enterprise databases, with full security, RBAC, identity passthrough, and audit logging built in. Self-hosted. Your data never leaves your infrastructure.
The AI Academy teaches you to combine these layers into a single governed workflow. No new infrastructure, just new capability on top of DreamFactory.
Data stays put. Governance travels with every request. And you go from question to answer in minutes, not quarters. REST APIs, not SQL. Identity passthrough, not service accounts. Your security infrastructure, not ours.
Every DreamFactory API (every database connection, every stored procedure, every RBAC role) is a building block for AI. The gap isn't infrastructure. It's knowledge.
We built the AI Academy to bridge that gap. Whether you're already running DreamFactory in production or exploring it for the first time, you'll learn how to turn any database into a secure, AI-ready data source in minutes, not months.
With DreamFactory's native MCP Server, your APIs can be consumed by AI tools like ChatGPT, Claude, Cursor, and local LLMs, with full RBAC and identity passthrough intact. The Academy teaches you how.
"DreamFactory has allowed us to remove three or four layers of complexity out of handling the AI." Enterprise AI Implementation Team
AI tools like ChatGPT, Claude, or Cursor can connect directly to your DreamFactory instance via MCP. No new infrastructure, just a config pointing to your existing endpoints.
Identity passthrough means AI queries run as the actual user, not a shared service account. Your RBAC, stored procedure boundaries, and audit logs apply automatically. Research shows all five tested LLM applications were vulnerable to prompt-to-SQL injection. DreamFactory's parameterized queries prevent this by design.
One team eliminated $250K+ in vendor engagements and reduced 140MB of legacy code to kilobytes. Building your own LLM data layer costs $1.5M+ and takes 12-18 months. DreamFactory collapses that to days. AI responses return in 4-16 seconds.
RAG accuracy for structured enterprise data falls below 60%. Deterministic API queries through DreamFactory achieve 97%. No vector databases, no stale embeddings, no data duplication. Just real-time, governed access to your live data.
DreamFactory CTO Kevin McGahey connects ChatGPT to a live sales database through DreamFactory's MCP Server in under 5 minutes. No code. No SQL. Just plain English questions.
Each module maps to a real engineering blog post and a hands-on project. Build AI-powered features on your own DreamFactory instance, or follow along with a free trial.
Connect ChatGPT, Claude, Cursor, or any AI agent to your enterprise database through DreamFactory's AI Data Gateway, with full access controls and audit logging built in. Zero code. Zero SQL. Just questions and answers.
Configure DreamFactory's native MCP Server (v1.1.0) so AI tools like ChatGPT, Claude, or Cursor can discover and query your databases, with OAuth 2.0 + PKCE authentication and full RBAC.
Understand DreamFactory's layered security model: Role-Based Access Control, service-level permissions, table and field restrictions, and API key management. This is the foundation that governs every AI interaction.
Learn why letting AI generate raw SQL is dangerous and how stored procedures create security contracts that make AI data access predictable, auditable, and safe.
When an AI queries data on behalf of a user, who runs the query? Learn to implement identity passthrough so AI respects user permissions and generates meaningful audit trails.
Understand how REST APIs work under the hood, from Node.js/Express patterns to DreamFactory's zero-code generation, and how to design endpoints that AI agents consume efficiently.
Great API docs aren't just for humans anymore. Learn to generate documentation that both developers and AI agents can parse, discover, and act on reliably.
Walk through a real production implementation: connect a self-hosted LLM to SQL Server through DreamFactory's AI Data Gateway to generate AI-powered data summaries for any use case.
Real-world examples showing how every department uses DreamFactory's AI Gateway to drive revenue, cut costs, and make better decisions with natural language.
AI-driven lead generation, upsell intelligence, and competitive analysis
6 use casesCollections prioritization, three-way matching, month-end close acceleration
6 use casesNatural language production queries, demand forecasting, OEE optimization
5 use casesEmployee evaluations in seconds, certification tracking, workforce planning
4 use casesLegacy system AI enablement, multi-system unification, API standardization
3 use casesNatural language dashboards, defect analytics, cross-division comparison
4 use casesOrganized by department and by industry. Find use cases built for your world.
Copy-paste prompts optimized for each DreamFactory capability. Use with Claude, ChatGPT, Cursor, or any MCP-connected AI assistant.
When a new team gets DreamFactory access, their first question is always 'what data can I reach?' This maps every connected service in one call. IT teams use it during onboarding. Executives use it to understand the scope of AI-accessible data across the organization.
Before building any query or pipeline, you need to understand the schema. This reveals table relationships, column types, and row counts, replacing the meeting with a DBA. Essential whenever someone says 'I think that data is in the ERP somewhere.'
For the moment when users describe data by business name ('customer complaints', 'vendor payments') but nobody knows which of the 12 connected databases holds it. Searches every system simultaneously.
The workhorse query for every department. Sales uses it for account lookups. Finance uses it for AR aging. Operations uses it for production data. The 1,000 record cap ensures AI never tries to download your entire database.
When finance needs quarterly revenue summaries or operations needs production totals by shift, this runs server-side aggregation so AI processes summary numbers, not raw rows. Critical for performance with large datasets.
The foundation of every serious AI pipeline. Stored procedures give you deterministic, auditable, pre-approved queries that AI cannot modify. This is how a DreamFactory customer generates 8,000 employee evaluations in 16 seconds.
When AI pipelines need to read configuration files, process uploaded documents, or access reports stored in cloud storage. Distribution companies use this for EDI file processing. Manufacturing uses it for PLC data exports.
After mergers and acquisitions, companies need to inventory data across dozens of systems. This gives a complete picture of every table in every connected database, which is the first step in any data unification project.
Before AI can use stored procedures, someone needs to know what's available. This discovers every procedure across your databases, which becomes the menu of safe, pre-approved operations AI can execute in production pipelines.
The complete picture: every service, every database, every table, every procedure. CIOs use this during digital transformation planning. Auditors use it for compliance reviews. It's your entire data estate in one query chain.
The classic sales intelligence query. Account managers pull this to prep for client calls. Sales leadership uses it for pipeline reviews. Works against any CRM or ERP database connected through DreamFactory.
The executive's first 'aha' moment with AI. A CFO or CEO asks a plain-English question and gets an answer in seconds instead of requesting a report that takes days. This is typically the demo that sells AI projects internally.
Quarter-end reporting in seconds instead of hours. Finance teams use this to validate revenue numbers. Executives use it for board prep. The GROUP BY runs server-side, so AI gets summary data, not millions of raw records.
Quality teams in manufacturing use this to catch production anomalies. Healthcare uses it for patient safety outlier detection. The two-step pattern (aggregate for stats, then query for outliers) is a core QA workflow.
Production-grade query execution. When the query is too important to leave to AI's judgment, a stored procedure ensures the exact same logic runs every time. This is the backbone of the employee evaluation pipeline and every finance automation.
The write-back that makes AI pipelines real. AI doesn't just read data; it creates records. Sales pipelines write leads back to CRM. Finance pipelines write match records to ERP. HR pipelines write evaluation drafts to HRIS.
After acquisitions, data lives in multiple systems with no single source of truth. This pattern queries both, compares results, and finds discrepancies. Essential for data migration validation and ongoing reconciliation.
Legacy databases and NoSQL systems often lack proper foreign key relationships. Virtual FKs let AI follow data relationships that exist logically but not physically in the schema. Critical for legacy system AI enablement.
Before any pipeline write-back step, AI needs to know the exact column types and constraints to format data correctly. This prevents write failures from type mismatches or null violations.
When AI needs to understand the full query syntax available for a service. Particularly useful for technology teams building custom integrations or documenting their AI pipeline architecture.
Operations teams scanning for new data files on SFTP servers. Distribution companies checking for incoming EDI files. Manufacturing teams looking for PLC data exports. The first step in any file-based data pipeline.
AI reads and summarizes files without anyone downloading, opening, or reformatting them. Finance uses it for bank statement processing. Operations for production reports. The auto-detection handles CSV, JSON, and text natively.
The classic integration pattern: read a file, transform the data, write it to a database. This replaces manual data entry and custom ETL scripts. Distribution companies use it for EDI processing. Manufacturing for supplier data imports.
The output step of many AI pipelines. Finance writes exception reports to S3. HR writes evaluation summaries to shared drives. Sales writes lead lists for CRM import. Every department has a 'generate and save' workflow.
Compliance teams comparing this month's regulatory report against last month's. Financial services auditing quarterly filings. Healthcare reviewing patient outcome trends across reporting periods.
DevOps and IT teams managing application configuration across environments. Read a config, modify a setting, write it back. Particularly useful for managing DreamFactory service configurations programmatically.
Processing dozens of incoming files in one pass. Distribution companies handling daily EDI batch files. Manufacturing processing shift reports from multiple facilities. AI iterates through a directory and extracts insights from each file.
The output half of every reporting pipeline. Query the database, format results, save to cloud storage. Finance exports reconciliation reports. Sales exports CRM extracts. Operations exports production summaries.
After processing, old files need cleanup. Role-based permissions ensure only authorized users can delete files. Audit logging tracks every deletion. Essential for regulated industries with data retention policies.
Manufacturing quality teams analyzing images of product defects. Healthcare reviewing medical imaging metadata. AI receives images and audio as base64 for analysis without downloading files to local machines.
The first security step for any AI deployment. Sales gets read access to CRM tables. Finance gets access to AR data. The shop floor gets production data only. Each role sees exactly what it should and nothing more.
The enterprise security requirement: every AI query runs under the real user's database credentials, not a shared service account. This preserves existing database-level audit trails and permission models.
Territory-based data isolation for sales teams. Department-scoped access for HR. Region-restricted views for operations. Server-side WHERE clauses that AI cannot bypass or override, enforced on every request.
Setting up the secure connection between AI tools and DreamFactory. OAuth 2.0 + PKCE ensures tokens expire and refresh automatically. Critical for ChatGPT and Claude Desktop integrations.
Each AI tool gets its own API key with its own permissions. Claude gets one key, ChatGPT gets another, Cursor gets a third. If one key is compromised, only that tool's access is affected.
HIPAA and SOC 2 compliance requirement: mask SSN, salary, patient IDs, and other sensitive fields so AI can query employee or customer data without ever seeing PII. Essential for healthcare and financial services.
Enterprise SSO integration: your existing Azure AD groups automatically map to DreamFactory roles. When a new employee joins the Sales group, they immediately get the right AI data access without manual provisioning.
The write-safety pattern: AI can EXECUTE stored procedures but cannot directly INSERT, UPDATE, or DELETE from tables. All write operations go through pre-approved, tested code paths. This is how production AI pipelines stay safe.
For organizations using Okta, Auth0, or Azure AD as their identity provider. Tokens are validated server-side using the provider's public keys. No shared secrets to manage or rotate.
Token economics and cost control: limit how many requests each AI tool can make per minute. Prevents runaway AI queries from overwhelming your database. Different limits for different roles (admin vs. standard user).
Data teams exploring their Snowflake warehouse structure before building analytics pipelines. See every schema, table, and column type. Particularly useful when onboarding new analysts or auditing data warehouse coverage.
Business analysts querying Snowflake through natural language instead of writing SQL. The 500-record limit and server-side filtering ensure AI gets precisely the data it needs without pulling entire tables.
Revenue reporting, customer analytics, and operational metrics computed server-side in Snowflake. No raw data transferred. AI receives summary numbers and generates insights from aggregated results.
Production-grade Snowflake queries wrapped in stored procedures for repeatability and audit compliance. Finance teams use these for month-end reporting. Data teams use them for ETL validation.
The common enterprise pattern: analytics live in Snowflake but operational data lives in SQL Server or Oracle. AI queries both through DreamFactory and joins the results, creating a unified view without data migration.
Complex analytical workflows: first aggregate to identify top customers, then drill into their order patterns. AI chains multiple queries to build layered insights, mimicking the analytical process a human analyst would follow.
Inventorying available Snowflake procedures before building AI pipelines. Particularly useful for new team members learning what pre-built analytics and reporting procedures are available.
AI discovers the data model first, then builds reports intelligently. This two-step pattern ensures AI queries are schema-aware, reducing errors and producing more accurate reports.
DreamFactory adds an access control layer on top of Snowflake's native permissions. Analytics teams see aggregated data. Executives see everything. Individual contributors see only their department's data.
Writing AI-generated results back to Snowflake: processed scores, aggregated metrics, or AI classifications. Completes the pipeline loop where AI reads from Snowflake, processes data, and writes results back.
The production pattern from the employee evaluation use case: stored procedure collects data, DreamFactory serves it, Python script sends to LLM, returns AI summary. This is how a DreamFactory customer processes 8,000 evaluations in 16 seconds.
Catch bad data before it reaches your database. Validate required fields, check format consistency, enforce business rules. Every write-back step in an AI pipeline should have input validation as a safety net.
The alert mechanism for AI pipelines. When a new high-priority AR record is created, Slack the collections team. When a defect pattern is detected, alert the quality manager. Event-driven notifications without polling.
Regulatory compliance requirement for healthcare (HIPAA), financial services (SOC 2), and government (FedRAMP). Every data change captured with old values, new values, user identity, and timestamp.
Reshape data for specific consumers. Format Oracle date fields for a React frontend. Convert ERP codes to human-readable labels for AI consumption. Transform XML responses to JSON for modern applications.
The multi-source pipeline pattern: pull internal data, call an external API for enrichment (credit scores, weather data, market signals), merge the results, and save. This powers lead scoring and risk assessment workflows.
Build your own AI pipeline as a single API endpoint. Database query, prompt construction, LLM call, and response formatting all wrapped in one scripted service. This is the pattern behind the executive briefing generator.
Keep databases in sync across systems after mergers or integrations. The urllib timeout pattern prevents PHP-FPM worker pool deadlock, which is critical for production reliability.
The output step for executive dashboards, compliance reports, and operational summaries. Query data, format as PDF/CSV/JSON, upload to cloud storage. Automate the 'generate and distribute' workflow.
The compliance safety net: even if RBAC allows access to a table, this script strips sensitive fields before they reach AI. Critical for healthcare patient data, financial PII, and government personnel records.
Wrap any external API with DreamFactory security: RBAC, rate limiting, and audit logging. Your team uses one security model for both internal databases and external APIs.
Connect to SaaS platforms (Salesforce, HubSpot, Stripe) through DreamFactory, which handles token refresh automatically. No more expired token errors in production pipelines.
Secure internal API access through DreamFactory roles. Engineering gets full access. Sales gets read-only. Partners get a subset. One gateway, multiple audience-specific access patterns.
Add correlation IDs, auth tokens, or tracing headers to every proxied request automatically. Essential for distributed systems observability and debugging production issues.
The enrichment pattern: combine external API data (weather, market signals, credit scores) with internal database data in one response. This powers the lead targeting and risk scoring use cases.
Route AI model calls through DreamFactory for cost control and token tracking. Rate limit per department. Track token usage per role. The model routing pattern described in the executive dashboard use case.
Unify access to multiple external APIs under one security layer. Instead of managing credentials for each API separately, DreamFactory handles all authentication and access control centrally.
Auto-generate documentation for your proxied APIs so AI agents can discover endpoints and parameters. The more AI knows about available APIs, the smarter its query planning becomes.
Debug API integrations by logging every request and response. Monitor external API reliability. Track usage patterns. Essential during pipeline development and ongoing production monitoring.
Production monitoring for API health. Track response times, error rates, and throughput across all proxied services. Detect degradation before it impacts AI pipeline performance.
A production pattern used by enterprise teams, achieving 4-16 second response times while eliminating layers of vendor complexity.
"We discovered our 14 billion parameter model actually outperformed our 120 billion parameter model for summarization tasks. The smaller model focused on language and context without overthinking."
Lesson: Always benchmark your specific use case. Bigger isn't always better, and smaller models mean faster responses and lower infrastructure costs.
Note: This is the scripted-service orchestration pattern (Module 08). Alternatively, AI agents can call stored procedures directly via MCP using {db}_call_stored_procedure (see Modules 01 to 04).
Vector databases and embeddings work for unstructured documents, but your structured enterprise data deserves deterministic, real-time access.
Source: The API-First Alternative to RAG for Structured Data
Every Academy module is backed by a deep-dive blog post from the DreamFactory engineering team. Start here.
CTO Kevin McGahey connects ChatGPT to a live sales database through DreamFactory's MCP Server in under 5 minutes. No code, no SQL. Just plain English questions and instant answers.
Watch video →Connect Anthropic's Claude to your enterprise data through DreamFactory. Real-time analytics, trend analysis, and root cause exploration, all through natural language conversation.
Read article →How to give AI agents enough access to unlock business value without compromising privacy, compliance, or control. A 7-step repeatable governance workflow for enterprise teams.
Read article →Building your own costs $1.5M+ and takes 12-18 months. One Fortune 500 integrated 50 data sources in 2 weeks. A clear-eyed analysis of build vs. buy for AI data infrastructure.
Read article →All five tested LLM applications were vulnerable to prompt-to-SQL injection. How API abstraction, parameterized queries, and field-level masking prevent the attacks that keep CISOs up at night.
Read article →Step-by-step guide to configuring DreamFactory's native MCP Server. Connect ChatGPT, Claude, or any MCP client with OAuth 2.0 authentication. Most teams finish in under 10 minutes.
Read article →RAG accuracy for structured data falls below 60%. Deterministic API queries achieve 97%. Why the API-first approach eliminates vector databases, reduces costs 70-80%, and cuts escalations by 90%.
Read article →A 5-layer security framework for AI database access: governed REST APIs, identity passthrough with RBAC, deterministic queries, rate limiting, and MCP for local LLMs.
Read article →A real integration story: connecting a self-hosted AI model to SQL Server via DreamFactory scripted services. Covers timeouts, auth issues, and production patterns.
Read article →Why letting AI generate ad-hoc SQL is dangerous, and how stored procedures create the security contracts AI agents need for safe, auditable enterprise data access.
Read article →When AI queries data, who runs the query? Why shared service accounts break security, and how token forwarding keeps user identity intact through the entire pipeline.
Read article →The release that made DreamFactory natively AI-ready. Native MCP Server support, Azure AD group-to-role mapping, and critical security hardening.
Read article →Complete reference for MCP server setup, 22 tool patterns, OAuth 2.0 authentication, custom login pages, and FAQ.
Read docs →We're giving away 10 DGX Sparks to help enterprises build local AI connected to their data. New and existing customers are eligible. First 10 to commit get one.
First come, first served. Book a call to discuss your AI project and DGX Spark eligibility, or DM Terence Bennett on LinkedIn.
Your DreamFactory instance is already AI-ready. Connect any AI agent to your database in about 5 minutes, then let anyone in your organization ask questions in plain English. The Academy shows you how, from first query to production AI pipelines.