Home
Products

Intelligent Workflow Automation Platform
Explore FLIP

FLIP Navigation

Overview
Enterprise Workflow Automation Platform

Use Cases
Enterprise Use Cases Handled by FLIP

AI Workforce
Suite of Autonomous AI Agents

Security & Governance
Built for Compliance & Trust

Why FLIP
Why Choose FLIP

Pricing
Tiered Packages, Usage-based Fees

Calculate Your Migration ROI Now
Use Cases
AI-governed Reliable Data Flows & Invoice Processing

AP Automation
Eliminate manual invoice processing delays

DataOps
Automate data pipelines for faster delivery

Data Platform Migration
Migrate to modern data platforms faster

AI Invoice Processing
AI-powered invoice approvals with accuracy

Insurance Claims automation
Faster, accurate, end-to-end processing.

Trade Document Processing
Automated Trade Document Processing

Bank Statement Processing
Simplified Bank File Reconciliation

EDI Integration
Smart EDI Integration, Powered by AI

AI Agents
Autonomous AI Agents Built for You

Alan
AI legal summarizer that processes and condenses lengthy legal documents

Mike
AI quantitative proofreader that catches arithmetic errors

Susan
AI PII redactor that automatically removes sensitive information

Karl
Data insights agent that analyzes data and delivers quick insights

Ember
Automate customer service ops, resolve issues faster

AI-Powered Digital Twins for Preventive Maintenance
Register Now
Services

AI Services
Automate Decisions, Predict Outcomes, and Act Faster With Purposeful AI

Agentic AI
Deploy autonomous agents for task execution

Generative AI
Generate content and automate workflows instantly

AI Consulting
Expert AI consulting services, from strategy to deployment,

AI Strategy
Find where AI fits and build the roadmap.

Intelligent Automation
Intelligent Bots Streamline Repetitive Workflows

AI Governance
Governance That Powers Faster AI Innovation

AI Application Development
Ship production apps powered by AI.

RAG Development
Intelligent Retrieval for Smarter Decisions

AI Model Development
Build custom models for specific problems.

LLM Development
Build real products on language models.

MLOps Consulting
Keep models running reliably in production.

ML Consulting
Apply machine learning to business problems.
Data Services
Automate Decisions, Predict Outcomes, and Act Faster With Purposeful AI

Data Platform Migrations
Drive innovation and smarter decisions with AI.

Data Analytics
Unlock actionable intelligence from your data

Data Integration
Unify disparate data sources seamlessly

Data Governance
Ensure compliant, secure data management

Azure Cloud Solutions
Scale and innovate with AI-powered Azure solutions.

Predictive Analytics
Forecast demand faster and with precision

Data Engineering
Build pipelines that deliver clean data.

Data Strategy
Align data with goals worth measuring.

Data Modernization
Move off legacy platforms to cloud

Data Architecture
Design data platforms that scale.
Migration Accelerators
Automate & Accelerate Your Modernization Journeys

Azure to Microsoft Fabric
Consolidate analytics infrastructure for unified insights

Cognos to Microsoft Power BI
Transition BI tools with preserved dashboards seamlessly

Crystal Reports to Microsoft Power BI
Modernize legacy reports with advanced BI features

Alteryx to Microsoft fabric
Upgrade analytics workflows with Fabric capabilities

Informatica to Databricks
Build Lakehouse ETL pipelines for modern analytics

Informatica to Alteryx
Enable self-service analytics with automated conversion

Informatica to Microsoft fabric
Consolidate data integration into Fabric workflows

Informatica to Talend
Streamline ETL transitions with preserved business logic

SQL services to Microsoft Fabric
Modernize databases into unified analytics platform

SSRS to Microsoft Power BI
Convert server reports to interactive Power BI.

Tableau to Microsoft Power BI
Reduce costs, boost integration with Microsoft ecosystem

UiPath to Power Automate
Cut costs, boost efficiency, unlock seamless M365 integration
Technologies
Leading Platform Expertize to Enable Your Growth Goals

Microsoft Fabric
Integrate all data analytics end-to-end seamlessly

Microsoft Power BI
Visualize insights with interactive dashboards and reports

Microsoft Purview
Unified data governance, security, and compliance.

Databricks
Scale analytics on an enterprise unified Lakehouse

Snowflake
Store, query, and analyze large-scale data, all in one platform.

AI-Powered Digital Twins for Preventive Maintenance
Register Now
Industries

Industries
Industry Expertise Delivering Your Sector's Critical KPIs

Automotive
Accelerate production, optimize operations, create smarter CX.

Banking
Transform operations seamlessly with secure & compliant analytics.

Healthcare
Modernize systems, automate workflows, make faster decisions.

Insurance
Automate claims, enhance underwriting, personalize customer engagement.

Logistics & Supply Chain
Modernize operations for faster decisions, better forecasting.

Manufacturing
Boost production speed, reduce downtime, improve forecast accuracy.

Pharma
Accelerate research, improve efficiency, deliver faster.

Retail & FMCG
Digitize operations, automate tasks, deliver stronger customer connections.
AI Solutions

AI Agents
Autonomous AI Agents Built for You

Alan
AI legal summarizer that processes and condenses lengthy legal documents

Mike
AI quantitative proofreader that catches arithmetic errors

Susan
AI PII redactor that automatically removes sensitive information
AI for Enterprise
AI Solutions for Enterprise Workflows

Karl
Data insights agent that analyzes data and delivers quick insights

Ember
Automate customer service ops, resolve issues faster

DokGPT
Document intelligence agent that retrieves information instantly
AI for Business Roles
Optimize Core Business Processes for Scale with AI

Sales
Forecast revenue with AI precision

Finance
Automate reconciliation and financial reporting

Supply Chain
Optimize inventory and logistics routes

Operations
Boost efficiency through intelligent automation
AI for Industries
Industry Expertise Delivering Your Sector's Critical KPIs

AI Manufacturing
Smarter Production, Less Downtime

AI Pharma
Faster Innovation, Better Patient Outcomes

AI Insurance
Automate claims, underwriting, and policies

AI Logistics
Optimize routes, freight, and fulfillment

AI Automotive
Predictive maintenance, production, and quality

AI Healthcare
Enhanced patient and care operations

AI Banking
Faster decisions, smarter banking workflows

AI Retail
Smarter inventory, pricing, and demand

Microsoft Fabric Analyst in a Day
Register Now
Resources

Tools
Assessments & Calculators for Enterprises

AI Maturity Assessment
Evaluate your AI readiness & plan the next step

Migration ROI Calculator
Calculate your migration savings instantly
Resources
Insights Hub with Blogs, Tools, and Industry Resources.

Blogs
Stay ahead with the latest trends on Data & AI

Events & Webinars
Participate in leading events for knowledge & networking

Case studies
See proven transformation results from real client projects.

Whitepapers & Industry Reports
Step by step guidance to shape your Data & AI strategy

Infographics
Visualize complex concepts fast & clear

Videos
Demoes, case studies, thought leadership and more

Podcasts
Hear our experts dive deep to topics that matter

Datasheets
Cheat sheet to decode our solution capabilities

Knowledge Hub
Centralized learning resources

Glossaries
Master industry terminology

AI-Powered Digital Twins for Preventive Maintenance
Register Now
About

Company
Discover Our Mission and Opportunities

About us
Get to know our journey, vision, and the people behind us.

Contact us
Connect with us to discuss ideas, support needs, or partnerships.

Career
Build your career with us and grow through meaningful opportunities.

Newsroom
Discover company announcements, media mentions, and the latest updates.
Partners
Tech Partners Powering Your Digital Transformation

Enablers
Tech Enablers that Help us Power Your Digital Transformation

Microsoft
Accelerating data adoption to help organizations stay AI-ready.

Databricks
Powering Lakehouse analytics at scale for modern data-driven enterprises.

Snowflake
Simplify data modernization and accelerate analytics on Snowflake.

Microsoft Fabric Analyst in a Day
Register Now
Mobile

Call us
ROI Calculator
Contact Us
Instagram Facebook-f X-twitter Linkedin-in Youtube

+1 (855) 6-KANERI

Learn How AI-Powered Digital Twins help in Preventive Maintenance

Home Blogs Microsoft Fabric OneLake Catalog: Features, Setup Guide, and Best Practices

Microsoft Fabric OneLake Catalog: Features, Setup Guide, and Best Practices

TL;DR

The Microsoft Fabric OneLake catalog is a governance layer that replaced the legacy OneLake Data Hub, giving teams a searchable inventory of tables and models across workspaces with data quality scores, lineage, and endorsement labels.

Poor data quality costs organizations at least $12.9 million a year on average, according to Gartner. Most teams trace it to the same root cause, which is analysts building on the wrong data because they had no reliable way to find the right data.

A financial analyst joins a new team, opens Microsoft Fabric, and sees 340 tables across 12 workspaces with no descriptions, no labels, and no sign of which are production-ready. Three hours later they’re on Slack asking engineers. The certified semantic model they needed was there the whole time.

That is the problem the Microsoft Fabric OneLake catalog is designed to solve. In this article, we’ll cover its core features, setup guide, Purview comparison, industry use cases, and migration best practices.

Key Takeaways

OneLake catalog replaced the legacy OneLake Data Hub. It is a full governance platform with domains, quality scores, lineage, endorsement, and sensitivity labels, not just a search tool.
Data quality scores are the most underused feature. Organizations that configure DQ rules first see quick gains in report reliability and analyst trust.
OneLake catalog and Microsoft Purview are complementary, not competing. OneLake catalog is Fabric-scoped; Purview covers enterprise-wide, multi-cloud assets. Purview labels surface automatically inside the catalog.
External catalog federation (connecting to Databricks Unity Catalog and AWS Glue) is in preview and positions OneLake catalog as a multi-cloud discovery layer.
A basic catalog setup takes 1–2 weeks. A full enterprise implementation takes 6–8 weeks. The difference between those two is the difference between a governance checkbox and actual governance.
Catalog adoption directly cuts report rebuild rates. When analysts build on certified, quality-scored datasets, errors fall sharply.

Looking for Help With Your Microsoft Fabric OneLake Catalog Setup?

Get advice on catalog setup, governance configuration, and enterprise best practices.

Contact Us

What Is the Microsoft Fabric OneLake Catalog?

The Microsoft Fabric OneLake catalog is a centralized governance and discovery layer built on top of OneLake, Fabric’s unified storage layer. Every user in a tenant gets one place to find, endorse, and govern data assets across all workspaces. The catalog is also embedded in Microsoft Teams, Microsoft Excel, and Microsoft Copilot Studio, so discovery happens where people already work.

It replaced the legacy OneLake Data Hub, and the difference is substantial. The Data Hub was a search tool scoped to Power BI datasets, with basic endorsement and a single flat view. The catalog covers every Fabric item type across three tabs (Explore, Govern, and Secure). It adds domains, data quality scores, sensitivity labels, end-to-end lineage, Copilot-powered discovery, and external catalog federation. Purview governance signals surface inside it natively. It’s a fundamentally different kind of tool.

Core Features of the Microsoft Fabric OneLake Catalog

The catalog is built around ten capabilities that work together to make data discoverable, trustworthy, and governed. Each one addresses a specific gap that exists without a catalog layer in place.

1. Unified Data Asset Discovery Across 9 Fabric Item Types

The catalog covers lakehouses, warehouses, SQL analytics endpoints, semantic models, KQL databases, pipelines, notebooks, dataflows, and reports, all in one place. Users filter by workspace, domain, item type, endorsement status, sensitivity label, and data quality score. Access is role-scoped, so people only see items they’re authorized to view.

2. The Govern Tab: Governance Posture Across 4 Health Indicators

The Govern tab shows the health of everything you own in Fabric. It covers curation completeness, endorsement coverage, sensitivity label gaps, and recommended actions to close them. Fabric admins get a tenant-wide view of item counts, workspace counts, capacities, and domains. Note that insights refresh daily, not in real time, so factor that into same-day monitoring workflows.

3. The Secure Tab: Centralized Management Across All Workspace and OneLake Roles

The Secure tab gives admins a single place to audit permissions, review user access, and manage security roles. It covers both workspace and OneLake security scopes. Most teams use it to catch permission drift before it becomes a compliance issue.

4. Business Domains: Organize the Tenant Into Logical Data Areas

Domains group workspaces by business area, Finance, HR, Sales, Operations, Supply Chain, so analysts can filter straight to relevant assets without asking anyone. Each domain has its own admin, enabling federated governance without routing everything through central IT. Without domains, the catalog becomes a flat list that doesn’t scale past a handful of workspaces.

5. Catalog Endorsement: 3 Trust Levels for Every Item

Endorsement assigns every catalog item one of three trust levels. Promoted means the workspace owner recommends it. Certified means formal sign-off against production standards. Master data marks the authoritative source of record. Data consumers filter to Certified items and see only the validated ones, with a full certified-by trail for accountability.

Only users with Certifier permissions, assigned by Fabric admins in tenant settings, can certify items. Workspace owners can promote. Set this hierarchy at configuration time to prevent endorsement from becoming a free-for-all.

6. Data Quality Scoring: 5 Rule Dimensions, 0–100% Scores per Item

Data quality scores (0–100%) are generated by rules that check completeness, uniqueness, accuracy, consistency, and validity. Scores appear on every catalog item card, so analysts filter to items above a threshold before building a report rather than guessing. Despite this, scores are configured in fewer than 30% of Fabric deployments. Prioritize by consumption volume and business risk to make the effort manageable.

SKU note: Data quality scoring requires Fabric capacity. Check the licensing page for full SKU details.

7. Sensitivity Labels: 4-Tier Classification on Every Item Card

Sensitivity labels from Microsoft Purview flow automatically into the catalog and appear on every item card before access is attempted. The four tiers are Public, General, Confidential, and Highly Confidential. Governance teams can filter to all Highly Confidential items tenant-wide in one view, which is essential for GDPR, HIPAA, and SOX workflows. Enable label inheritance so items created from a labeled source carry that label forward automatically.

8. Data Lineage: Full Upstream and Downstream Traceability

The catalog traces the full path from data source through pipeline to lakehouse to semantic model to report. Teams use it for impact analysis before schema changes, audit traceability for compliance, and debugging data quality issues upstream. Lineage auto-generates for Fabric-native items. External sources like on-prem SQL Server and Salesforce need a Purview scanner configured, or they appear as dead ends in the lineage chain.

9. Copilot-Powered Discovery: 2 AI Capabilities Built Into the Catalog

Copilot adds two capabilities to the catalog. The first is AI-generated descriptions and the second is natural language search. Descriptions are auto-generated from schema and metadata; stewards review before publishing. Search lets users query in plain English: “find all certified customer tables in the Finance domain.” Both are available on all paid Fabric SKUs with Copilot enabled. Treat generated descriptions as a first draft, since complex schemas with abbreviated column names frequently need steward correction.

10. External Catalog Federation: Connect to 3 External Platforms (Preview)

External catalog federation connects OneLake catalog to Databricks Unity Catalog, AWS Glue Data Catalog, and Google Data Catalog, surfacing assets from those platforms alongside Fabric items in a single view. This feature is in preview as of early 2026. Don’t architect production multi-cloud governance strategies around it yet, but when it reaches GA, it turns OneLake catalog from a Fabric-only tool into a genuine enterprise-wide data catalog.

Microsoft Fabric Implementation and Governance

End-to-end Fabric delivery from tenant setup to catalog governance and Purview integration.

Explore the Services

How to Set up the Microsoft Fabric OneLake Catalog: Step-by-Step Guide

Each step builds on the last. Skipping ahead creates rework.

Step 1: Enable OneLake Catalog in Fabric Tenant Admin Settings

Navigate to the Fabric Admin Portal under Tenant Settings → Governance and insights settings. Settings to configure:

OneLake catalog: toggle On (tenant-wide)
Users can discover data using Microsoft Fabric: enable for all users or specific security groups
Certification: define who can certify items. Do this before any endorsement activity starts. Assign to a named security group, not individual users, so the permission set stays maintainable
External data sharing: configure whether catalog users can see externally shared items
Sensitivity label policies: confirm Purview Information Protection is connected and labels are visible in Fabric items

Admin Portal path: app.fabric.microsoft.com → Settings gear → Admin portal → Tenant settings

One thing to set correctly from the start is role-scoped access. Fabric Admins see everything tenant-wide. Domain Admins see all items in their domain. Workspace Admins, Members, and Contributors manage items in their workspaces. Viewers get read-only access. Users with no workspace role cannot see an item in the catalog at all. If someone says they can’t find a dataset, it almost always comes down to workspace membership, not catalog configuration.

Step 2: Create and Configure Business Domains

Navigate to: Admin Portal → Domains

Select Create domain
Name the domain (e.g., “Finance”, “Operations”, “Supply Chain”)
Assign domain admins, these users manage all workspace assignments and governance settings within the domain
Assign workspaces to the domain, each workspace can belong to only one domain
Set default sensitivity labels for the domain if desired

Start with five to seven domains mapped to how data is consumed, not how the org chart is structured. Finance and Accounting might be one org unit but two distinct data consumer communities. Subdivide later as usage patterns emerge.

Step 3: Configure the Endorsement Workflow and Certification Criteria

Before enabling endorsement, define what “Certified” means in your organization. Write it down. This matters more than the permission configuration.

Navigate to: Admin Portal → Tenant Settings → Endorsement and discovery

Enable Certified endorsement
Assign Certifier permissions to a security group
Communicate the certification criteria to workspace owners

Then, in individual workspaces, workspace owners can begin promoting items. Certifiers can certify items that meet the defined standard via the item settings panel in the catalog.

Step 4: Configure Data Quality Rules and Scoring

Navigate to: Fabric workspace → [Select a lakehouse or warehouse] → Data quality tab

Select the table to score
Choose a quality dimension: Completeness, Uniqueness, Accuracy, Consistency, or Validity
Define the rule logic (e.g., column X must be non-null in 99% or more of rows)
Set threshold: score above X% means Pass
Schedule rule execution
Scores appear on the catalog item card after first run

Start with your top 20 most-consumed items. A catalog with quality scores on 20 items is more useful than one with empty scores on 200. Prioritize production semantic models, certified lakehouses, and financial reporting tables.

Step 5: Verify Sensitivity Label Flow and Information Protection

Confirm Microsoft Purview Information Protection is configured at the Microsoft 365 tenant level
In Fabric Admin Portal, verify Information protection settings are enabled
Open the OneLake catalog and spot-check 10–15 items for label presence
Identify label gaps, particularly tables containing PII, financial data, or health information
Validate that label inheritance is working: create a derived item from a labeled source and confirm the label propagates

Under GDPR, knowing where personal data resides is a foundational requirement. Sensitivity label filtering in the catalog is exactly how you meet it.

Step 6: Enable Copilot and Drive Catalog Adoption Across the Organization

Confirm Copilot is enabled in Fabric Admin Portal under Tenant Settings → Copilot and Azure OpenAI Service
In the catalog, select high-traffic items → Edit description → Generate with Copilot
Data stewards review, edit, and approve generated descriptions
Run an analyst onboarding session: show how to filter to Certified and quality-scored assets
Set “start in the catalog” as the team standard for every new report development task

Track the ratio of reports built on certified vs. uncertified datasets. That number tells the real governance story more than item counts ever will. That ratio tells the real governance story, not the number of items in the catalog.

Phase	Step	Timeframe	Primary Owner	Success Signal
1	Tenant admin configuration	Days 1–2	Fabric Admin	All settings toggled; certifier security group assigned
2	Domain creation and workspace assignment	Week 1	Fabric Admin + Domain Admins	All production workspaces assigned to a domain
3	Endorsement workflow and certification criteria	Weeks 1–2	Governance Lead	Written certification criteria published; first items promoted
4	Data quality rule configuration	Weeks 2–4	Data Engineering + Stewards	Top 20 items scoring; scores visible on catalog cards
5	Sensitivity label verification	Weeks 3–4	Governance / Compliance	Zero unlabeled PII items; inheritance validated
6	Copilot enablement and adoption program	Weeks 4–6	Governance Lead + Business Leads	Catalog search is the default start point for report development

OneLake Catalog Vs. Microsoft Purview Data Catalog

A common governance question is whether teams that already have Purview still need the Microsoft Fabric OneLake catalog.

They are not the same thing. They are designed to work together. Microsoft Purview is an enterprise-wide governance platform covering Azure, AWS, GCP, on-prem, SaaS, and databases. The OneLake catalog is the Fabric-native discovery and governance experience that surfaces Purview’s signals at the consumption layer. Governance teams, CDOs, and compliance officers managing policy across an entire data estate use Purview. Data consumers, analysts, and engineers working inside Fabric every day use the OneLake catalog. Setup complexity reflects this difference. The catalog is built into Fabric and requires minimal configuration. Purview requires Data Map scanning setup and is a heavier investment.

A simple way to decide which to use is based on your data estate scope and governance maturity.

Fabric-first, early governance maturity: Start with OneLake catalog only. Build discipline before adding Purview complexity.
Mid-enterprise, Microsoft stack (Fabric + Azure SQL, ADF, some on-prem): Add Purview Information Protection labels early, but defer the full Purview Data Map until you have the adoption baseline.
Large enterprise, multi-cloud (AWS S3, Salesforce, Oracle, SAP): Run both. Purview as the governance backbone, OneLake catalog as the Fabric-layer experience.
Regulated industry (HIPAA, SOX, GDPR): Both are required regardless of estate complexity.
Migrating from a legacy environment: Pair OneLake catalog with FLIP migration accelerators to populate catalog metadata during migration, not after.

Purview also covers capabilities the OneLake catalog doesn’t handle natively. That includes a business glossary with term-to-asset assignment, automated sensitive data classification at scale, policy-based access control, governance of non-Fabric sources like Oracle, SAP, and Salesforce, and data stewardship workflows. For organizations in regulated industries, these aren’t optional. They’re where the compliance story gets built.

Kanerika infographic comparing Microsoft Fabric OneLake Catalog and Microsoft Purview.

How OneLake Catalog Governs OneLake Shortcuts

OneLake Shortcuts are a frequently overlooked interaction point with the catalog. Shortcuts created from external sources like ADLS Gen2, Amazon S3, or Google Cloud Storage appear as catalog items inside the lakehouse that hosts them. The governance behavior varies by signal type, as shown below.

Governance Signal	Fabric-to-Fabric Shortcut	ADLS Gen2 / Azure Storage	Amazon S3 / GCS	Action Required if Gap Exists
Sensitivity label	Inherited from parent lakehouse (if inheritance on)	Inherited from parent lakehouse (if inheritance on)	Inherited from parent lakehouse (if inheritance on)	Enable label inheritance in tenant admin settings
Data quality scoring	Not inherited: must configure rules on shortcut target	Not inherited: must configure rules on shortcut target	Not inherited: must configure rules on shortcut target	Configure DQ rules explicitly per shortcut item
Catalog visibility	Appears as item in the lakehouse hosting the shortcut	Appears as item in the lakehouse hosting the shortcut	Appears as item in the lakehouse hosting the shortcut	No action needed: automatic
Lineage (upstream)	Auto-generated within Fabric	Breaks at shortcut boundary without Purview	Breaks at shortcut boundary without Purview	Configure Purview scanner for external source
Endorsement	Must be set manually on the lakehouse	Must be set manually on the lakehouse	Must be set manually on the lakehouse	Include in endorsement workflow for the workspace

OneLake Catalog in Action: 5 Industry Use Cases

1. Financial Services: Certified Regulatory Datasets and SOX Audit Trails

Financial reporting teams spend significant time finding the right dataset and then proving it’s traceable. The catalog solves both. End-to-end lineage traces every number in a close report back to its source system automatically. Endorsement ensures analysts only build on formally validated production data. Sensitivity labels on all PII and account data block unauthorized discovery before access is even attempted.

In a recent bank engagement, KANComply mapped existing data assets to SOX control requirements inside the Purview governance layer, cutting audit preparation time by eliminating the manual evidence collection step. See the outcome in the sales financials KPIs case study.

2. Healthcare: PHI Protection and FDA-Ready Data Lineage

Clinical data teams need to know which datasets are safe to use for model training and be able to prove the data chain for regulatory validation. The catalog makes both manageable. All PHI-containing items are labeled Highly Confidential. Data scientists filter to Certified datasets before training, removing the risk of using uncertified or synthetic data by accident. Catalog lineage produces a traceable data flow from source to analytical output, supporting FDA 21 CFR Part 11 documentation.

KANComply maps HIPAA safeguards directly to Fabric items in the catalog, so compliance officers can see which regulatory controls each asset is mapped to, not just that it exists.

3. Manufacturing and Supply Chain: Domain-Organized Sensor Data and AI Agent Governance

Factories generate thousands of data streams across plants, lines, and sensors. Without a catalog layer, discovery becomes a spreadsheet problem. Domain structure organizes those streams by plant and production line, giving operations analysts a view that reflects how they think rather than how pipelines are built. AI agents like KARL that operate on production data require certified, quality-scored catalog assets as a prerequisite. When catalog governance breaks down, agent output breaks down with it.

Two native Fabric workloads debuted at FabCon 2026 on this exact premise. The approach held up in practice for a global packaging manufacturer and for a logistics client.

4. Insurance: Single Source of Truth for Claims and Underwriting Data

Insurance teams deal with overlapping data sources across claims, underwriting, and actuarial functions. Without endorsement, analysts can’t tell which version of a dataset is authoritative. Master data endorsement in the catalog designates the source of record for each entity. Lineage then confirms everything downstream traces back to it. The result is a a single source of truth for claims and underwriting that satisfies state insurance regulation requirements.

5. Retail: Real-Time Inventory and Sales Analytics on Certified Data

Retail analytics breaks down when inventory and sales data isn’t trusted, and the catalog’s DQ scores on certified semantic models give analysts a clear signal before they build. When teams migrate using FLIP, metadata populates the catalog during the migration itself, so they arrive at a governed catalog from day one rather than spending months backfilling documentation afterward. This played out directly for a retail and material handling client. For teams migrating from fragmented reporting environments, catalog-first delivery.

Why Most OneLake Catalog Deployments Stall and How to Fix Them

The Three Mistakes That Kill Catalog Value

Treating the catalog as a feature, not a governance program: Organizations enable OneLake catalog in the admin portal and consider it done. A proper Microsoft Fabric governance program is what separates configuration from value. But catalog value comes from coverage, endorsement discipline, and user adoption, not from the toggle being on. A common pattern is catalogs where 80% of assets have no endorsement, no quality scores, no descriptions. Technically configured; practically useless.

Building governance structure before there is data to govern: Some teams spend weeks designing domain hierarchies before any Fabric workloads are in production. Governance structure built for a dataset inventory that doesn’t exist yet ends up redesigned once real usage patterns emerge. Get two or three high-value workloads into production and certified first. Let governance structure follow real consumption patterns.

Skipping the data quality foundation: Without DQ rule configuration, every item in the catalog looks equal. Consumers cannot distinguish reliable from unreliable data and default to asking engineers, which is exactly the behavior the catalog is supposed to eliminate.

The Five Stages of Catalog Maturity

Most implementations follow a predictable progression. Knowing where you are helps you know what to fix next.

Level 1: Configured (Days 1–5): Catalog enabled, domains created, some items visible. No endorsement, no DQ scores, no labels, no adoption yet.
Level 2: Structured (Weeks 1–3): Workspaces assigned to domains, endorsement permissions set, top items promoted. DQ rules still missing; analysts still asking engineers.
Level 3: Trusted (Weeks 4–8): Certified items with DQ scores of 90%+; sensitivity labels on all PII items; lineage visible for production pipelines.
Level 4: Operationally Embedded (90+ days): Catalog is the default start for all report development; DQ rules on all Tier 1 items; Purview integration complete.
Level 5: Governed at Scale (6+ months): Business glossary terms mapped via Purview; external catalog federation active; governance KPIs tracked monthly.

When analysts build on trusted, quality-scored data, they rebuild reports less often. As Amit Chandak, Kanerika’s Chief Analytics Officer and Microsoft MVP in Power BI, puts it: “The catalog doesn’t govern your data. Your team does. The catalog makes it possible for your team to govern at scale without it becoming a full-time job.”

What Happens to Your Metadata When You Migrate to Microsoft Fabric

Existing catalog metadata doesn’t migrate automatically. The OneLake catalog starts from scratch, and for most organizations that is an opportunity. Legacy environments tend to accumulate years of inconsistent metadata, outdated endorsements, and governance debt. A Fabric migration is a natural forcing function to rebuild catalog governance with better structure than what came before.

The key is running a catalog population sprint in parallel with the technical migration rather than treating it as a post-migration activity that perpetually gets deferred. FLIP migration accelerators automate 50–60% of the technical migration work across 12 supported paths and generate structured metadata as part of the output. What that means in practice:

Item descriptions: generated from source metadata during migration, not written manually after
Lineage connections: built into the FLIP artifact set for SSIS, Azure Data Factory, Informatica, Alteryx, SSAS, and Oracle paths
Workspace organization: domain and workspace structure recommendations generated alongside the migration deliverables
Endorsement and DQ rules: not generated by FLIP, these require manual configuration post-migration, but with the metadata foundation already in place

The practical difference is arriving at a governed catalog at migration completion versus spending three months post-migration trying to document what was just built.

What Transfers Automatically and What You Need to Configure

Metadata Type	Migrates Automatically?	FLIP Accelerator Output	Manual Action Required
Table and column names	Yes: from source schema	Preserved during pipeline migration	None
Item descriptions / documentation	No	FLIP generates descriptions from source metadata where available	Steward review and enrichment
Lineage connections (Fabric-native)	Yes: auto-generated for Fabric items post-migration	Lineage connections generated in FLIP artifact set	Verify completeness post-migration
Lineage (external source connections)	No	Partial: FLIP documents source connections	Purview scanner configuration for external sources
Endorsement status	No: starts at zero	Not generated by FLIP	Must re-certify migrated items per new certification criteria
Data quality scores	No: rules do not transfer	Not generated by FLIP	DQ rules must be configured post-migration (use DQ prioritization framework)
Sensitivity labels	No: labels do not transfer	Not generated by FLIP	Re-apply via Purview label policies; enable inheritance
Domain assignment	No	Workspace organization recommendations in FLIP output	Assign workspaces to domains per domain architecture

OneLake Catalog Limitations to Know Before Implementation

External catalog federation is still maturing: Unity Catalog and AWS Glue connections are in preview as of early 2026. Not yet suitable for production governance architectures.
Private Link incompatibility: OneLake catalog is not compatible with Private Link configurations. Tenants using Private Link for network isolation cannot access the catalog, this is a known limitation to assess before implementation in high-security environments.
Lineage gaps for third-party sources: Lineage auto-generation works for Fabric-native items. Pipelines pulling from non-Fabric sources without Purview scanning will have broken lineage chains.
Data quality rules require real investment: Quality scores don’t appear automatically, someone has to define the rules. For organizations with hundreds of tables, this is a material time commitment.
No native business glossary: The catalog doesn’t include a business glossary with term-to-asset assignment. That capability requires Microsoft Purview.
Limited access request workflow: The catalog surfaces assets but doesn’t handle access request workflows or data product packaging natively. Those require Purview or custom workflow additions.
Copilot descriptions need steward review: AI-generated descriptions can be inaccurate for complex or poorly named schemas.

See How Kanerika Governs Fabric at Enterprise Scale
Body
Get a walkthrough of catalog setup, KAN suite deployment, and FLIP migration in action.
Schedule a Demo

How Kanerika Implements OneLake Catalog for Enterprise Clients

Kanerika is a Microsoft Solutions Partner and one of the earliest Microsoft Purview implementors globally, with 100+ enterprise clients across financial services, healthcare, insurance, manufacturing, and logistics. Catalog and governance configuration is a core part of every Fabric delivery. In a recent banking engagement, KANComply mapped data assets to SOX controls, eliminating the manual evidence collection step from audit preparation entirely. The full delivery stack includes:

KANGovern: defines and enforces data governance policy across the Fabric estate, including domain assignment rules, endorsement workflows, and certification criteria that apply consistently across all workspaces
KANComply: maps catalog assets to regulatory frameworks including GDPR, HIPAA, and SOX, giving compliance officers a direct line-of-sight from each data item to the specific controls it satisfies
KANGuard: monitors for policy violations and unauthorized access in near-real time, alerting data stewards when certified items are modified, relabelled, or accessed outside approved roles
FLIP migration accelerators: automate 50–60% of technical migration effort and generate structured catalog metadata in the process, so teams arrive at a governed catalog at completion rather than months after

Case Study: Enabling Effective Data Governance for a Leading Healthcare Organization

A top North American healthcare organization managing electronic health records, medical imaging, and administrative databases across a network of hospitals, clinics, and specialty centres needed to bring fragmented, ungoverned data under control while meeting strict regulatory requirements.

Challenges:

Data spread across Azure Blob Storage, SQL databases, and SaaS applications with no unified discovery layer
No consistent framework for data classification or metadata management, leading to poor data quality and limited visibility
Low stakeholder understanding of governance practices, blocking effective data integration and management

Solutions:

Built a centralized data catalogue using Microsoft Purview with full metadata and data classification coverage
Designed a classification framework covering steward roles, usage policies, and data handling standards
Integrated Power BI dashboards for business insights and delivered user guides and process documentation for ongoing governance

Results:

57% reduction in data discovery time
35% increase in data accuracy
70% improvement in data accessibility
90% increase in compliance adherence

Wrapping Up

The Microsoft Fabric OneLake catalog is one of the platform’s most underused capabilities. Most organizations configure it and move on. The ones getting real governance value treat it as a program, not a feature, defining quality standards, training data consumers, enforcing endorsement discipline, and connecting it to Purview governance. The difference between a basic setup and one that works shows up at 90 days, when analysts either self-serve or are still on Slack asking engineers.

Ready to Build a Governed Fabric Environment?
Talk to Kanerika’s team about your catalog and governance requirements.
Book a Meeting

FAQs

What Is the OneLake Catalog in Microsoft Fabric?

The Microsoft Fabric OneLake catalog is the platform’s built-in governance and discovery layer, covering all item types across every workspace in your tenant. Unlike the legacy OneLake Data Hub it replaced, it goes beyond search to include endorsement workflows, data quality scoring, sensitivity labels, end-to-end lineage, and Copilot-powered descriptions. The official Microsoft Learn overview has the complete feature reference.

Is OneLake Catalog the Same As Microsoft Purview Data Catalog?

No. The OneLake catalog is scoped to Microsoft Fabric items and built natively into the Fabric experience. Microsoft Purview covers enterprise-wide, multi-cloud assets and adds business glossary, automated classification, and access policy enforcement. They complement each other; Purview’s sensitivity labels and governance signals surface automatically inside the OneLake catalog.

Does the OneLake Catalog Cost Extra?

The OneLake catalog is included in Microsoft Fabric capacity licensing, no separate catalog license required. Data quality scoring requires Fabric F-SKU capacity. Copilot descriptions require Copilot to be enabled. Basic discovery and endorsement are available at no additional cost for any Fabric tenant. Full details are on the licensing page.

What Are the Endorsement Levels in the OneLake Catalog?

There are three levels: Promoted, Certified, and Master data. Certified is the most important for data consumers; it means the item has formal sign-off and meets production standards. The full permission model for endorsement is documented on Microsoft Learn.

How Do Data Quality Scores Work in the OneLake Catalog?

Data quality scores (0–100%) are generated by rules you configure in Fabric’s Data Quality feature. Rules check completeness, uniqueness, accuracy, and consistency against defined thresholds. Scores appear on catalog item cards after the first rule run.

Can the OneLake Catalog Connect to Databricks Unity Catalog or AWS Glue?

Yes, external catalog federation enables connections to Databricks Unity Catalog, AWS Glue, and other external platforms. This feature is in preview as of early 2026. Production multi-cloud governance architectures should not depend on it until it reaches full GA status.

How Do OneLake Shortcuts Interact With the Catalog?

OneLake Shortcuts appear as catalog items within the lakehouse where they are created. Sensitivity labels inherit from the parent lakehouse when label inheritance is enabled. Data quality rules must be configured explicitly against shortcut targets, they don’t inherit automatically. Lineage for external shortcuts requires Purview scanner configuration to show the complete chain.

Does the OneLake Catalog Have a Business Glossary?

No. Business glossary with term-to-asset assignment is not native to the OneLake catalog. That capability is provided by Microsoft Purview, which integrates with the catalog so governed terms can surface alongside catalog items.

Authored by

Paridhi Agrawal | Content Writer

Currently working as a content writer at Kanerika. With a strong interest in technology-focused content and digital communication, I enjoy writing blogs that blend research, creativity, and clarity to create meaningful and engaging reading experiences.

View Profile ⇒

Reviewed by

Amit Chandak | Chief Analytics Officer

Amit leads Kanerika's AI team, bringing expertise in machine learning, NLP, deep learning, and predictive analytics to help clients implement AI and extract value from their data.

View Profile ⇒

Let’s Transform Your Business

Manage cookie consent

We use cookies to enhance your experience. Consenting allows us to process data like browsing behavior or unique IDs. Not consenting may affect site functionality.
Functional Functional Always active
Preferences Preferences
Statistics Statistics
Marketing Marketing
Manage options
Manage services
Manage {vendor_count} vendors
Read more about these purposes
View preferences
{title}
{title}
{title}

AI-Powered Digital Twins for Preventive Maintenance
Limited seats available! Register Now

I agree to receive marketing messages from Kanerika via automated calls, texts, or emails. This isn’t required for purchase and I can opt out anytime.

Your Free Resource is Just a Click Away!

I agree to receive marketing messages from Kanerika via automated calls, texts, or emails. This isn’t required for purchase and I can opt out anytime.

AI Agents

AI Services

Data Services

AI Agents

AI for Enterprise

Tools

Resources

Partners