Pentaho Data Catalog
Data QualityData IntegrationBusiness Analytics
  • Overview
    • Pentaho Data Catalog ..
  • Overview
  • Data Catalog
    • Getting Started
      • Data Sources
      • Process the data
      • Identify the data
      • Business Glossaries & Terms
      • Reference Data
      • Data Lineage
    • Management
      • Users, Roles & Community
      • Data Identification Methods
      • Business Rules
      • Metadata Rules
      • Schedules
      • Workers
    • Components
      • Keycloak
      • Reverse Proxy Server
      • App Server
      • Metadata Store
      • Worker Server
      • Observability
Powered by GitBook
On this page
  1. Data Catalog
  2. Management

Metadata Rules

Enrich the metadata allowing the user to perform various catalog functions ..

PreviousBusiness RulesNextSchedules

Last updated 1 year ago

Metadata rules are pivotal for transforming a data catalog into a dynamic, self-service repository, enhancing access and understanding of data assets.

• Reporting: They enable the generation of reports on data usage, lineage, and custom properties, offering insights into organizational data utilization.

• Automation: Metadata rules streamline the curation process through automation, thereby reducing manual effort and increasing the efficiency of managing the data catalog.

• Search and Discovery: Enhanced metadata improves the searchability of data assets, facilitating quick access to the most relevant data for analytical or business purposes.

• Data Quality and Governance: These rules are crucial for maintaining data quality by enforcing standards and policies to ensure accuracy, consistency, and compliance with regulations.

• Tagging and Classification: By automatically tagging data assets with relevant labels, metadata rules simplify the categorization and retrieval process for analysis.

Metadata rules in a data catalog play a key role in efficiently organizing, managing, and leveraging data.

The following steps will guide you through the enablement and

By default the Metadata Rules engine is not enabled. In future releases you will require a license

  1. To enable the engine, create the required JWT token.

cd
cd /opt/pentaho/pdc-docker-deployment
  1. Switch to root user and write out the token to .env file.

sudo su
echo RULES_PDC_AUTH_TOKEN=\"$(./pdc get-jwt-token RULES_ENGINE)\" >> ./conf/.env
  1. Check that the RULES_PDC_AUTH_TOKEN token has been written to the .env file.

cd ./conf
cat .env
  1. Redeploy PDC.

cd ..
./pdc.sh up
  1. Finally log back into PDC and navigate to: Management -> Roles.

  2. Under the 'Data Quality Administrator' role -> View details and add the user(s).

  3. x

  4. x

x

x

x

x

x

x

Metadata Rules