Getting Started
Walk-through of Pentaho Data Catalog
Pentaho Data Catalog is a powerful tool that enables data engineers, data scientists, and business users to accelerate their data intelligence journey. It automatically discovers, classifies, and contextualizes structured and unstructured data. Here are some key features:
Powerful Business Glossary: Contextualize data with business vocabulary based on governance policies and business rules. This helps activate metadata and ensures alignment with business language.
Data Lineage and Trust: Track data lineage with OpenLineage, building trust as data flows through your organization, and enable data quality and remediation activities.
Observability and Monitoring: A robust observability stack captures popular assets, popular searches, and trends. This helps stewardship organizations focus their energy on the right data.
Integration and Scalability: API-powered integrations with various platforms (NetApp, SAP HANA, S3, SQL views) ensure interoperability. The modern architecture design scales seamlessly.
Enterprise Security: Features include role-based access control (RBAC), password vault support, minimum privileges, multifactor authentication, secure cloud deployments, and no data duplication.
Discover, understand, and govern your data with Pentaho Data Catalog. It offers faster discovery, lower total cost of ownership (TCO), and improved data quality.
To access your catalog, follow these steps:
Open the Google Chrome web browser.
Enter one of the following email and password combinations, then click Sign In.
| Email | Password | Role |
| --- | --- | --- |
| system_admin@hv.com | Welcome123! | All roles combined |
| admin@hv.com | Welcome123! | Community & User Administrator |
| business_steward@hv.com | Welcome123! | Manage Business Glossary |
| business_user@hv.com | Welcome123! | View Business Glossary |
| data_user@hv.com | Welcome123! | Add & Delete content |
| data_developer@hv.com | Welcome123! | Manage Business Rules & Domain Assets |
| data_steward@hv.com | Welcome123! | Manage most features except Glossary |
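If you script against the walkthrough environment, it can help to keep the demo accounts listed above in one place and look them up by what each role is allowed to do. The following is a minimal sketch only: `DemoAccount`, `DEMO_ACCOUNTS`, and `accounts_for` are hypothetical names invented for this illustration and are not part of Pentaho Data Catalog or its API. The emails, shared password, and role descriptions are taken from the account list above.

```python
from __future__ import annotations

from dataclasses import dataclass

# Shared password of the walkthrough demo accounts (from the account list above).
DEMO_PASSWORD = "Welcome123!"


@dataclass(frozen=True)
class DemoAccount:
    """Hypothetical record for one demo login; not a Data Catalog type."""
    email: str
    role: str  # role description from the account list above


DEMO_ACCOUNTS = [
    DemoAccount("system_admin@hv.com", "All roles combined"),
    DemoAccount("admin@hv.com", "Community & User Administrator"),
    DemoAccount("business_steward@hv.com", "Manage Business Glossary"),
    DemoAccount("business_user@hv.com", "View Business Glossary"),
    DemoAccount("data_user@hv.com", "Add & Delete content"),
    DemoAccount("data_developer@hv.com", "Manage Business Rules & Domain Assets"),
    DemoAccount("data_steward@hv.com", "Manage most features except Glossary"),
]


def accounts_for(keyword: str) -> list[str]:
    """Return the emails whose role description contains keyword (case-insensitive)."""
    needle = keyword.lower()
    return [a.email for a in DEMO_ACCOUNTS if needle in a.role.lower()]
```

For example, `accounts_for("business rules")` returns only the data developer account. The match is a plain substring search on the description, so a keyword such as "glossary" also matches the data steward row, whose description says "except Glossary"; treat the result as a hint and check the role description before signing in.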
For enhanced security, it is strongly recommended that users avoid saving their login details directly in web browsers. Browsers may inadvertently autofill these credentials in unrelated fields, posing a security risk.
Best Practice
• Disable Autofill: To mitigate potential risks, users should disable the autofill functionality for login credentials in their browser settings. This preventive measure ensures that sensitive information is not unintentionally exposed or misused.
You can access your user profile via the top menu bar or navigate to various features using the left menu bar.
The Home page serves as a central hub for accessing business tools relevant to your role, including data canvas, business glossary, management tools, and worker resources.
Access the top user menu to manage your user profile, manage assigned data sources, and log out.
View the following table for details about these features:
| Item | Description |
| --- | --- |
| Apps | Click to explore all apps associated with Data Catalog, including Dashboard, that extend the visual discovery and relationship discovery capabilities of Data Catalog. |
| Profile | Click the Profile icon to open User Profile and Data Sources, where you can manage your profile details and assign data sources with the required access levels. |
| More | Click the More icon and select Log Out to log out of Data Catalog. |
| Edit | Click Edit to open the Landing Page Options window, where you can configure the landing page with the options available in Shortcuts and Tables. In Layout, you can also choose a vertical or stacked layout. |
| Home | Returns you to the Home page from your current location in Data Catalog. |
| Data Canvas | Explore your data in the Data Canvas. For more information, see . |
| Glossary | Opens the Business Glossary page, where you can create, organize, and curate business terms to help you navigate your data. For more information, see . |
| Reference Data | Opens the Reference Data page. Reference data sets contain relatively static, unchanging data values that are commonly used by an organization. For more information, see . |
| Management | Manage your data sources, users, user roles, workers, business rules, schedules, dictionaries, and more on the Management page. |
| Workers | Monitor the progress of data activities on the Workers page. |