This guide covers the overall requirements and configuration options needed to properly set-up and configure your Discovery Hub Environment.
Complete the following steps to properly configure your system. Click the links to jump to each section.
- Understand the Discovery Hub Environment
- Deploy in the Cloud or On-Premises
- Configure Discovery Hub Services
Understand the Discovery Hub Environment
To properly configure the Discovery Hub environment, it’s important to first understand how the application interacts with other applications and data storage platforms.
The Discovery Hub platform can be understood as a database abstraction layer on top of Microsoft Data Storage platforms (SQL Server or Azure Data Services) enabling the rapid development and maintenance of a modern data estate. Any working Discovery Hub environment primarily consists of 4 working parts, the application (UI), services, project repository, and data storage.
The application (UI) is used to define a set of data models, movements and transformations. Clicking Save stores these “definitions” as metadata in a SQL database known as the project repository. During deployment, Discovery Hub uses this metadata to generate instructions (SQL stored procedures or similar) which are sent to the desired data storage. During execution, SQL server will carry out these instructions, loading data from source systems, transforming it and loading it into the defined models in the target databases. The Discovery Hub services ensure things like schedule execution and multiple environments continue to work without users present.
Deploy in the Cloud or On-Premises
Discovery Hub can be deployed either in the Cloud using Azure PaaS Database technology, or On-Premises using SQL Server.
Cloud Deployment (Recommended)
Running both Discovery Hub and your data storage platform(s) in the Azure cloud provides a number of benefits.
- Because time matters: When deploying an Azure Marketplace Template your analytics environment can be ready in as little as 30 minutes.
- Improved Performance: All Azure services are managed and configured for optimal performance as well as active geo-replication ensure near 100% uptime.
- Security: Azure enforces the strictest data security standards such as Transport Security Layer (TLS) and transparent data encryption.
- Reduced Cost: By leveraging Azure's data centers you don't need to pay for hardware or expensive DBA's to maintain your infrastructure.
If you would like to deploy Discovery Hub in the cloud the next step is to Choose & Deploy an Azure Marketplace Template.
If you are not quite ready to move to the cloud, you can Configure & Deploy Discovery Hub On-Premises.
Configure Discovery Hub Services
Configure the Scheduler
When you are ready to begin recuring updates of your data estate you can Configure the Discovery Hub Scheduler Service.
Configure the ODX Server (optional)
If you wish to extract and store large amounts of raw data in a Data Lake or Operational Data Store, you can Configure the ODX Server.
Configure Multiple Environments (optional)
If you wish to keep your production data separate from your development and testing environments you can Configure Multiple Environments.