You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: Purview/Cost-Estimation.md
+18-13Lines changed: 18 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -30,6 +30,7 @@ Last updated: 2025-07-17
30
30
31
31
> [!IMPORTANT]
32
32
> The general formula to keep in mind for estimating the cost of Microsoft Purview is: <br/>
33
+
>
33
34
> -**Cost of Data Map**: Calculated based on the number of capacity units and the price per capacity unit per hour. <br/>
34
35
> -**Cost of Scanning**: Calculated based on the total duration (in minutes) of all scans in a month, divided by 60 minutes per hour, multiplied by the number of vCores per scan, and the price per vCore per hour. <br/>
35
36
> -**Cost of Resource Set**: Calculated based on the total duration (in hours) of processing resource set data assets in a month, multiplied by the price per vCore per hour.
@@ -38,26 +39,29 @@ Last updated: 2025-07-17
38
39
$$
39
40
40
41
1. Data Map (Always on):
41
-
- Number of Capacity Units: Typically 1
42
-
- Total Hours in a Month: 730 hours
43
-
- Price per Capacity Unit per Hour: \$0.411
42
+
43
+
- Number of Capacity Units: Typically 1
44
+
- Total Hours in a Month: 730 hours
45
+
- Price per Capacity Unit per Hour: \$0.411
44
46
45
47
$$
46
48
\text{Total Cost for Data Map} = \text{Number of Capacity Units} \times \text{Total Hours in a Month} \times \text{Price per Capacity Unit per Hour}
47
49
$$
48
50
49
51
2. Scanning (Pay as you go):
50
-
- Total Minutes of Scanning in a Month: [M] minutes
51
-
- Number of vCores per Scan: 32
52
-
- Price per vCore per Hour: \$0.63
52
+
53
+
- Total Minutes of Scanning in a Month: [M] minutes
54
+
- Number of vCores per Scan: 32
55
+
- Price per vCore per Hour: \$0.63
53
56
54
57
$$
55
58
\text{Total Cost for Scanning} = \left( \frac{\text{Total Minutes of Scanning in a Month}}{60} \right) \times \text{Number of vCores per Scan} \times \text{Price per vCore per Hour}
56
59
$$
57
60
58
61
3. Resource Set:
59
-
- Total Hours of Processing in a Month: [H] hours
60
-
- Price per vCore per Hour: \$0.21
62
+
63
+
- Total Hours of Processing in a Month: [H] hours
64
+
- Price per vCore per Hour: \$0.21
61
65
62
66
$$
63
67
\text{Total Cost for Resource Set} = \text{Total Hours of Processing in a Month} \times \text{Price per vCore per Hour}
@@ -117,10 +121,11 @@ $$
117
121
## Cost Estimation for Different Metadata Volumes
118
122
119
123
> [!IMPORTANT]
120
-
> Microsoft Purview `scans metadata to classify, label, and protect data assets`. It does `not scan the actual data content but rather the information about the data`. <br/>
121
-
> `The size of the data itself does not directly` impact the cost of `metadata scanning unless it affects the amount of metadata generated`. The `number of metadata assets and their complexity` are the primary factors influencing costs.
124
+
> Microsoft Purview `scans metadata to classify, label, and protect data assets`. It does `not scan the actual data content but rather the information about the data`. <br/>
125
+
> `The size of the data itself does not directly` impact the cost of `metadata scanning unless it affects the amount of metadata generated`. The `number of metadata assets and their complexity` are the primary factors influencing costs.
122
126
123
127
Assumptions:
128
+
124
129
- The number of metadata assets is assumed based on the data volume, with an average size of 1 MB per metadata asset.
125
130
- The average size of each metadata asset is assumed to be 1 MB.
126
131
- These estimates are based on the assumption that the governed assets and data management costs are applied for 100 hours per month. Actual costs may vary based on specific agreements with Microsoft, usage patterns, etc.
@@ -145,7 +150,7 @@ Assumptions:
145
150
- Include **Managed Virtual Network** and **data transfer** costs if applicable.
146
151
- Get a **real-time, region-specific estimate** (e.g., for Costa Rica or any other region).
-[How Microsoft Purview can be used](#how-microsoft-purview-can-be-used)
36
-
-[Scenario 1: Data Governance for a Financial Institution](#scenario-1-data-governance-for-a-financial-institution)
37
-
-[Scenario 2: Data Protection for a Healthcare Provider](#scenario-2-data-protection-for-a-healthcare-provider)
38
-
-[Scenario 3: Data Analytics for an E-commerce Company](#scenario-3-data-analytics-for-an-e-commerce-company)
39
-
-[Scenario 4: Compliance Management for a Global Enterprise](#scenario-4-compliance-management-for-a-global-enterprise)
36
+
-[Scenario 1: Data Governance for a Financial Institution](#scenario-1-data-governance-for-a-financial-institution)
37
+
-[Scenario 2: Data Protection for a Healthcare Provider](#scenario-2-data-protection-for-a-healthcare-provider)
38
+
-[Scenario 3: Data Analytics for an E-commerce Company](#scenario-3-data-analytics-for-an-e-commerce-company)
39
+
-[Scenario 4: Compliance Management for a Global Enterprise](#scenario-4-compliance-management-for-a-global-enterprise)
40
40
-[Examples of use cases](#examples-of-use-cases)
41
41
-[Collect metadata information from Apache Airflow](#collect-metadata-information-from-apache-airflow)
42
42
@@ -72,6 +72,7 @@ Last updated: 2025-07-17
72
72
## Overview
73
73
74
74
> Keypoints of Microsoft Purview: <br/>
75
+
>
75
76
> 1.`Integration with Microsoft Ecosystem`: Purview offers deep integration with Azure, Power BI, and Microsoft 365, providing a seamless experience for organizations already using these tools. <br/>
76
77
> 2.`Advanced Governance and Compliance`: Purview provides robust governance and compliance features, ensuring your data management practices meet regulatory standards. <br/>
77
78
> 3.`AI-Powered Search and Discovery`: With AI-driven capabilities, Purview enhances data discovery and classification, making it easier to find and manage data assets. <br/>
@@ -521,18 +522,20 @@ Find below different scenarios to manage data governance, protection, and compli
521
522
> This capability is currently in public preview and is achieved through integration with **OpenLineage**, an open framework for data lineage collection and analysis.
522
523
523
524
How it works:
525
+
524
526
1.**Enable OpenLineage in Airflow**: By enabling OpenLineage in your Airflow instance, metadata and lineage information about jobs and datasets are automatically tracked as Directed Acyclic Graphs (DAGs) execute.
525
527
2.**Azure Event Hubs**: The tracked metadata and lineage information are sent to an Azure Event Hubs instance that you configure.
526
528
3.**Microsoft Purview**: Purview subscribes to the events from Azure Event Hubs, parses them, and ingests the metadata and lineage into the data map.
527
529
528
530
This integration supports capturing metadata such as:
Copy file name to clipboardExpand all lines: README.md
+19-21Lines changed: 19 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,7 +18,6 @@ Last updated: 2025-07-17
18
18
19
19
</details>
20
20
21
-
22
21
> [!IMPORTANT]
23
22
> The [Azure Databases Advisor Tool](https://microsoftcloudessentials-learninghub.github.io/Azure-Databases-Purview-Advisor/) is designed to help users select the most suitable Azure database service based on their specific use case. It provides recommendations by analyzing user inputs such as data type, scalability needs, latency requirements, and more.
24
23
> The information provided and any document (such as scripts, sample codes, etc.) is provided `AS-IS` and `WITH ALL FAULTS`. Pricing estimates are for `demonstration purposes only and do not reflect final pricing`. `Microsoft assumes no liability` for your use of this information and makes no guarantees or warranties, expressed or implied, regarding its accuracy or completeness, including any pricing details. `Please note that these demos are intended as a guide and are based on personal experiences. For official guidance, support, or more detailed information, please refer to Microsoft's official documentation or contact Microsoft directly`: [Microsoft Sales and Support](https://support.microsoft.com/contactus?ContactUsExperienceEntryPointAssetId=S.HP.SMC-HOME)
@@ -33,30 +32,29 @@ Last updated: 2025-07-17
33
32
<summary><b>Details</b> (Click to expand)</summary>
34
33
35
34
> -**Formats**<br/>
36
-
> - Structured: Stored in predefined formats like rows and columns with consistent schema enforcement.<br/>
37
-
> - Unstructured: Exists in diverse formats like free text, images, audio, video, and documents that lack a formal structure.<br/>
35
+
> - Structured: Stored in predefined formats like rows and columns with consistent schema enforcement.<br/>
36
+
> - Unstructured: Exists in diverse formats like free text, images, audio, video, and documents that lack a formal structure.<br/>
38
37
> -**Storage Model**<br/>
39
-
> - Structured: Uses rigid, predefined schemas in relational databases ensuring integrity and data validation.<br/>
40
-
> - Unstructured: Stored in flexible formats such as object storage, document stores, or blob storage without a fixed schema.<br/>
38
+
> - Structured: Uses rigid, predefined schemas in relational databases ensuring integrity and data validation.<br/>
39
+
> - Unstructured: Stored in flexible formats such as object storage, document stores, or blob storage without a fixed schema.<br/>
41
40
> -**Databases**<br/>
42
-
> - Structured: Managed through SQL-based systems like Azure SQL, MySQL, and PostgreSQL.<br/>
43
-
> - Unstructured: Supported by NoSQL systems like Cosmos DB, MongoDB, and cloud-native data lakes.<br/>
41
+
> - Structured: Managed through SQL-based systems like Azure SQL, MySQL, and PostgreSQL.<br/>
42
+
> - Unstructured: Supported by NoSQL systems like Cosmos DB, MongoDB, and cloud-native data lakes.<br/>
44
43
> -**Ease of Search**<br/>
45
-
> - Structured: Easily queried using SQL, indexing, and standardized query languages.<br/>
46
-
> - Unstructured: Requires more advanced approaches like keyword extraction, OCR, or AI-assisted search tools.<br/>
44
+
> - Structured: Easily queried using SQL, indexing, and standardized query languages.<br/>
45
+
> - Unstructured: Requires more advanced approaches like keyword extraction, OCR, or AI-assisted search tools.<br/>
47
46
> -**Analysis Methods**<br/>
48
-
> - Structured: Suited for quantitative techniques, including statistical modeling, trend analysis, and aggregation.<br/>
49
-
> - Unstructured: Often analyzed with qualitative approaches like NLP, sentiment analysis, topic modeling, or deep learning.<br/>
47
+
> - Structured: Suited for quantitative techniques, including statistical modeling, trend analysis, and aggregation.<br/>
48
+
> - Unstructured: Often analyzed with qualitative approaches like NLP, sentiment analysis, topic modeling, or deep learning.<br/>
50
49
> -**Tools and Technologies**<br/>
51
-
> - Structured: RDBMS (SQL Server, Oracle), OLTP systems, CRM platforms, and OLAP tools for analytics.<br/>
52
-
> - Unstructured: NoSQL DBMS, data mining frameworks, ML pipelines, AI services, and visualization platforms like Power BI.<br/>
50
+
> - Structured: RDBMS (SQL Server, Oracle), OLTP systems, CRM platforms, and OLAP tools for analytics.<br/>
51
+
> - Unstructured: NoSQL DBMS, data mining frameworks, ML pipelines, AI services, and visualization platforms like Power BI.<br/>
53
52
> -**Specialists**<br/>
54
-
> - Structured: Typically handled by business analysts, software engineers, solution architects, and DBAs.<br/>
55
-
> - Unstructured: Requires data scientists, AI/ML specialists, information architects, and advanced data engineers.<br/>
53
+
> - Structured: Typically handled by business analysts, software engineers, solution architects, and DBAs.<br/>
54
+
> - Unstructured: Requires data scientists, AI/ML specialists, information architects, and advanced data engineers.<br/>
56
55
57
56
</details>
58
57
59
-
60
58
## Products/Services
61
59
62
60
```mermaid
@@ -127,7 +125,7 @@ Click here to read more about a [quick guide on SQL Server on Azure Virtual Mach
127
125
<details>
128
126
<summary><b>Azure Database for PostgreSQL</b> (PaaS) - Click to expand </summary>
129
127
130
-
> Enterprise-ready community PostgreSQL database service, fully managed by Microsoft.
128
+
> Enterprise-ready community PostgreSQL database service, fully managed by Microsoft.
131
129
132
130
> -**Benefits:** High availability with up to 99.99% SLA, built-in security, and scalability.<br/>
133
131
> -**Differentiators:** Supports PostgreSQL extensions and advanced indexing options.<br/>
@@ -169,7 +167,7 @@ Click here to read more about a [quick guide on Oracle Database on Azure](./sql/
169
167
<details>
170
168
<summary><b>SQL Server 2022</b> (IaaS) - Click to expand </summary>
171
169
172
-
> Latest release of SQL Server with built-in hybrid and cloud-connected capabilities.
170
+
> Latest release of SQL Server with built-in hybrid and cloud-connected capabilities.
173
171
174
172
> -**Benefits:** Brings innovations like ledger tables, Synapse Link, and built-in security enhancements.<br/>
175
173
> -**Differentiators:** Full hybrid flexibility for modern apps with backward compatibility.<br/>
@@ -197,7 +195,7 @@ Click here to read more about a [quick guide on Azure Cosmos DB](./nosql/azure-c
197
195
<details>
198
196
<summary><b>Azure Managed Instance for Apache Cassandra</b> (PaaS) - Click to expand </summary>
199
197
200
-
> Managed Cassandra database service designed for massive scale and availability.
198
+
> Managed Cassandra database service designed for massive scale and availability.
201
199
202
200
> -**Benefits:** Built-in automation, scalability, and hybrid deployment options.<br/>
203
201
> -**Differentiators:** Supports native Cassandra drivers and schemas with Azure-managed benefits.<br/>
@@ -252,7 +250,7 @@ Click here to read more about a [quick guide on Azure Cache for Redis](./nosql/a
0 commit comments