ssis组件
In this article, we will talk briefly about data quality in SQL Server. Then, we will give a brief overview of the Melissa Data Quality for SQL Server Integration Services (SSIS), and we will demonstrate the components available in the community edition.
在本文中,我们将简要讨论SQL Server中的数据质量。 然后,我们将简要概述SQL Server集成服务(SSIS)的Melissa数据质量,并演示社区版中可用的组件。
介绍 (Introduction)
In general, data quality is the level of how much data fits its serving context. Enhancing the data quality is very critical since poor quality leads to inaccurate reporting and results in wrong decisions and inevitably economic damages. For this reason, each data management system provided a bunch of tools used to improve the data quality level.
通常,数据质量是多少数据适合其服务上下文的级别。 数据质量的提高非常关键,因为质量差会导致报告不准确,并导致错误的决策和不可避免的经济损失。 因此,每个数据管理系统都提供了一堆用于提高数据质量级别的工具。
For SQL Server, many technologies can be used to enhance data quality:
对于SQL Server,可以使用许多技术来提高数据质量:
SQL Server数据质量服务(DQS) (SQL Server Data Quality Services (DQS))
Data Quality Services is a knowledge-driven data quality feature developed by Microsoft and released in SQL Server 2012. It can be installed from the SQL Server installation, and it provides different services, such as building a knowledge base, data de-duplication, standardization.
数据质量服务是由Microsoft开发并在SQL Server 2012中发布的知识驱动的数据质量功能。可以从SQL Server安装中安装它,并且它提供不同的服务,例如构建知识库,重复数据删除,标准化。
To learn more about this feature, you can refer to the following articles:
要了解有关此功能的更多信息,可以参考以下文章:
- How to clean data using Data Quality Services and SQL Server Integration Services 如何使用数据质量服务和SQL Server集成服务清除数据
- How to use SQL Server Data Quality Services to ensure the correct aggregation of data 如何使用SQL Server数据质量服务确保正确的数据聚合
- How to clean Master Data Services data using Data Quality Services in SQL Server 如何在SQL Server中使用数据质量服务清除主数据服务数据
使用众所周知的Microsoft SSIS组件 (Using the well-known Microsoft SSIS Components)
SQL Server Integration Services provides a bunch of components that can be used to assess and enhance data quality. These operations can be performed at the control flow level, such as data profiling and validation, or the data flow level using fuzzy lookups, conditional splits, derived columns, script component, and others.
SQL Server Integration Services提供了一堆可用于评估和增强数据质量的组件。 这些操作可以在控制流级别上执行,例如数据概要分析和验证,或者使用模糊查找 , 条件拆分 , 派生列 , 脚本组件等在数据流级别上执行。
编写SQL语句 (Writing SQL Statements)
One of the most popular data cleaning approaches is implementing your own logic using SQL statements, which is known as data wrangling. SQL Server provided a bunch of system functions that can be used to improve data quality.
最受欢迎的数据清理方法之一是使用SQL语句实现自己的逻辑,这称为数据整理。 SQL Server 提供了许多系统功能,可用于提高数据质量 。
使用第三方组件 (Using third-party components)
One of the most beautiful things in the Visual Studio IDE is that it is allowed to develop third-party components and integrate them within Microsoft products such as SSIS. Many companies developed third-party SSIS components such as CDATA, Kingwaysoft, and COZYROC.
Visual Studio IDE中最漂亮的功能之一是允许开发第三方组件并将其集成到Microsoft产品(例如SSIS)中。 许多公司开发了第三方SSIS组件,例如CDATA,Kingwaysoft和COZYROC。
Regarding data quality, one of the most popular products in the market is Melissa data quality components for SQL Server.
关于数据质量,市场上最受欢迎的产品之一是用于SQL Server的Melissa数据质量组件。
Melissa SQL Server的数据质量 (Melissa Data Quality for SQL Server)
Melissa data quality tools are a set of SSIS components that are used to clean and enrich data during the data transfer or integration process. Two editions are available:
Melissa数据质量工具是一组SSIS组件,用于在数据传输或集成过程中清理和丰富数据。 有两个版本 :
- Enterprise edition: Commercial, contains a wide variety of data quality components and online services
- 企业版 :商业版 ,包含各种数据质量组件和在线服务
- Community edition: Free, but only a few components are available (check the link above) 社区版 :免费,但只有少数组件可用(请查看上面的链接)
In this article, we will be talking about the community edition, and we will briefly illustrate its components.
在本文中,我们将讨论社区版本,并简要说明其组成部分。
下载Melissa数据质量社区版 (Download Melissa data quality community edition)
To download the Melissa data quality community edition, you should navigate to the SQL Server editions page. Then, request a demo by filling the form located on the left side of the page. And make sure to select the community edition.
要下载Melissa数据质量社区版本,您应该导航到SQL Server版本页面 。 然后,通过填写页面左侧的表格来请求演示。 并确保选择社区版本。