数据清洗
- 网络Data cleaning;data cleansing;data clearing
-
Web数据清洗研究
Research of the Web Data Cleaning
-
基于XML数据清洗的应用研究
Study on Data Cleaning Based on XML and Its Application
-
通过开发Web文本数据清洗系统,重点研究和讨论了所涉及的Web文本清洗的关键技术。
Through the development of the system , the key technologies involved in the system are discussed .
-
数据清洗及XML技术在数字报刊中的研究与应用
Research and Application of Data Cleaning and XML Technologies Based on Digital Newspaper
-
分析了XML语言在数据清洗上的应用优势;
Analyze the privilege of XML on data cleansing ;
-
XML与数据清洗的研究
The Research of Data Cleansing with XML
-
基于RFID应用的综合性数据清洗策略
Integrated Data Cleaning Strategy Based on RFID Applications
-
基于伪事件的RFID数据清洗方法
RFID Data Cleaning Method Based on Pseudo Event
-
一种基于Token匹配的中文数据清洗方法
An approach for Chinese data cleaning based on token
-
一种ODS环境下的混合数据清洗策略
A Combined Data Cleansing Strategy Under ODS Environment
-
阐述Web挖掘和推荐系统的一些基本概念和基础知识,对推荐系统工作流程中的数据清洗进行了研究,并对数据清洗模块进行了设计与实现。
Described some basic concepts and basic knowledge of recommendation systems ; researched the date preprocessing of the recommended work flow in E-Commerce recommendation system , designed and realized the date preprocessing module . 2 .
-
基于虚拟空间粒度的RFID数据清洗方法bspace
BSpace : A Data Cleaning Approach for RFID Data Streams Based on Virtual Spatial Granularity
-
通过具体的应用验证了数据清洗系统对数据的正确性、有效性、完整性与一致性都有良好的检测与控制能力,由此证明了基于多Agent的数据清洗系统的实用性。
Through specific application , this thesis verifies that Data cleansing system has good detection and control capability at the accuracy , effectiveness , integrity and consistency of data , and verifies the practicability of data cleansing system based on multi-agent .
-
针对现有检测复制记录技术存在的不足,提出了采用Canopy聚类技术进行聚类复制记录的数据清洗方法,并通过实验结果验证了所提算法的有效性和准确性。
After analyzing problems of existing techniques for duplicate records detection , this paper proposes an approach of data cleaning , by using the Canopy clustering technique to cluster duplicate records . Experiment results show effectiveness and accuracy of these algorithms .
-
在给出ETL过程中数据清洗模型的基础上,针对已知和未知的错误类型,以及语义上的错误,提出了一种自动清洗和人为清洗相混合的数据清洗策略,具有较好的现实意义。
After discussing the data cleansing model in ETL , and to solve the known or unknown error and semantic error , this paper proposes a data cleansing strategy of combination of automatic and manual methods that has a better realism significance .
-
借助于粗糙属性向量树(RAVT)的巧妙构造,提出了两种能同时完成属性约简、数据清洗和规则提取的快速递推矩阵算法(RMC)和分布式并行矩阵算法(PMC)。
Based on a Rough Attribute Vector Tree ( RAVT ), two kinds of fast matrix computation algorithms & Recursive Matrix Computation ( RMC ) method and Parallel Matrix Computation ( PMC ) method are proposed for data cleaning and rules extraction finished synchronously in rough information system .
-
实际的开发案例证明:使用DCPM模型建模数据清洗流程并基于C+ADC框架进行数据清洗应用开发,能够快速地构建基于构件的灵活的、可扩展的数据清洗应用软件。
A practical development case has proven that development of data cleansing application based on DCPM ( Data Cleansing Process Model ) and C + ADC ( Component-extended Agile Data Cleaning ), can construct quickly a flexible and extendable component-based data cleansing application software .
-
垂直搜索中的数据清洗和排序算法研究
Research on Data Cleaning and Ranking Algorithm in Vertical Search Engine
-
一种基于聚类树的增量式数据清洗算法
An incremental algorithms of data cleansing based on clustering tree
-
该文提出并实现了一个可扩展的数据清洗框架。
This paper presents an open and extensible framework for data cleaning .
-
交通流数据清洗的关键理论及方法研究
Study on Key Theory and Methods for Data Cleaning of Traffic Flow
-
数据清洗方法与构件的综合技术研究
An integrated technology of method and component for data cleaning
-
定量专利分析的样本选取与数据清洗
Sample selection and data cleansing for quantitative analysis of patents
-
系统提供了方便、易用的可视化的数据清洗流程定义环境。
The system provides a visual environment to define the data cleaning workflow .
-
数据清洗技术在期刊元数据整合中的应用
The Application of Data Cleaning in Periodical Metadata Integration
-
基于软件总线模型的数据清洗系统的研究与实现
Research and Implementation of the Data Clean System Based on Software Bus Model
-
基于聚类分析技术的数据清洗研究
Improved Algorithms for Data Cleansing Based on Clustering Analysis
-
基于聚类模式的数据清洗技术
Towards Data-Mining : Data Cleaning Based on Clustering Techniques
-
然后总结了数据清洗技术的原理方法。
Second , summarize the principle and the method of data cleansing techniques .
-
因此,必须进行数据清洗来提高信息系统的数据质量。
So , data cleaning is vital to improve data quality of information system .