[Main Text]
on, and a bottleneck of information service. Therefore, how to manage data efficiently and improve its quality, so that data can serve as an effective basis for decision-making departments, is a problem of high research value and practical significance. In this context, this dissertation applies appropriate solutions to the different types of data errors and implements specific programs to verify the validity of the [...]. First, this dissertation introduces the definition of data quality, its classification, its evaluation indices, and the technologies for improving data quality. Second, it summarizes the principles and methods of data cleansing techniques. Finally, it gives corresponding solutions for the different error types, especially the detection of duplicate records and similar abnormal data. Considering the links within the data, this dissertation detects abnormal