Format:
vii, 125 pages, illustrations, diagrams
Content:
Carrying out business processes successfully is closely linked to the quality of an organization's data inventory. Deficiencies in data quality lead to problems: Incorrect address data prevents (timely) shipments to customers. Erroneous orders lead to returns and thus to unnecessary effort. Incorrect pricing causes companies to lose revenue or impairs customer satisfaction. If orders or customer records cannot be retrieved, complaint management takes longer. Due to erroneous inventories, too few or too many supplies may be reordered. A particular data quality problem, and the cause of many of the issues mentioned above, is duplicates in databases. Duplicates are different representations of the same real-world object in a dataset. Because these representations differ from each other, they are hard for a computer to match. Moreover, the number of comparisons required to find these duplicates grows with the square of the dataset size. To cleanse the data, the duplicates must be detected and removed. Duplicate detection is a very laborious process. To achieve satisfactory results, appropriate software must be created and configured (similarity measures, partitioning keys, thresholds, etc.). Both require considerable manual effort and experience. - This thesis addresses the automation of parameter selection for duplicate detection and presents several novel approaches that eliminate the need for human experience in parts of the duplicate detection process. - [...]
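The configuration effort described in the abstract can be made concrete with a minimal Python sketch (illustrative only, not taken from the thesis; the record fields, the similarity function, the partitioning attribute, and the threshold value are all assumptions):

    from collections import defaultdict
    from difflib import SequenceMatcher
    from itertools import combinations

    # Hypothetical customer records; field names are illustrative only.
    records = [
        {"id": 1, "name": "John Smith",  "city": "Berlin"},
        {"id": 2, "name": "Jon Smith",   "city": "Berlin"},
        {"id": 3, "name": "Jane Miller", "city": "Potsdam"},
    ]

    def similarity(a, b):
        # One possible string similarity measure; real systems
        # must choose among many such measures.
        return SequenceMatcher(None, a["name"], b["name"]).ratio()

    # Partitioning key: only records sharing the key are compared,
    # avoiding the quadratic number of comparisons over the full dataset.
    partitions = defaultdict(list)
    for r in records:
        partitions[r["city"]].append(r)

    THRESHOLD = 0.8  # Manually chosen; such choices are what the thesis aims to automate.

    duplicates = []
    for group in partitions.values():
        for a, b in combinations(group, 2):
            if similarity(a, b) >= THRESHOLD:
                duplicates.append((a["id"], b["id"]))

    print(duplicates)  # [(1, 2)]

Each of the choices above (the similarity measure, the partitioning attribute, and the threshold) is a parameter whose manual, experience-driven selection the thesis seeks to replace with automated approaches.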
Note:
Dissertation, Universität Potsdam, 2018
Additional Edition:
Also published as an online edition: Zieger, Tobias: Self-adaptive data quality. Potsdam, 2018
Language:
English
Keywords:
Data quality; Data processing; Partitioning; Academic thesis
Author information:
Naumann, Felix, 1971-