UID:
(DE-602)gbv_835790460
Format:
1 Online-Ressource (xv, 106 Seiten)
,
Illustrationen
Edition:
Also available in print
ISBN:
1627058044
,
9781627058049
Series Statement:
Synthesis lectures on the semantic web #13
Content:
In recent years, several knowledge bases have been built to enable large-scale knowledge sharing, but also an entity-centric Web search, mixing both structured data and text querying. These knowledge bases offer machine-readable descriptions of real-world entities, e.g., persons, places, published on the Web as Linked Data. However, due to the different information extraction tools and curation policies employed by knowledge bases, multiple, complementary and sometimes conflicting descriptions of the same real-world entities may be provided. Entity resolution aims to identify different descriptions that refer to the same entity appearing either within or across knowledge bases. The objective of this book is to present the new entity resolution challenges stemming from the openness of the Web of data in describing entities by an unbounded number of knowledge bases, the semantic and structural diversity of the descriptions provided across domains even for the same real-world entities, as well as the autonomy of knowledge bases in terms of adopted processes for creating and curating entity descriptions. The scale, diversity, and graph structuring of entity descriptions in the Web of data essentially challenge how two descriptions can be effectively compared for similarity, but also how resolution algorithms can efficiently avoid examining pairwise all descriptions. The book covers a wide spectrum of entity resolution issues at the Web scale, including basic concepts and data structures, main resolution tasks and workflows, as well as state-of-the-art algorithmic techniques and experimental trade-offs
Content:
1. Web of data: describing and linking entities --
Content:
2. Matching and resolving entities -- 2.1 The problem of entity resolution -- 2.2 Similarity functions -- 2.2.1 Content-based similarity functions -- 2.2.2 Relational similarity functions -- 2.2.3 Approximations of similarity functions -- 2.3 Discussion --
Content:
3. Blocking -- 3.1 The problem of entity blocking -- 3.2 Blocking in traditional data warehouses -- 3.3 Blocking in the web of data -- 3.4 Block post-processing methods -- 3.5 Discussion --
Content:
4. Iterative entity resolution -- 4.1 The problem of iterative entity resolution -- 4.2 Merging-based iterative entity resolution -- 4.3 Relationship-based iterative entity resolution -- 4.4 Iterative blocking -- 4.5 Incremental entity resolution -- 4.6 Progressive entity resolution -- 4.7 Discussion --
Content:
5. Experimental evaluation of blocking algorithms -- 5.1 Datasets -- 5.2 Measures -- 5.3 Quality results -- 5.3.1 Identified matches (TPS) -- 5.3.2 Missed matches (FNS) -- 5.3.3 Non-matches (FPS and TNS) -- 5.4 Performance results -- 5.5 Different types of links -- 5.6 Lessons learned --
Content:
6. Conclusions -- Bibliography -- Authors' biographies
Note:
Includes bibliographical references (pages 91-104)
,
1. Web of data: describing and linking entities
,
2. Matching and resolving entities2.1 The problem of entity resolution -- 2.2 Similarity functions -- 2.2.1 Content-based similarity functions -- 2.2.2 Relational similarity functions -- 2.2.3 Approximations of similarity functions -- 2.3 Discussion
,
3. Blocking3.1 The problem of entity blocking -- 3.2 Blocking in traditional data warehouses -- 3.3 Blocking in the web of data -- 3.4 Block post-processing methods -- 3.5 Discussion
,
4. Iterative entity resolution4.1 The problem of iterative entity resolution -- 4.2 Merging-based iterative entity resolution -- 4.3 Relationship-based iterative entity resolution -- 4.4 Iterative blocking -- 4.5 Incremental entity resolution -- 4.6 Progressive entity resolution -- 4.7 Discussion
,
5. Experimental evaluation of blocking algorithms5.1 Datasets -- 5.2 Measures -- 5.3 Quality results -- 5.3.1 Identified matches (TPS) -- 5.3.2 Missed matches (FNS) -- 5.3.3 Non-matches (FPS and TNS) -- 5.4 Performance results -- 5.5 Different types of links -- 5.6 Lessons learned
,
6. ConclusionsBibliography -- Authors' biographies.
,
Also available in print.
,
Mode of access: World Wide Web.
,
System requirements: Adobe Acrobat Reader.
Additional Edition:
ISBN 1627058036
Additional Edition:
ISBN 9781627058032
Language:
English
Bookmarklink