000 -LEADER |
fixed length control field |
04730cam a2200433Mi 4500 |
005 - DATE AND TIME OF LATEST TRANSACTION |
control field |
20241205124536.0 |
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS--GENERAL INFORMATION |
fixed length control field |
m o d |
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION |
fixed length control field |
cr ||||||||||| |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION |
fixed length control field |
241205b ||||| |||| 00| 0 eng d |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9781848219045 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
https://onlinelibrary.wiley.com/doi/book/10.1002/9781119306696 |
Qualifying information |
(electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
1119307651 |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9781119306696 |
Qualifying information |
(electronic bk.) |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER |
International Standard Book Number |
9781119307655 |
Qualifying information |
EPUB |
035 ## - SYSTEM CONTROL NUMBER |
System control number |
(OCoLC)953865897 |
037 ## - SOURCE OF ACQUISITION |
Stock number |
9781119307655 |
Source of stock number/acquisition |
Wiley |
041 ## - LANGUAGE CODE |
Language code of text/sound track or separate title |
eng |
050 #4 - LIBRARY OF CONGRESS CALL NUMBER |
Classification number |
QA76.9.N38 |
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER |
Classification number |
006.3/5 |
Edition number |
23 |
100 1# - MAIN ENTRY--PERSONAL NAME |
Preferred name for the person |
Fort, Karen, |
Authority record control number |
http://id.loc.gov/authorities/names/no2016109001 |
Relator term |
author. |
245 10 - TITLE STATEMENT |
Title |
Collaborative annotation for reliable natural language processing : |
Remainder of title |
technical and sociological aspects / |
Statement of responsibility, etc |
Karen Fort. |
250 ## - EDITION STATEMENT |
Edition statement |
1st. |
264 #1 - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT) |
Place of publication, distribution, etc |
London : |
Name of publisher, distributor, etc |
Wiley-ISTE, |
Date of publication, distribution, etc |
2016. |
300 ## - PHYSICAL DESCRIPTION |
Extent |
1 online resource. |
336 ## - CONTENT TYPE |
Content type term |
text |
Content type code |
txt |
Source |
rdacontent. |
337 ## - MEDIA TYPE |
Media type term |
computer |
Media type code |
c |
Source |
rdamedia. |
338 ## - CARRIER TYPE |
Carrier type term |
online resource |
Carrier type code |
cr |
Source |
rdacarrier. |
490 1# - SERIES STATEMENT |
Series statement |
Focus series. |
504 ## - BIBLIOGRAPHY, ETC. NOTE |
Bibliography, etc |
Includes bibliographical references and index. |
505 0# - CONTENTS |
Formatted contents note |
Table of Contents<br/>Preface ix<br/>List of Acronyms xi<br/><br/>Introduction xiii<br/><br/>Chapter 1. Annotating Collaboratively 1<br/><br/>1.1. The annotation process (re)visited 1<br/><br/>1.1.1. Building consensus 1<br/><br/>1.1.2. Existing methodologies 3<br/><br/>1.1.3. Preparatory work 7<br/><br/>1.1.4. Pre-campaign 13<br/><br/>1.1.5. Annotation 17<br/><br/>1.1.6. Finalization 21<br/><br/>1.2. Annotation complexity 24<br/><br/>1.2.1. Example overview 25<br/><br/>1.2.2. What to annotate? 28<br/><br/>1.2.3. How to annotate? 30<br/><br/>1.2.4. The weight of the context 36<br/><br/>1.2.5. Visualization 38<br/><br/>1.2.6. Elementary annotation tasks 40<br/><br/>1.3. Annotation tools 43<br/><br/>1.3.1. To be or not to be an annotation tool 43<br/><br/>1.3.2. Much more than prototypes 46<br/><br/>1.3.3. Addressing the new annotation challenges 49<br/><br/>1.3.4. The impossible dream tool 54<br/><br/>1.4. Evaluating the annotation quality 55<br/><br/>1.4.1. What is annotation quality? 55<br/><br/>1.4.2. Understanding the basics 56<br/><br/>1.4.3. Beyond kappas 63<br/><br/>1.4.4. Giving meaning to the metrics 67<br/><br/>1.5. Conclusion 75<br/><br/>Chapter 2. Crowdsourcing Annotation 77<br/><br/>2.1. What is crowdsourcing and why should we be interested in it? 77<br/><br/>2.1.1. A moving target 77<br/><br/>2.1.2. A massive success 80<br/><br/>2.2. Deconstructing the myths 81<br/><br/>2.2.1. Crowdsourcing is a recent phenomenon 81<br/><br/>2.2.2. Crowdsourcing involves a crowd (of non-experts) 83<br/><br/>2.2.3. “Crowdsourcing involves (a crowd of) non-experts” 87<br/><br/>2.3. Playing with a purpose 93<br/><br/>2.3.1. Using the players’ innate capabilities and world knowledge 94<br/><br/>2.3.2. Using the players’ school knowledge 96<br/><br/>2.3.3. Using the players’ learning capacities 97<br/><br/>2.4. Acknowledging crowdsourcing specifics 101<br/><br/>2.4.1. Motivating the participants 101<br/><br/>2.4.2. Producing quality data 107<br/><br/>2.5. Ethical issues 109<br/><br/>2.5.1. Game ethics 109<br/><br/>2.5.2. What’s wrong with Amazon Mechanical Turk? 111<br/><br/>2.5.3. A charter to rule them all 113<br/><br/>Conclusion 115<br/><br/>Appendix 117<br/><br/>Glossary 141<br/><br/>Bibliography 143<br/><br/>Index 163 |
520 ## - SUMMARY, ETC. |
Summary, etc |
This book presents a unique opportunity for constructing a consistent image of collaborative manual annotation for Natural Language Processing (NLP). NLP has witnessed two major evolutions in the past 25 years: firstly, the extraordinary success of machine learning, which is now, for better or for worse, overwhelmingly dominant in the field, and secondly, the multiplication of evaluation campaigns or shared tasks. Both involve manually annotated corpora, for the training and evaluation of the systems.<br/><br/>These corpora have progressively become the hidden pillars of our domain, providing food for our hungry machine learning algorithms and reference for evaluation. Annotation is now the place where linguistics hides in NLP. However, manual annotation has largely been ignored for some time, and it has taken a while even for annotation guidelines to be recognized as essential. |
545 0# - BIOGRAPHICAL OR HISTORICAL DATA |
Biographical or historical note |
About the Author<br/>Karën Fort is Associate Professor at University Paris-Sorbonne (Paris 4) working on the STIH (meaning, text, computer science, history) team. Her current research interests include collaborative manual annotation, crowdsourcing and ethics. |
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM |
Topical term or geographic name as entry element |
Natural language processing (Computer science) |
Authority record control number |
http://id.loc.gov/authorities/subjects/sh88002425. |
655 #4 - INDEX TERM--GENRE/FORM |
Genre/form data or focus term |
Electronic books. |
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE |
Uniform title |
Focus series (London, England) |
Authority record control number |
http://id.loc.gov/authorities/names/n2014186952. |
856 40 - ELECTRONIC LOCATION AND ACCESS |
Uniform Resource Identifier |
https://onlinelibrary.wiley.com/doi/book/10.1002/9781119306696 |
Link text |
Full text is available at Wiley Online Library Click here to view |
942 ## - ADDED ENTRY ELEMENTS |
Source of classification or shelving scheme |
|
Item type |
EBOOK |