Collaborative annotation for reliable natural language processing : (Record no. 89239)

000 -LEADER
fixed length control field 04730cam a2200433Mi 4500
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20241205124536.0
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS--GENERAL INFORMATION
fixed length control field m o d
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field cr |||||||||||
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 241205b ||||| |||| 00| 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781848219045
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number https://onlinelibrary.wiley.com/doi/book/10.1002/9781119306696
Qualifying information (electronic bk.)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 1119307651
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781119306696
Qualifying information (electronic bk.)
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9781119307655
Qualifying information EPUB
035 ## - SYSTEM CONTROL NUMBER
System control number (OCoLC)953865897
037 ## - SOURCE OF ACQUISITION
Stock number 9781119307655
Source of stock number/acquisition Wiley
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title eng
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number QA76.9.N38
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 006.3/5
Edition number 23
100 1# - MAIN ENTRY--PERSONAL NAME
Preferred name for the person Fort, Karen,
Authority record control number http://id.loc.gov/authorities/names/no2016109001
Relator term author.
245 10 - TITLE STATEMENT
Title Collaborative annotation for reliable natural language processing :
Remainder of title technical and sociological aspects /
Statement of responsibility, etc Karen Fort.
250 ## - EDITION STATEMENT
Edition statement 1st.
264 #1 - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication, distribution, etc London :
Name of publisher, distributor, etc Wiley-ISTE,
Date of publication, distribution, etc 2016.
300 ## - PHYSICAL DESCRIPTION
Extent 1 online resource.
336 ## - CONTENT TYPE
Content type term text
Content type code txt
Source rdacontent.
337 ## - MEDIA TYPE
Media type term computer
Media type code c
Source rdamedia.
338 ## - CARRIER TYPE
Carrier type term online resource
Carrier type code cr
Source rdacarrier.
490 1# - SERIES STATEMENT
Series statement Focus series.
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc Includes bibliographical references and index.
505 0# - CONTENTS
Formatted contents note Table of Contents<br/>Preface ix<br/>List of Acronyms xi<br/><br/>Introduction xiii<br/><br/>Chapter 1. Annotating Collaboratively 1<br/><br/>1.1. The annotation process (re)visited 1<br/><br/>1.1.1. Building consensus 1<br/><br/>1.1.2. Existing methodologies 3<br/><br/>1.1.3. Preparatory work 7<br/><br/>1.1.4. Pre-campaign 13<br/><br/>1.1.5. Annotation 17<br/><br/>1.1.6. Finalization 21<br/><br/>1.2. Annotation complexity 24<br/><br/>1.2.1. Example overview 25<br/><br/>1.2.2. What to annotate? 28<br/><br/>1.2.3. How to annotate? 30<br/><br/>1.2.4. The weight of the context 36<br/><br/>1.2.5. Visualization 38<br/><br/>1.2.6. Elementary annotation tasks 40<br/><br/>1.3. Annotation tools 43<br/><br/>1.3.1. To be or not to be an annotation tool 43<br/><br/>1.3.2. Much more than prototypes 46<br/><br/>1.3.3. Addressing the new annotation challenges 49<br/><br/>1.3.4. The impossible dream tool 54<br/><br/>1.4. Evaluating the annotation quality 55<br/><br/>1.4.1. What is annotation quality? 55<br/><br/>1.4.2. Understanding the basics 56<br/><br/>1.4.3. Beyond kappas 63<br/><br/>1.4.4. Giving meaning to the metrics 67<br/><br/>1.5. Conclusion 75<br/><br/>Chapter 2. Crowdsourcing Annotation 77<br/><br/>2.1. What is crowdsourcing and why should we be interested in it? 77<br/><br/>2.1.1. A moving target 77<br/><br/>2.1.2. A massive success 80<br/><br/>2.2. Deconstructing the myths 81<br/><br/>2.2.1. Crowdsourcing is a recent phenomenon 81<br/><br/>2.2.2. Crowdsourcing involves a crowd (of non-experts) 83<br/><br/>2.2.3. “Crowdsourcing involves (a crowd of) non-experts” 87<br/><br/>2.3. Playing with a purpose 93<br/><br/>2.3.1. Using the players’ innate capabilities and world knowledge 94<br/><br/>2.3.2. Using the players’ school knowledge 96<br/><br/>2.3.3. Using the players’ learning capacities 97<br/><br/>2.4. Acknowledging crowdsourcing specifics 101<br/><br/>2.4.1. Motivating the participants 101<br/><br/>2.4.2. Producing quality data 107<br/><br/>2.5. Ethical issues 109<br/><br/>2.5.1. Game ethics 109<br/><br/>2.5.2. What’s wrong with Amazon Mechanical Turk? 111<br/><br/>2.5.3. A charter to rule them all 113<br/><br/>Conclusion 115<br/><br/>Appendix 117<br/><br/>Glossary 141<br/><br/>Bibliography 143<br/><br/>Index 163
520 ## - SUMMARY, ETC.
Summary, etc This book presents a unique opportunity for constructing a consistent image of collaborative manual annotation for Natural Language Processing (NLP). NLP has witnessed two major evolutions in the past 25 years: firstly, the extraordinary success of machine learning, which is now, for better or for worse, overwhelmingly dominant in the field, and secondly, the multiplication of evaluation campaigns or shared tasks. Both involve manually annotated corpora, for the training and evaluation of the systems.<br/><br/>These corpora have progressively become the hidden pillars of our domain, providing food for our hungry machine learning algorithms and reference for evaluation. Annotation is now the place where linguistics hides in NLP. However, manual annotation has largely been ignored for some time, and it has taken a while even for annotation guidelines to be recognized as essential.
545 0# - BIOGRAPHICAL OR HISTORICAL DATA
Biographical or historical note About the Author<br/>Karën Fort is Associate Professor at University Paris-Sorbonne (Paris 4) working on the STIH (meaning, text, computer science, history) team. Her current research interests include collaborative manual annotation, crowdsourcing and ethics.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Natural language processing (Computer science)
Authority record control number http://id.loc.gov/authorities/subjects/sh88002425.
655 #4 - INDEX TERM--GENRE/FORM
Genre/form data or focus term Electronic books.
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
Uniform title Focus series (London, England)
Authority record control number http://id.loc.gov/authorities/names/n2014186952.
856 40 - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier https://onlinelibrary.wiley.com/doi/book/10.1002/9781119306696
Link text Full text is available at Wiley Online Library Click here to view
942 ## - ADDED ENTRY ELEMENTS
Source of classification or shelving scheme
Item type EBOOK
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Damaged status Not for loan Permanent Location Current Location Date acquired Source of acquisition Inventory number Full call number Barcode Date last seen Price effective from Item type
          COLLEGE LIBRARY COLLEGE LIBRARY 2024-12-05 Megatexts Phil. Inc. 52302 006.35 F7756 2016 CL-52302 2024-12-05 2024-12-05 EBOOK