Get Free Shipping on orders over $79
Statistical Significance Testing for Natural Language Processing - Rotem Dror

Statistical Significance Testing for Natural Language Processing

By: Rotem Dror, Lotem Peled-Cohen, Segev Shlomov, Roi Reichart

eText | 1 June 2022

At a Glance

eText


$89.99

or 4 interest-free payments of $22.50 with

 or 

Instant online reading in your Booktopia eTextbook Library *

Why choose an eTextbook?

Instant Access *

Purchase and read your book immediately

Read Aloud

Listen and follow along as Bookshelf reads to you

Study Tools

Built-in study tools like highlights and more

* eTextbooks are not downloadable to your eReader or an app and can be accessed via web browsers only. You must be connected to the internet and have no technical issues with your device or browser that could prevent the eTextbook from operating.
Data-driven experimental analysis has become the main evaluation tool of Natural Language Processing (NLP) algorithms. In fact, in the last decade, it has become rare to see an NLP paper, particularly one that proposes a new algorithm, that does not include extensive experimental analysis, and the number of involved tasks, datasets, domains, and languages is constantly growing. This emphasis on empirical results highlights the role of statistical significance testing in NLP research: If we, as a community, rely on empirical evaluation to validate our hypotheses and reveal the correct language processing mechanisms, we better be sure that our results are not coincidental. The goal of this book is to discuss the main aspects of statistical significance testing in NLP. Our guiding assumption throughout the book is that the basic question NLP researchers and engineers deal with is whether or not one algorithm can be considered better than another one. This question drives the field forward as it allows the constant progress of developing better technology for language processing challenges. In practice, researchers and engineers would like to draw the right conclusion from a limited set of experiments, and this conclusion should hold for other experiments with datasets they do not have at their disposal or that they cannot perform due to limited time and resources. The book hence discusses the opportunities and challenges in using statistical significance testing in NLP, from the point of view of experimental comparison between two algorithms. We cover topics such as choosing an appropriate significance test for the major NLP tasks, dealing with the unique aspects of significance testing for non-convex deep neural networks, accounting for a large number of comparisons between two NLP algorithms in a statistically valid manner (multiple hypothesis testing), and, finally, the unique challenges yielded by the nature of the data and practices of the field.
on
Desktop
Tablet
Mobile

More in Artificial Intelligence

Medium Hot : Images in the Age of Heat - Hito Steyerl

eBOOK

RRP $22.66

$18.99

16%
OFF
AI Futures - Evgeny Morozov

eBOOK

RRP $16.88

$13.99

17%
OFF
AI-Powered Search - Trey Grainger

eBOOK

HBR Guide to Generative AI for Managers : HBR Guide - Elisa Farri

eBOOK

AI : The End of Human Race - Alex Wood

eBOOK