Statistical approaches to account for false-positive errors in environmental DNA samples

José J. Lahoz-Monfort, Gurutzeta Guillera-Arroita, Reid Tingley

Research output: Contribution to journal › Article › Research › peer-review

72 Citations (Scopus)

Abstract

Environmental DNA (eDNA) sampling is prone to both false-positive and false-negative errors. We review statistical methods to account for such errors in the analysis of eDNA data and use simulations to compare the performance of different modelling approaches. Our simulations illustrate that even low false-positive rates can produce biased estimates of occupancy and detectability. We further show that removing or classifying single PCR detections in an ad hoc manner under the suspicion that such records represent false positives, as sometimes advocated in the eDNA literature, also results in biased estimation of occupancy, detectability and false-positive rates. We advocate alternative approaches to account for false-positive errors that rely on prior information, or the collection of ancillary detection data at a subset of sites using a sampling method that is not prone to false-positive errors. We illustrate the advantages of these approaches over ad hoc classifications of detections and provide practical advice and code for fitting these models in maximum likelihood and Bayesian frameworks. Given the severe bias induced by false-negative and false-positive errors, the methods presented here should be more routinely adopted in eDNA studies.
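The bias described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code. It assumes a simple design (2000 sites, 6 PCR replicates per site) and illustrative parameter values (occupancy 0.3, detection probability 0.5, false-positive probability 0.03 per replicate) that are not taken from the paper. It fits a standard occupancy model that ignores false positives, using a plain grid search over the likelihood instead of a dedicated package. It also tries one possible reading of the ad hoc rule, in which sites with a single positive replicate are recoded as non-detections. All function and variable names are hypothetical.

```python
import numpy as np

def simulate_counts(n_sites, n_reps, psi, p11, p10, rng):
    """Number of positive replicates per site when false positives can occur."""
    occupied = rng.random(n_sites) < psi
    # Per-replicate detection probability: p11 if occupied, p10 (false positive) if not.
    prob = np.where(occupied, p11, p10)
    return rng.binomial(n_reps, prob)

def fit_standard_occupancy(counts, n_reps, grid=np.linspace(0.01, 0.99, 99)):
    """Grid-search MLE of (psi, p) for an occupancy model that assumes no false positives."""
    freq = np.bincount(counts, minlength=n_reps + 1)  # number of sites with y = 0..K detections
    y = np.arange(n_reps + 1)
    psi = grid[:, None, None]
    p = grid[None, :, None]
    # Occupied-site term; the binomial coefficient is constant in (psi, p) and is omitted.
    lik = psi * p**y * (1 - p) ** (n_reps - y)
    # Unoccupied sites can only produce y = 0 under this model.
    lik = lik + (1 - psi) * (y == 0)
    loglik = (freq * np.log(lik)).sum(axis=2)
    i, j = np.unravel_index(np.argmax(loglik), loglik.shape)
    return grid[i], grid[j]

# Illustrative values only (assumptions, not from the paper).
rng = np.random.default_rng(1)
n_sites, n_reps, psi_true, p11 = 2000, 6, 0.3, 0.5

clean = simulate_counts(n_sites, n_reps, psi_true, p11, 0.0, rng)
with_fp = simulate_counts(n_sites, n_reps, psi_true, p11, 0.03, rng)
# One possible ad hoc rule: treat a single positive replicate as a non-detection.
adhoc = np.where(with_fp == 1, 0, with_fp)

psi_clean, p_clean = fit_standard_occupancy(clean, n_reps)
psi_fp, p_fp = fit_standard_occupancy(with_fp, n_reps)
psi_adhoc, p_adhoc = fit_standard_occupancy(adhoc, n_reps)

print(f"true values:         psi={psi_true:.2f}, p={p11:.2f}")
print(f"no false positives:  psi={psi_clean:.2f}, p={p_clean:.2f}")
print(f"3% false positives:  psi={psi_fp:.2f}, p={p_fp:.2f}")
print(f"ad hoc recoding:     psi={psi_adhoc:.2f}, p={p_adhoc:.2f}")
```

Under these assumptions, the fit to the data containing false positives should overestimate occupancy and underestimate detectability relative to the fit without them. The models advocated in the abstract would instead add an explicit false-positive parameter, informed by prior information or by ancillary unambiguous detections at a subset of sites; that extension is not shown here.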

Original language: English
Pages (from-to): 673-685
Number of pages: 13
Journal: Molecular Ecology Resources
Volume: 16
Issue number: 3
DOI: 10.1111/1755-0998.12486
Publication status: Published - 1 May 2016
Externally published: Yes

Keywords

  • Detectability
  • False negatives
  • Imperfect detection
  • Occupancy

Cite this

Lahoz-Monfort, José J. ; Guillera-Arroita, Gurutzeta ; Tingley, Reid. / Statistical approaches to account for false-positive errors in environmental DNA samples. In: Molecular Ecology Resources. 2016 ; Vol. 16, No. 3. pp. 673-685.
@article{67256d5d754944d1839bdde0265283fa,
title = "Statistical approaches to account for false-positive errors in environmental DNA samples",
abstract = "Environmental DNA (eDNA) sampling is prone to both false-positive and false-negative errors. We review statistical methods to account for such errors in the analysis of eDNA data and use simulations to compare the performance of different modelling approaches. Our simulations illustrate that even low false-positive rates can produce biased estimates of occupancy and detectability. We further show that removing or classifying single PCR detections in an ad hoc manner under the suspicion that such records represent false positives, as sometimes advocated in the eDNA literature, also results in biased estimation of occupancy, detectability and false-positive rates. We advocate alternative approaches to account for false-positive errors that rely on prior information, or the collection of ancillary detection data at a subset of sites using a sampling method that is not prone to false-positive errors. We illustrate the advantages of these approaches over ad hoc classifications of detections and provide practical advice and code for fitting these models in maximum likelihood and Bayesian frameworks. Given the severe bias induced by false-negative and false-positive errors, the methods presented here should be more routinely adopted in eDNA studies.",
keywords = "Detectability, False negatives, Imperfect detection, Occupancy",
author = "Lahoz-Monfort, {Jos{\'e} J.} and Gurutzeta Guillera-Arroita and Reid Tingley",
year = "2016",
month = "5",
day = "1",
doi = "10.1111/1755-0998.12486",
language = "English",
volume = "16",
pages = "673--685",
journal = "Molecular Ecology Resources",
issn = "1755-098X",
publisher = "Wiley-Blackwell",
number = "3",

}

Statistical approaches to account for false-positive errors in environmental DNA samples. / Lahoz-Monfort, José J.; Guillera-Arroita, Gurutzeta; Tingley, Reid.

In: Molecular Ecology Resources, Vol. 16, No. 3, 01.05.2016, p. 673-685.

TY - JOUR

T1 - Statistical approaches to account for false-positive errors in environmental DNA samples

AU - Lahoz-Monfort, José J.

AU - Guillera-Arroita, Gurutzeta

AU - Tingley, Reid

PY - 2016/5/1

Y1 - 2016/5/1

AB - Environmental DNA (eDNA) sampling is prone to both false-positive and false-negative errors. We review statistical methods to account for such errors in the analysis of eDNA data and use simulations to compare the performance of different modelling approaches. Our simulations illustrate that even low false-positive rates can produce biased estimates of occupancy and detectability. We further show that removing or classifying single PCR detections in an ad hoc manner under the suspicion that such records represent false positives, as sometimes advocated in the eDNA literature, also results in biased estimation of occupancy, detectability and false-positive rates. We advocate alternative approaches to account for false-positive errors that rely on prior information, or the collection of ancillary detection data at a subset of sites using a sampling method that is not prone to false-positive errors. We illustrate the advantages of these approaches over ad hoc classifications of detections and provide practical advice and code for fitting these models in maximum likelihood and Bayesian frameworks. Given the severe bias induced by false-negative and false-positive errors, the methods presented here should be more routinely adopted in eDNA studies.

KW - Detectability

KW - False negatives

KW - Imperfect detection

KW - Occupancy

UR - http://www.scopus.com/inward/record.url?scp=84949908582&partnerID=8YFLogxK

U2 - 10.1111/1755-0998.12486

DO - 10.1111/1755-0998.12486

M3 - Article

C2 - 26558345

AN - SCOPUS:84949908582

VL - 16

SP - 673

EP - 685

JO - Molecular Ecology Resources

JF - Molecular Ecology Resources

SN - 1755-098X

IS - 3

ER -