When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths

Background: When a specimen belongs to a species not yet represented in DNA barcode reference libraries there is disagreement over the effectiveness of using sequence comparisons to assign the query accurately to a higher taxon. Library completeness and the assignment criteria used have been propose...

Full description

Saved in:
Bibliographic Details
Main Authors: Wilson, J.J., Schonfeld, J., Janzen, D.H., Hallwachs, W., Hajibabaei, M., Kitching, I.J., Haxaire, J., Hebert, P.D.N., Rougerie, R.
Format: Article
Published: 2011
Subjects:
Online Access:http://eprints.um.edu.my/4630/
http://www.scopus.com/inward/record.url?eid=2-s2.0-79960862758&partnerID=40&md5=9cd98d29a761b9cde693679a993a16b5
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.4630
record_format eprints
spelling my.um.eprints.46302014-12-26T02:43:59Z http://eprints.um.edu.my/4630/ When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths Wilson, J.J. Schonfeld, J. Janzen, D.H. Hallwachs, W. Hajibabaei, M. Kitching, I.J. Haxaire, J. Hebert, P.D.N. Rougerie, R. QH301 Biology Background: When a specimen belongs to a species not yet represented in DNA barcode reference libraries there is disagreement over the effectiveness of using sequence comparisons to assign the query accurately to a higher taxon. Library completeness and the assignment criteria used have been proposed as critical factors affecting the accuracy of such assignments but have not been thoroughly investigated. We explored the accuracy of assignments to genus, tribe and subfamily in the Sphingidae, using the almost complete global DNA barcode reference library (1095 species) available for this family. Costa Rican sphingids (118 species), a well-documented, diverse subset of the family, with each of the tribes and subfamilies represented were used as queries. We simulated libraries with different levels of completeness (10-100% of the available species), and recorded assignments (positive or ambiguous) and their accuracy (true or false) under six criteria.Results: A liberal tree-based criterion assigned 83% of queries accurately to genus, 74% to tribe and 90% to subfamily, compared to a strict tree-based criterion, which assigned 75% of queries accurately to genus, 66% to tribe and 84% to subfamily, with a library containing 100% of available species (but excluding the species of the query). The greater number of true positives delivered by more relaxed criteria was negatively balanced by the occurrence of more false positives. This effect was most sharply observed with libraries of the lowest completeness where, for example at the genus level, 32% of assignments were false positives with the liberal criterion versus < 1% when using the strict. We observed little difference (< 8% using the liberal criterion) however, in the overall accuracy of the assignments between the lowest and highest levels of library completeness at the tribe and subfamily level.Conclusions: Our results suggest that when using a strict tree-based criterion for higher taxon assignment with DNA barcodes, the likelihood of assigning a query a genus name incorrectly is very low, if a genus name is provided it has a high likelihood of being accurate, and if no genus match is available the query can nevertheless be assigned to a subfamily with high accuracy regardless of library completeness. DNA barcoding often correctly assigned sphingid moths to higher taxa when species matches were unavailable, suggesting that barcode reference libraries can be useful for higher taxon assignments long before they achieve complete species coverage. © 2011 Wilson et al; licensee BioMed Central Ltd. 2011 Article PeerReviewed Wilson, J.J. and Schonfeld, J. and Janzen, D.H. and Hallwachs, W. and Hajibabaei, M. and Kitching, I.J. and Haxaire, J. and Hebert, P.D.N. and Rougerie, R. (2011) When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths. BMC Ecology, 11. p. 18. ISSN 14726785 http://www.scopus.com/inward/record.url?eid=2-s2.0-79960862758&partnerID=40&md5=9cd98d29a761b9cde693679a993a16b5 10.1186/1472-6785-11-18
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QH301 Biology
spellingShingle QH301 Biology
Wilson, J.J.
Schonfeld, J.
Janzen, D.H.
Hallwachs, W.
Hajibabaei, M.
Kitching, I.J.
Haxaire, J.
Hebert, P.D.N.
Rougerie, R.
When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths
description Background: When a specimen belongs to a species not yet represented in DNA barcode reference libraries there is disagreement over the effectiveness of using sequence comparisons to assign the query accurately to a higher taxon. Library completeness and the assignment criteria used have been proposed as critical factors affecting the accuracy of such assignments but have not been thoroughly investigated. We explored the accuracy of assignments to genus, tribe and subfamily in the Sphingidae, using the almost complete global DNA barcode reference library (1095 species) available for this family. Costa Rican sphingids (118 species), a well-documented, diverse subset of the family, with each of the tribes and subfamilies represented were used as queries. We simulated libraries with different levels of completeness (10-100% of the available species), and recorded assignments (positive or ambiguous) and their accuracy (true or false) under six criteria.Results: A liberal tree-based criterion assigned 83% of queries accurately to genus, 74% to tribe and 90% to subfamily, compared to a strict tree-based criterion, which assigned 75% of queries accurately to genus, 66% to tribe and 84% to subfamily, with a library containing 100% of available species (but excluding the species of the query). The greater number of true positives delivered by more relaxed criteria was negatively balanced by the occurrence of more false positives. This effect was most sharply observed with libraries of the lowest completeness where, for example at the genus level, 32% of assignments were false positives with the liberal criterion versus < 1% when using the strict. We observed little difference (< 8% using the liberal criterion) however, in the overall accuracy of the assignments between the lowest and highest levels of library completeness at the tribe and subfamily level.Conclusions: Our results suggest that when using a strict tree-based criterion for higher taxon assignment with DNA barcodes, the likelihood of assigning a query a genus name incorrectly is very low, if a genus name is provided it has a high likelihood of being accurate, and if no genus match is available the query can nevertheless be assigned to a subfamily with high accuracy regardless of library completeness. DNA barcoding often correctly assigned sphingid moths to higher taxa when species matches were unavailable, suggesting that barcode reference libraries can be useful for higher taxon assignments long before they achieve complete species coverage. © 2011 Wilson et al; licensee BioMed Central Ltd.
format Article
author Wilson, J.J.
Schonfeld, J.
Janzen, D.H.
Hallwachs, W.
Hajibabaei, M.
Kitching, I.J.
Haxaire, J.
Hebert, P.D.N.
Rougerie, R.
author_facet Wilson, J.J.
Schonfeld, J.
Janzen, D.H.
Hallwachs, W.
Hajibabaei, M.
Kitching, I.J.
Haxaire, J.
Hebert, P.D.N.
Rougerie, R.
author_sort Wilson, J.J.
title When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths
title_short When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths
title_full When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths
title_fullStr When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths
title_full_unstemmed When species matches are unavailable are DNA barcodes correctly assigned to Higher Taxa? an assessment using sphingid moths
title_sort when species matches are unavailable are dna barcodes correctly assigned to higher taxa? an assessment using sphingid moths
publishDate 2011
url http://eprints.um.edu.my/4630/
http://www.scopus.com/inward/record.url?eid=2-s2.0-79960862758&partnerID=40&md5=9cd98d29a761b9cde693679a993a16b5
_version_ 1643687380230602752
score 13.18916