{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T12:29:53Z","timestamp":1773664193105,"version":"3.50.1"},"reference-count":14,"publisher":"World Scientific Pub Co Pte Lt","issue":"04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Bioinform. Comput. Biol."],"published-print":{"date-parts":[[2008,8]]},"abstract":"<jats:p> Expressed sequence tags (ESTs) represent 500\u20131000-bp-long sequences corresponding to mRNAs derived from different sources (cell lines, tissues, etc.). The human EST database contains over 8,000,000 sequences, with over 4,000,000,000 total nucleotides. RNA molecules are transcribed from a genomic DNA template; therefore, all ESTs should match corresponding genomes. Nevertheless, we have found in the human EST database approximately 11,000 ESTs not matching sequences in the human genome database. The presence of \"trash\" ESTs (TESTs) in the EST database could result from DNA or RNA contamination of the laboratory equipment, tissues, or cell lines. TESTs could also represent sequences from unidentified human genes or from species inhabiting the human body. Here, we attempt to identify the sources of human EST database contaminations. In particular, we discuss systematic contamination of the mammalian EST databases with sequences of plants. <\/jats:p>","DOI":"10.1142\/s0219720008003709","type":"journal-article","created":{"date-parts":[[2008,9,2]],"date-time":"2008-09-02T11:01:05Z","timestamp":1220353265000},"page":"759-773","source":"Crossref","is-referenced-by-count":3,"title":["HUMAN TRASH ESTs \u2014 SEQUENCES FROM cDNA COLLECTION THAT ARE NOT ALIGNED TO GENOME ASSEMBLY"],"prefix":"10.1142","volume":"06","author":[{"given":"ALEXANDER Y.","family":"PANCHIN","sequence":"first","affiliation":[{"name":"Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia"},{"name":"Department of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia"}]},{"given":"SERGEY A.","family":"SPIRIN","sequence":"additional","affiliation":[{"name":"A.N. Belozersky Institute of Physico-Chemical Biology, Moscow State University, Moscow, Russia"}]},{"given":"SERGEY A.","family":"LUKYANOV","sequence":"additional","affiliation":[{"name":"Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia"}]},{"given":"YURI B.","family":"LEBEDEV","sequence":"additional","affiliation":[{"name":"Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia"}]},{"given":"YURI V.","family":"PANCHIN","sequence":"additional","affiliation":[{"name":"A.N. Belozersky Institute of Physico-Chemical Biology, Moscow State University, Moscow, Russia"},{"name":"Institute of Problems of Information Transmission, Russian Academy of Sciences, Moscow, Russia"}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.1038\/ng0893-332"},{"key":"rf2","doi-asserted-by":"publisher","DOI":"10.1016\/0022-2836(72)90458-5"},{"key":"rf3","doi-asserted-by":"publisher","DOI":"10.1016\/0092-8674(76)90121-5"},{"key":"rf4","doi-asserted-by":"publisher","DOI":"10.1016\/S0092-8674(02)01137-6"},{"key":"rf5","doi-asserted-by":"publisher","DOI":"10.1126\/science.1058040"},{"key":"rf6","doi-asserted-by":"publisher","DOI":"10.1038\/35057062"},{"key":"rf7","doi-asserted-by":"publisher","DOI":"10.1126\/science.1093857"},{"key":"rf8","doi-asserted-by":"publisher","DOI":"10.1159\/000095920"},{"key":"rf9","doi-asserted-by":"publisher","DOI":"10.1038\/nature05329"},{"key":"rf10","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.95.26.15502"},{"key":"rf11","doi-asserted-by":"publisher","DOI":"10.1038\/35888"},{"key":"rf12","doi-asserted-by":"publisher","DOI":"10.1126\/science.1087117"},{"key":"rf13","doi-asserted-by":"publisher","DOI":"10.1016\/j.bbrc.2005.03.199"},{"key":"rf14","volume-title":"The Cell: A Molecular Approach","author":"Geoffrey M. C.","year":"2000"}],"container-title":["Journal of Bioinformatics and Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0219720008003709","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,7]],"date-time":"2019-08-07T02:42:31Z","timestamp":1565145751000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0219720008003709"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,8]]},"references-count":14,"journal-issue":{"issue":"04","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2008,8]]}},"alternative-id":["10.1142\/S0219720008003709"],"URL":"https:\/\/doi.org\/10.1142\/s0219720008003709","relation":{},"ISSN":["0219-7200","1757-6334"],"issn-type":[{"value":"0219-7200","type":"print"},{"value":"1757-6334","type":"electronic"}],"subject":[],"published":{"date-parts":[[2008,8]]}}}