{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T22:33:09Z","timestamp":1777761189259,"version":"3.51.4"},"reference-count":219,"publisher":"Emerald","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,10,12]]},"abstract":"<jats:p>This survey provides an exposition of a suite of techniques based on the theory of polynomials, collectively referred to as polynomial methods, which have recently been applied to address several challenging problems in statistical inference successfully. Topics including polynomial approximation, polynomial interpolation and majorization, moment space and positive polynomials, orthogonal polynomials and Gaussian quadrature are discussed, with their major probabilistic and statistical applications in property estimation on large domains and learning mixture models. These techniques provide useful tools not only for the design of highly practical algorithms with provable optimality, but also for establishing the fundamental limits of the inference problems through the method of moment matching. The effectiveness of the polynomial method is demonstrated in concrete problems such as entropy and support size estimation, distinct elements problem, and learning Gaussian mixture models.<\/jats:p>","DOI":"10.1561\/0100000095","type":"journal-article","created":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T05:41:57Z","timestamp":1602481317000},"page":"402-585","source":"Crossref","is-referenced-by-count":8,"title":["Polynomial Methods in Statistical Inference: Theory and Practice"],"prefix":"10.1108","volume":"17","author":[{"given":"Yihong","family":"Wu","sequence":"first","affiliation":[{"name":"Department of Statistics and Data Science , Yale University,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pengkun","family":"Yang","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering , Princeton University,","place":["USA"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"140","published-online":{"date-parts":[[2020,10,12]]},"reference":[{"key":"2026032712175530700_ref001","first-page":"265","volume-title":"12th USENIX Symposium on Operating Systems Design and Implementation (OSD1 16)","author":"Abadi","year":"2016"},{"key":"2026032712175530700_ref002","volume-title":"Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables","author":"Abramowitz","year":"1964"},{"key":"2026032712175530700_ref003","first-page":"11","volume-title":"Proceedings of the 34th International Conference on Machine Learning","author":"Acharya","year":"2017"},{"key":"2026032712175530700_ref004","first-page":"1855","volume-title":"Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms","author":"Acharya","year":"2015"},{"key":"2026032712175530700_ref005","first-page":"458","volume-title":"Proceedings of the 18th Annual Conference on Learning Theory (COLT 2005)","author":"Achlioptas","year":"2005"},{"key":"2026032712175530700_ref006","volume-title":"The Classical Moment Problem: And Some Related Questions in Analysis","author":"Akhiezer","year":"1965"},{"issue":"3","key":"2026032712175530700_ref007","volume":"2007","author":"Aktulga","year":"2007","journal-title":"EURASIP Journal on Bioinformatics and Systems Biology"},{"issue":"21","key":"2026032712175530700_ref008","doi-asserted-by":"crossref","first-page":"6556","DOI":"10.1093\/imrn\/rnx090","article-title":"\u201cAlg\u00e9braic identifiability of Gaussian mixtures\u201d","volume":"2018","author":"Amendola","year":"2016","journal-title":"International Mathematics Research Notices"},{"issue":"4","key":"2026032712175530700_ref009","doi-asserted-by":"crossref","first-page":"717","DOI":"10.1162\/089976604322860677","article-title":"\u201cEstimating the entropy rate of spike trains via Lempel-Ziv complexity\u201d","volume":"16","author":"Amig\u00f3","year":"2004","journal-title":"Neural Computation"},{"issue":"1","key":"2026032712175530700_ref010","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1111\/j.2517-6161.1974.tb00989.x","article-title":"\u201cScale mixtures of normal distributions\u201d","volume":"36","author":"Andrews","year":"1974","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"key":"2026032712175530700_ref011","first-page":"247","volume-title":"Proceedings of the Thirty-Third Annual ACM Symposium on Theory of Computing","author":"Arora","year":"2001"},{"key":"2026032712175530700_ref012","volume-title":"An Introduction to Numerical Analysis","author":"Atkinson","year":"1989"},{"key":"2026032712175530700_ref013","volume-title":"Applications of Information Theory to Psychology: A Summary of Basic Concepts, Methods, and Results","author":"Attneave","year":"1959"},{"issue":"1","key":"2026032712175530700_ref014","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1214\/16-AOS1435","article-title":"\u201cStatistical guarantees for the EM algorithm: From population to sample-based analysis\u201d","volume":"45","author":"Balakrishnan","year":"2017","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref015","author":"Bandeira","year":"2017"},{"key":"2026032712175530700_ref016","first-page":"1","volume-title":"Proceedings of the 6th Randomization and Approximation Techniques in Computer Science","author":"Bar-Yossef","year":"2002"},{"key":"2026032712175530700_ref017","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1145\/380752.380810","volume-title":"Proceedings of the Thirty-Third Annual ACM Symposium on Theory of Computing","author":"Bar-Yossef","year":"2001"},{"issue":"3","key":"2026032712175530700_ref018","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1137\/1104033","article-title":"\u201cOn a statistical estimate for the entropy of a sequence of independent random variables\u201d","volume":"4","author":"Basharin","year":"1959","journal-title":"Theory of Probability & Its Applications"},{"key":"2026032712175530700_ref019","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1109\/FOCS.2010.16","volume-title":"2010 51st Annual IEEE Symposium on Foundations of Computer Science (FOCS)","author":"Belkin","year":"2010"},{"key":"2026032712175530700_ref020","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1145\/1644893.1644900","volume-title":"Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement","author":"Benevenuto","year":"2009"},{"issue":"3","key":"2026032712175530700_ref021","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1214\/aos\/1176345003","article-title":"\u201cMinimum chi-square, not maximum likelihood! (with discussion)\u201d","volume":"8","author":"Berkson","year":"1980","journal-title":"The Annals of Statistics"},{"issue":"10","key":"2026032712175530700_ref022","doi-asserted-by":"crossref","first-page":"5411","DOI":"10.1073\/pnas.94.10.5411","article-title":"\u201cThe structure and precision of retinal spike trains\u201d","volume":"94","author":"Berry","year":"1997","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"2026032712175530700_ref023","first-page":"109","volume-title":"Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP","author":"Bhat","year":"2009"},{"issue":"2","key":"2026032712175530700_ref024","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1007\/BF00532480","article-title":"\u201cApproximation dans les espaces m\u00e9triques et th\u00e9orie de l\u2019estimation\u201d","volume":"65","author":"Birg\u00e9","year":"1983","journal-title":"Z. f\u00fcr Wahrscheinlichkeitstheorie und Verw. Geb"},{"key":"2026032712175530700_ref025","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1007\/3-540-36169-3_30","volume-title":"Algorithmic Learning Theory","author":"Braess","year":"2002"},{"issue":"2","key":"2026032712175530700_ref026","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/j.jat.2004.04.010","article-title":"\u201cBernstein polynomials and learning theory\u201d","volume":"128","author":"Braess","year":"2004","journal-title":"Journal of Approximation Theory"},{"key":"2026032712175530700_ref027","doi-asserted-by":"crossref","first-page":"771","DOI":"10.1145\/2746539.2746631","volume-title":"Proceedings of the Forty-Seventh Annual ACM Symposium on Theory of Computing","author":"Bresler","year":"2015"},{"key":"2026032712175530700_ref028","first-page":"551","volume-title":"IEEE 49th Annual IEEE Symposium on Foundations of Computer Science, 2008","author":"Brubaker","year":"2008"},{"issue":"6","key":"2026032712175530700_ref029","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1007\/BF01087176","article-title":"\u201cSub-Gaussian random variables\u201d","volume":"32","author":"Buldygin","year":"1980","journal-title":"Ukrainian Mathematical Journal"},{"issue":"421","key":"2026032712175530700_ref030","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1080\/01621459.1993.10594330","article-title":"\u201cEstimating the number of species: A review\u201d","volume":"88","author":"Bunge","year":"1993","journal-title":"Journal of the American Statistical Association"},{"issue":"5","key":"2026032712175530700_ref031","doi-asserted-by":"crossref","first-page":"927","DOI":"10.2307\/1936861","article-title":"\u201cRobust estimation of population size when capture probabilities vary among animals\u201d","volume":"60","author":"Burnham","year":"1979","journal-title":"Ecology"},{"key":"2026032712175530700_ref032","volume-title":"Algebraic Approximation: A Guide to Past and Current Solutions","author":"Bustamante","year":"2011"},{"issue":"6","key":"2026032712175530700_ref033","doi-asserted-by":"crossref","first-page":"2930","DOI":"10.1214\/009053605000000147","article-title":"\u201cNonquadratic estimators of a quadratic functional\u201d","volume":"33","author":"Cai","year":"2005","journal-title":"The Annals of Statistics"},{"issue":"2","key":"2026032712175530700_ref034","doi-asserted-by":"crossref","first-page":"1012","DOI":"10.1214\/10-AOS849","article-title":"\u201cTesting composite hypotheses, hermite polynomials and optimal estimation of a nonsmooth functional\u201d","volume":"39","author":"Cai","year":"2011","journal-title":"The Annals of Statistics"},{"issue":"4","key":"2026032712175530700_ref035","first-page":"265","article-title":"\u201cNonparametric estimation of the number of classes in a population\u201d","volume":"11","author":"Chao","year":"1984","journal-title":"Scandinavian Journal of Statistics"},{"issue":"417","key":"2026032712175530700_ref036","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1080\/01621459.1992.10475194","article-title":"\u201cEstimating the number of classes via sample coverage\u201d","volume":"87","author":"Chao","year":"1992","journal-title":"Journal of the American Statistical Association"},{"key":"2026032712175530700_ref037","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1145\/335168.335230","volume-title":"Proceedings of the Nineteenth ACM SIGMOD-SIGACTSIGART Symposium on Principles of Database Systems (PODS)","author":"Charikar","year":"2000"},{"key":"2026032712175530700_ref038","doi-asserted-by":"crossref","unstructured":"Chauss\u00e9, P.\n           (2010). \u201cComputing generalized method of moments and generalized empirical likelihood with R\u201d. Journal of Statistical Software. 34(11): 1\u201335. URL: http:\/\/www.jstatsoft.org\/v34\/ill\/.","DOI":"10.18637\/jss.v034.i11"},{"issue":"1","key":"2026032712175530700_ref039","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1214\/aos\/1176324464","article-title":"\u201cOptimal rate of convergence for finite mixture models\u201d","volume":"23","author":"Chen","year":"1995","journal-title":"The Annals of Statistics"},{"issue":"3","key":"2026032712175530700_ref040","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1109\/TIT.1968.1054142","article-title":"\u201cApproximating discrete probability distributions with dependence trees\u201d","volume":"14","author":"Chow","year":"1968","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2026032712175530700_ref041","volume-title":"Information Theory: Coding Theorems for Discrete Memoryless Systems","author":"Csiszar","year":"1982"},{"issue":"4","key":"2026032712175530700_ref042","first-page":"603","article-title":"\u201cRecursiveness, positivity, and truncated moment problems\u201d","volume":"17","author":"Curto","year":"1991","journal-title":"Houston Journal of Mathematics"},{"key":"2026032712175530700_ref043","doi-asserted-by":"crossref","first-page":"149","DOI":"10.2307\/2530505","article-title":"\u201cA note on capture-recapture estimation\u201d","volume":"36","author":"Darroch","year":"1980","journal-title":"Biometrics"},{"key":"2026032712175530700_ref044","first-page":"634","volume-title":"40th Annual Symposium on Foundations of Computer Science, 1999","author":"Dasgupta","year":"1999"},{"key":"2026032712175530700_ref045","first-page":"704","volume-title":"Proceedings of the 30th Annual Conference on Learning Theory (COLT 2017)","author":"Daskalakis","year":"2017"},{"key":"2026032712175530700_ref046","volume-title":"Interpolation and Approximation","author":"Davis","year":"1975"},{"key":"2026032712175530700_ref047","first-page":"46","volume":"1","author":"de Boor","year":"2005","journal-title":"Surveys in Approximation Theory"},{"issue":"1","key":"2026032712175530700_ref048","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1214\/aoms\/1177698536","article-title":"\u201cConstruction of sequences estimating the mixing distribution\u201d","volume":"39","author":"Deely","year":"1968","journal-title":"The Annals of Mathematical Statistics"},{"issue":"1","key":"2026032712175530700_ref049","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"\u201cMaximum likelihood from incomplete data via the EM algorithm\u201d","volume":"39","author":"Dempster","year":"1977","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"key":"2026032712175530700_ref050","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-662-02888-9","volume-title":"Constructive Approximation","author":"DeVore","year":"1993"},{"key":"2026032712175530700_ref051","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1090\/psapm\/037\/921087","volume-title":"Moments in Mathematics","author":"Diaconis","year":"1987"},{"issue":"2","key":"2026032712175530700_ref052","doi-asserted-by":"crossref","first-page":"1072","DOI":"10.1016\/j.jmaa.2015.08.034","article-title":"\u201cBounds for the logarithm of the Euler gamma function and its derivatives\u201d","volume":"433","author":"Diamond","year":"2016","journal-title":"Journal of Mathematical Analysis and Applications"},{"key":"2026032712175530700_ref053","volume-title":"Moduli of Smoothness","author":"Ditzian","year":"2012"},{"issue":"2","key":"2026032712175530700_ref054","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1137\/1103015","article-title":"\u201cA statistical problem arising in the theory of detection of signals in the presence of noise in a multi-channel system and leading to stable distribution laws\u201d","volume":"3","author":"Dobrushin","year":"1958","journal-title":"Theory of Probability & Its Applications"},{"key":"2026032712175530700_ref055","first-page":"668","article-title":"\u201cGeometrizing rates of convergence, II\u201d","volume":"19","author":"Donoho","year":"1991","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref056","author":"Doss","year":"2020"},{"key":"2026032712175530700_ref057","volume-title":"Theory of Uniform Approximation of Functions by Polynomials","author":"Dzyadyk","year":"2008"},{"issue":"4","key":"2026032712175530700_ref058","doi-asserted-by":"crossref","first-page":"1609","DOI":"10.1214\/aos\/1176351056","article-title":"\u201cEstimation of the mixing distribution for a normal mean with applications to the compound decision problem\u201d","volume":"16","author":"Edelman","year":"1988","journal-title":"The Annals of Statistics"},{"issue":"3","key":"2026032712175530700_ref059","first-page":"435","article-title":"\u201cEstimating the number of unseen species: How many words did Shakespeare know?\u201d","volume":"63","author":"Efron","year":"1976","journal-title":"Biometrika"},{"issue":"2","key":"2026032712175530700_ref060","doi-asserted-by":"crossref","first-page":"340","DOI":"10.1214\/aos\/1176345778","article-title":"\u201cMaximum likelihood and decision theory\u201d","volume":"10","author":"Efron","year":"1982","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref061","first-page":"185","article-title":"\u201cEstimation of the size of a coinage: A survey and comparison of methods\u201d","volume":"146","author":"Esty","year":"1986","journal-title":"The Numismatic Chronicle (1966\u2013)"},{"issue":"1","key":"2026032712175530700_ref062","doi-asserted-by":"crossref","first-page":"42","DOI":"10.2307\/1411","article-title":"\u201cThe relation between the number of species and the number of individuals in a random sample of an animal population\u201d","volume":"12","author":"Fisher","year":"1943","journal-title":"Journal of Animal Ecology"},{"key":"2026032712175530700_ref063","volume-title":"Orthogonal Polynomials","author":"Freud","year":"1971"},{"key":"2026032712175530700_ref064","volume-title":"Finite Mixture and Markov Switching Models","author":"Fr\u00fchwirth-Schnatter","year":"2006"},{"key":"2026032712175530700_ref065","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1109\/ISIT.2006.261864","volume-title":"2006 IEEE International Symposium on Information Theory","author":"Gao","year":"2006"},{"key":"2026032712175530700_ref066","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198506720.001.0001","volume-title":"Orthogonal Polynomials: Computation and Approximation","author":"Gautschi","year":"2004"},{"issue":"4","key":"2026032712175530700_ref067","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1214\/aos\/1015956709","article-title":"\u201cRates of convergence for the Gaussian mixture sieve\u201d","volume":"28","author":"Genovese","year":"2000","journal-title":"Annals of Statistics"},{"issue":"5","key":"2026032712175530700_ref068","doi-asserted-by":"crossref","first-page":"1233","DOI":"10.1214\/aos\/1013203452","article-title":"\u201cEntropies and rates of convergence for maximum likelihood and Bayes estimation for mixtures of normal densities\u201d","volume":"29","author":"Ghosal","year":"2001","journal-title":"The Annals of Statistics"},{"issue":"3","key":"2026032712175530700_ref069","doi-asserted-by":"crossref","first-page":"419","DOI":"10.1111\/j.1751-5823.2002.tb00178.x","article-title":"\u201cOn choosing and bounding probability metrics\u201d","volume":"70","author":"Gibbs","year":"2002","journal-title":"International Statistical Review"},{"key":"2026032712175530700_ref070","unstructured":"\u201cGlobal language monitor. Number of words in the English language\u201d\n           (n.d.). URL: https:\/\/www.languagemonitor.com\/global-english\/no-of-words\/.Accessed: 2016-02-16."},{"issue":"106","key":"2026032712175530700_ref071","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1090\/S0025-5718-69-99647-1","article-title":"\u201cCalculation of Gauss quadrature rules\u201d","volume":"23","author":"Golub","year":"1969","journal-title":"Mathematics of Computation"},{"issue":"1\u20132","key":"2026032712175530700_ref072","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1093\/biomet\/43.1-2.45","article-title":"\u201cThe number of new species, and the increase in population coverage, when a sample is increased\u201d","volume":"43","author":"Good","year":"1956","journal-title":"Biometrika"},{"issue":"3\u20134","key":"2026032712175530700_ref073","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1093\/biomet\/40.3-4.237","article-title":"\u201cThe population frequencies of species and the estimation of population parameters\u201d","volume":"40","author":"Good","year":"1953","journal-title":"Biometrika"},{"key":"2026032712175530700_ref074","first-page":"39","article-title":"\u201cEstimating species richness\u201d","volume":"12","author":"Gotelli","year":"2011","journal-title":"Biological Diversity: Frontiers in Measurement and Assessment"},{"key":"2026032712175530700_ref075","volume-title":"Table of Integrals Series and Products","author":"Gradshteyn","year":"2007"},{"key":"2026032712175530700_ref076","volume-title":"Generalized Method of Moments","author":"Hall","year":"2005"},{"key":"2026032712175530700_ref077","volume-title":"Proceedings of the Thirty-second Conference on Neural Information Processing Systems","author":"Han","year":"2018"},{"key":"2026032712175530700_ref078","doi-asserted-by":"crossref","first-page":"1372","DOI":"10.1109\/ISIT.2015.7282680","volume-title":"2015 IEEE International Symposium on Information Theory (ISIT)","author":"Han","year":"2015"},{"key":"2026032712175530700_ref079","doi-asserted-by":"crossref","first-page":"1367","DOI":"10.1109\/ISIT.2015.7282679","volume-title":"2015 IEEE International Symposium on Information Theory (ISIT)","author":"Han","year":"2015"},{"key":"2026032712175530700_ref080","first-page":"3189","volume-title":"Proceedings of the 31st Conference on Learning Theory","author":"Han","year":"2018"},{"key":"2026032712175530700_ref081","volume-title":"To appear in Annals of Statistics","author":"Han","year":"2017"},{"issue":"4","key":"2026032712175530700_ref082","doi-asserted-by":"crossref","first-page":"1029","DOI":"10.2307\/1912775","article-title":"\u201cLarge sample properties of generalized method of moments estimators\u201d","volume":"50","author":"Hansen","year":"1982","journal-title":"Econometrica"},{"key":"2026032712175530700_ref083","volume-title":"Wavelets, Approximation, and Statistical Applications","author":"Hardle","year":"2012"},{"key":"2026032712175530700_ref084","doi-asserted-by":"crossref","first-page":"753","DOI":"10.1145\/2746539.2746579","volume-title":"Proceedings of the Forty-Seventh Annual ACM on Symposium on Theory of Computing","author":"Hardt","year":"2015"},{"key":"2026032712175530700_ref085","first-page":"323","volume-title":"Topics in Information Theory","author":"Harris","year":"1975"},{"issue":"6A","key":"2026032712175530700_ref086","doi-asserted-by":"crossref","first-page":"2844","DOI":"10.1214\/17-AOS1641","article-title":"\u201cStrong identifiability and optimal minimax rates for finite mixture estimation\u201d","volume":"46","author":"Heinrich","year":"2018","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref087","doi-asserted-by":"crossref","first-page":"1021","DOI":"10.1145\/3188745.3188748","volume-title":"Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing","author":"Hopkins","year":"2018"},{"key":"2026032712175530700_ref088","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9781139020411","volume-title":"Matrix Analysis","author":"Horn","year":"2012","edition":"2"},{"key":"2026032712175530700_ref089","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1145\/308386.308455","volume-title":"Proceedings of the Seventh ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems","author":"Hou","year":"1988"},{"key":"2026032712175530700_ref090","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1145\/2422436.2422439","volume-title":"Proceedings of the 4th Conference on Innovations in Theoretical Computer Science","author":"Hsu","year":"2013"},{"issue":"3","key":"2026032712175530700_ref091","doi-asserted-by":"crossref","first-page":"1365","DOI":"10.1093\/genetics\/159.3.1365","article-title":"\u201cEstimating the total number of alleles using a sample coverage method\u201d","volume":"159","author":"Huang","year":"2001","journal-title":"Genetics"},{"key":"2026032712175530700_ref092","first-page":"359","volume-title":"Lecture Notes-Monograph Series","author":"Ibragimov","year":"2001"},{"key":"2026032712175530700_ref093","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4899-0027-2","volume-title":"Statistical Estimation: Asymptotic Theory","author":"Ibragimov","year":"1981"},{"issue":"3","key":"2026032712175530700_ref094","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1137\/1131054","article-title":"\u201cSome problems on nonparametric estimation in Gaussian white noise\u201d","volume":"31","author":"Ibragimov","year":"1987","journal-title":"Theory of Probability & Its Applications"},{"issue":"13","key":"2026032712175530700_ref095","doi-asserted-by":"crossref","first-page":"5008","DOI":"10.1073\/pnas.0807815106","article-title":"\u201cEstimating the number of unseen variants in the human genome\u201d","volume":"106","author":"Ionita-Laza","year":"2009","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"2026032712175530700_ref096","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9781107325982","volume-title":"Classical and Quantum Orthogonal Polynomials in One Variable","author":"Ismail","year":"2005"},{"key":"2026032712175530700_ref097","first-page":"151","article-title":"\u201cOn a new characteristic of functions. II. Direct and converse theorems for the best algebraic approximation in C[\u20141,1] and Lp[\u20141,1]\u201d","volume":"5","author":"Ivanov","year":"1983","journal-title":"Pliska"},{"issue":"2","key":"2026032712175530700_ref098","doi-asserted-by":"crossref","first-page":"479","DOI":"10.1214\/aos\/1176345789","article-title":"\u201cMixtures of exponential distributions\u201d","volume":"10","author":"Jewell","year":"1982","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref099","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1109\/ACSSC.2016.7869051","volume-title":"2016 50th Asilomar Conference on Signals, Systems and Computers","author":"Jiao","year":"2016"},{"issue":"10","key":"2026032712175530700_ref100","doi-asserted-by":"crossref","first-page":"6220","DOI":"10.1109\/TIT.2013.2267934","article-title":"\u201cUniversal estimation of directed information\u201d","volume":"59","author":"Jiao","year":"2013","journal-title":"IEEE Trans. Inf. Theory"},{"issue":"5","key":"2026032712175530700_ref101","doi-asserted-by":"crossref","first-page":"2835","DOI":"10.1109\/TIT.2015.2412945","article-title":"\u201cMinimax estimation of functionals of discrete distributions\u201d","volume":"61","author":"Jiao","year":"2015","journal-title":"IEEE Transactions on Information Theory"},{"issue":"10","key":"2026032712175530700_ref102","doi-asserted-by":"crossref","first-page":"6774","DOI":"10.1109\/TIT.2017.2733537","article-title":"\u201cMaximum likelihood estimation of functionals of discrete distributions\u201d","volume":"63","author":"Jiao","year":"2017","journal-title":"IEEE Transactions on Information Theory"},{"issue":"5A","key":"2026032712175530700_ref103","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1214\/08-AOS654","article-title":"\u201cNonparametric estimation by convex programming\u201d","volume":"37","author":"Juditsky","year":"2009","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref104","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1145\/1806689.1806765","volume-title":"Proceedings of the FortySecond ACM Symposium on Theory of Computing","author":"Kalai","year":"2010"},{"key":"2026032712175530700_ref105","first-page":"444","volume-title":"International Conference on Computational Learning Theory","author":"Kannan","year":"2005"},{"key":"2026032712175530700_ref106","doi-asserted-by":"crossref","DOI":"10.1090\/memo\/0012","volume-title":"Geometry of Moment Spaces","author":"Karlin","year":"1953"},{"issue":"3","key":"2026032712175530700_ref107","doi-asserted-by":"crossref","first-page":"577","DOI":"10.1016\/S0167-9473(02)00177-9","article-title":"\u201cChoosing initial values for the EM algorithm for finite mixtures\u201d","volume":"41","author":"Karlis","year":"2003","journal-title":"Computational Statistics & Data Analysis"},{"issue":"1","key":"2026032712175530700_ref108","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1111\/j.1751-5823.2005.tb00250.x","volume":"73","author":"Karlis","year":"2005","journal-title":"International Statistical Review"},{"issue":"2","key":"2026032712175530700_ref109","doi-asserted-by":"crossref","first-page":"782","DOI":"10.1109\/TIT.2012.2222343","article-title":"\u201cClassification of homogeneous data with large alphabets\u201d","volume":"59","author":"Kelly","year":"2013","journal-title":"IEEE Transactions on Information Theory"},{"key":"2026032712175530700_ref110","first-page":"887","volume-title":"The Annals of Mathematical Statistics:","author":"Kiefer","year":"1956"},{"issue":"4","key":"2026032712175530700_ref111","doi-asserted-by":"crossref","first-page":"1802","DOI":"10.3150\/13-BEJ542","article-title":"\u201cMinimax bounds for estimation of normal mixtures\u201d","volume":"20","author":"Kim","year":"2014","journal-title":"Bernoulli"},{"key":"2026032712175530700_ref112","first-page":"2076","volume-title":"Proceedings of the Twenty-seventh Conference on Neural Information Processing Systems","author":"Knudson","year":"2013"},{"issue":"506","key":"2026032712175530700_ref113","doi-asserted-by":"crossref","first-page":"674","DOI":"10.1080\/01621459.2013.869224","article-title":"\u201cConvex optimization, shape constraints, compound decisions, and empirical Bayes rules\u201d","volume":"109","author":"Koenker","year":"2014","journal-title":"Journal of the American Statistical Association"},{"issue":"5","key":"2026032712175530700_ref114","doi-asserted-by":"crossref","first-page":"2218","DOI":"10.1214\/16-AOS1525","article-title":"\u201cSpectrum estimation from samples\u201d","volume":"45","author":"Kong","year":"2017","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref115","volume-title":"Introduction to Empirical Processes and Semiparametric Inference","author":"Kosorok","year":"2007"},{"key":"2026032712175530700_ref116","unstructured":"Krawtchouk, M.\n           (1932). \u201cSur le probl\u00e8me de moments\u201d. In: ICM Proceedings. Available athttps:\/\/www.mathunion.org\/fileadmin\/ICM\/Proceedings\/ICM1932.2\/ICM1932.2.ocr.pdf.127\u2013128."},{"key":"2026032712175530700_ref117","doi-asserted-by":"crossref","DOI":"10.1090\/mmono\/050","volume-title":"The Markov Moment Problem and Extremal Problems","author":"Krein","year":"1977"},{"key":"2026032712175530700_ref118","first-page":"1779","volume-title":"2004 International Conference on Image Processing, 2004. ICIP\u201904","author":"Kybic","year":"2004"},{"issue":"364","key":"2026032712175530700_ref119","doi-asserted-by":"crossref","first-page":"805","DOI":"10.1080\/01621459.1978.10480103","article-title":"\u201cNonparametric maximum likelihood estimation of a mixing distribution\u201d","volume":"73","author":"Laird","year":"1978","journal-title":"Journal of the American Statistical Association"},{"key":"2026032712175530700_ref120","doi-asserted-by":"crossref","DOI":"10.1142\/p665","volume-title":"Moments, Positive Polynomials and Their Applications","author":"Lasserre","year":"2009"},{"issue":"2","key":"2026032712175530700_ref121","doi-asserted-by":"crossref","first-page":"659","DOI":"10.1214\/aos\/1032894458","article-title":"\u201cEfficient estimation of integral functionals of a density\u201d","volume":"24","author":"Laurent","year":"1996","journal-title":"The Annals of Statistics"},{"issue":"1","key":"2026032712175530700_ref122","first-page":"38","article-title":"\u201cConvergence of estimates under dimensionality restrictions\u201d","volume":"1","author":"Le Cam","year":"1973","journal-title":"The Annals of Statistics"},{"issue":"2","key":"2026032712175530700_ref123","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1007\/s004409970006","article-title":"\u201cOn estimation of the Lr norm of a regression function\u201d","volume":"113","author":"Lepski","year":"1999","journal-title":"Probability Theory and Related Fields"},{"key":"2026032712175530700_ref124","first-page":"1302","volume-title":"Proceedings of the 30th Annual Conference on Learning Theory (COLT 2017)","author":"Li","year":"2017"},{"key":"2026032712175530700_ref125","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1007\/978-94-009-8552-0_8","volume-title":"Statistical Distributions in Scientific Work","author":"Lindsay","year":"1981"},{"issue":"2","key":"2026032712175530700_ref126","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1214\/aos\/1176347138","article-title":"\u201cMoment matrices: Applications in mixtures\u201d","volume":"17","author":"Lindsay","year":"1989","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref127","volume-title":"NSF-CBMS Regional Conference Series in Probability and Statistics","author":"Lindsay","year":"1995"},{"issue":"2","key":"2026032712175530700_ref128","doi-asserted-by":"crossref","first-page":"1094","DOI":"10.1214\/aos\/1176348672","article-title":"\u201cFrom the species problem to a general coverage problem via a new interpretation\u201d","volume":"20","author":"Lo","year":"1992","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref129","author":"Lu","year":"2016"},{"issue":"5216","key":"2026032712175530700_ref130","doi-asserted-by":"crossref","first-page":"1503","DOI":"10.1126\/science.7770778","article-title":"\u201cReliability of spike timing in neocortical neurons\u201d","volume":"268","author":"Mainen","year":"1995","journal-title":"Science"},{"key":"2026032712175530700_ref131","volume-title":"Foundations of Statistical Natural Language Processing","author":"Manning","year":"1999"},{"issue":"341","key":"2026032712175530700_ref132","doi-asserted-by":"crossref","first-page":"92","DOI":"10.1080\/01621459.1973.10481342","article-title":"\u201cEstimating an author\u2019s vocabulary\u201d","volume":"68","author":"McNeil","year":"1973","journal-title":"Journal of the American Statistical Association"},{"issue":"1","key":"2026032712175530700_ref133","first-page":"1235","article-title":"\u201cMllib: Machine learning in apache spark\u201d","volume":"17","author":"Meng","year":"2016","journal-title":"The Journal of Machine Learning Research"},{"issue":"3","key":"2026032712175530700_ref134","doi-asserted-by":"crossref","first-page":"511","DOI":"10.1111\/1467-9868.00082","article-title":"\u201cThe EM algorithm\u2014An old folk-song sung to a fast new tune\u201d","volume":"59","author":"Meng","year":"1997","journal-title":"Journal of the Royal Statistical Society: Series B (Statistical Methodology)"},{"key":"2026032712175530700_ref135","first-page":"95","article-title":"\u201cNote on the bias of information estimates\u201d","volume":"2","author":"Miller","year":"1955","journal-title":"Information Theory in Psychology: Problems and Methods"},{"key":"2026032712175530700_ref136","volume-title":"Machine Learning","author":"Mitchell","year":"1997"},{"key":"2026032712175530700_ref137","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511813603","volume-title":"Probability and Computing: Randomized Algorithms and Probabilistic Analysis","author":"Mitzenmacher","year":"2005"},{"key":"2026032712175530700_ref138","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1109\/FOCS.2010.15","volume-title":"2010 51st Annual IEEE Symposium on Foundations of Computer Science (FOCS)","author":"Moitra","year":"2010"},{"issue":"1","key":"2026032712175530700_ref139","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1214\/aos\/1176345690","article-title":"\u201cNatural exponential families with quadratic variance functions\u201d","volume":"10","author":"Morris","year":"1982","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref140","first-page":"499","volume-title":"International Conference on Database Theory","author":"Naughton","year":"1990"},{"issue":"5","key":"2026032712175530700_ref141","doi-asserted-by":"crossref","first-page":"056","DOI":"10.1103\/PhysRevE.69.056111","article-title":"\u201cEntropy and information in neural spike trains: Progress on the sampling problem\u201d","volume":"69","author":"Nemenman","year":"2004","journal-title":"Physical Review E"},{"key":"2026032712175530700_ref142","first-page":"2419","volume-title":"Proceedings of the 42nd IEEE Conference on Decision and Control","author":"Nemirovski","year":"2003"},{"issue":"3","key":"2026032712175530700_ref143","first-page":"207","article-title":"\u201cApproximation of functions in the mean by trigonometrical polynomials\u201d","volume":"10","author":"Nikolsky","year":"1946","journal-title":"Izvestiya Rossiiskoi Akademii Nauk. Seriya Matematicheskaya"},{"key":"2026032712175530700_ref144","volume-title":"Discrete-Time Signal Processing","author":"Oppenheim","year":"1999"},{"issue":"7","key":"2026032712175530700_ref145","doi-asserted-by":"crossref","first-page":"1469","DOI":"10.1109\/TIT.2004.830761","article-title":"\u201cUniversal compression of memoryless sources over unknown alphabets\u201d","volume":"50","author":"Orlitsky","year":"2004","journal-title":"IEEE Transactions on Information Theory"},{"key":"2026032712175530700_ref146","first-page":"2143","volume-title":"Proceedings of the Twenty-Ninth Conference on Neural Information Processing Systems","author":"Orlitsky","year":"2015"},{"issue":"47","key":"2026032712175530700_ref147","doi-asserted-by":"crossref","first-page":"13283","DOI":"10.1073\/pnas.1607774113","article-title":"\u201cOptimal prediction of the number of unseen species\u201d","volume":"113","author":"Orlitsky","year":"2016","journal-title":"Proceedings of the National Academy of Sciences (PNAS)"},{"key":"2026032712175530700_ref148","unstructured":"\u201cOxford English Dictionary\u201d\n           (n.d.). http:\/\/public.oed.com\/about\/. Accessed: 2016-02-16."},{"issue":"6","key":"2026032712175530700_ref149","doi-asserted-by":"crossref","first-page":"1191","DOI":"10.1162\/089976603321780272","article-title":"\u201cEstimation of entropy and mutual information\u201d","volume":"15","author":"Paninski","year":"2003","journal-title":"Neural Computation"},{"issue":"9","key":"2026032712175530700_ref150","doi-asserted-by":"crossref","first-page":"2200","DOI":"10.1109\/TIT.2004.833360","article-title":"\u201cEstimating entropy on m bins given fewer than m samples\u201d","volume":"50","author":"Paninski","year":"2004","journal-title":"IEEE Transactions on Information Theory"},{"key":"2026032712175530700_ref151","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1098\/rsta.1894.0003","article-title":"\u201cContributions to the mathematical theory of evolution\u201d","volume":"185","author":"Pearson","year":"1894","journal-title":"Philosophical Transactions of the Royal Society of London. A"},{"key":"2026032712175530700_ref152","first-page":"2825","article-title":"\u201cScikit-learn: Machine learning in python\u201d","volume":"12","author":"Pedregosa","year":"2011","journal-title":"Journal of Machine Learning Research"},{"key":"2026032712175530700_ref153","volume-title":"Rational Approximation of Real Functions","author":"Petrushev","year":"2011"},{"issue":"2","key":"2026032712175530700_ref154","doi-asserted-by":"crossref","first-page":"535","DOI":"10.1093\/biomet\/88.2.535","article-title":"#x201C;Alternative EM methods for nonparametric finite mixture models\u201d","volume":"88","author":"Pilla","year":"2001","journal-title":"Biometrika"},{"issue":"2","key":"2026032712175530700_ref155","first-page":"52","article-title":"\u201cOptimal filtering of square-integrable signals in Gaussian noise\u201d","volume":"16","author":"Pinsker","year":"1980","journal-title":"Problemy Peredachi Informatsii"},{"key":"2026032712175530700_ref156","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1007\/978-94-015-8729-7_29","volume-title":"Maximum Entropy and Bayesian Methods","author":"Plotkin","year":"1996"},{"issue":"8","key":"2026032712175530700_ref157","doi-asserted-by":"crossref","first-page":"986","DOI":"10.1109\/TMI.2003.815867","article-title":"\u201cMutualinformation-based registration of medical images: A survey\u201d","volume":"22","author":"Pluim","year":"2003","journal-title":"IEEE Transactions on Medical Imaging"},{"key":"2026032712175530700_ref158","volume-title":"Proceedings of Conference on Learning Theory (COLT)","author":"Polyanskiy","year":"2017"},{"key":"2026032712175530700_ref159","author":"Polyanskiy","year":"2019"},{"issue":"11","key":"2026032712175530700_ref160","doi-asserted-by":"crossref","first-page":"1282","DOI":"10.1109\/10.959324","article-title":"\u201cEntropy, entropy rate, and pattern classification as tools to typify complexity in short heart period variability series\u201d","volume":"48","author":"Porta","year":"2001","journal-title":"IEEE Transactions on Biomedical Engineering"},{"issue":"11","key":"2026032712175530700_ref161","doi-asserted-by":"crossref","first-page":"1338","DOI":"10.1109\/TIP.2003.818640","article-title":"\u201cImage denoising using scale mixtures of Gaussians in the wavelet domain\u201d","volume":"12","author":"Portilla","year":"2003","journal-title":"IEEE Transactions on Image Processing"},{"key":"2026032712175530700_ref162","volume-title":"Polynomials","author":"Prasolov","year":"2009"},{"issue":"12","key":"2026032712175530700_ref163","doi-asserted-by":"crossref","first-page":"3173","DOI":"10.1109\/TSP.2013.2259161","article-title":"\u201cEfficient methods to compute optimal tree approximations of directed information graphs\u201d","volume":"61","author":"Quinn","year":"2013","journal-title":"IEEE Trans. Signal Process"},{"key":"2026032712175530700_ref164","volume-title":"Nonparametric Functional Estimation","author":"Rao","year":"2014"},{"issue":"3","key":"2026032712175530700_ref165","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1137\/070701649","article-title":"\u201cStrong lower bounds for approximating distribution support size and the distinct elements problem\u201d","volume":"39","author":"Raskhodnikova","year":"2009","journal-title":"SIAM Journal on Computing"},{"issue":"2","key":"2026032712175530700_ref166","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1137\/1026034","article-title":"\u201cMixture densities, maximum likelihood and the EM algorithm\u201d","volume":"26","author":"Redner","year":"1984","journal-title":"SIAM Review"},{"key":"2026032712175530700_ref167","volume":"144","author":"Reimer","year":"2012","journal-title":"Multivariate Polynomial Approximation"},{"key":"2026032712175530700_ref168","volume-title":"Spikes: Exploring the Neural Code","author":"Rieke","year":"1999"},{"key":"2026032712175530700_ref169","volume-title":"An Introduction to the Approximation of Functions","author":"Rivlin","year":"1981"},{"key":"2026032712175530700_ref170","doi-asserted-by":"crossref","DOI":"10.1137\/1.9781611970524","volume-title":"Conjugate Duality and Optimization","author":"Rockafellar","year":"1974"},{"key":"2026032712175530700_ref171","volume-title":"Real and Complex Analysis","author":"Rudin","year":"2006"},{"issue":"5307","key":"2026032712175530700_ref172","doi-asserted-by":"crossref","first-page":"1805","DOI":"10.1126\/science.275.5307.1805","article-title":"\u201cReproducibility and variability in neural spike trains\u201d","volume":"275","author":"Ruyter","year":"1997","journal-title":"Science:"},{"issue":"2","key":"2026032712175530700_ref173","doi-asserted-by":"crossref","first-page":"738","DOI":"10.1214\/19-AOS1817","article-title":"\u201cOn the nonparametric maximum likelihood estimator for gaussian location mixture densities with application to gaussian denoising\u201d","volume":"48","author":"Saha","year":"2020","journal-title":"Annals of Statistics"},{"key":"2026032712175530700_ref174","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-319-64546-9","volume-title":"The Moment Problem","author":"Schm\u00fcdgen","year":"2017"},{"issue":"3","key":"2026032712175530700_ref175","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1023\/A:1004117419204","article-title":"\u201cA cautionary note on likelihood ratio tests in mixture models\u201d","volume":"52","author":"Seidel","year":"2000","journal-title":"Annals of the Institute of Statistical Mathematics"},{"issue":"379-423","key":"2026032712175530700_ref176","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1002\/j.1538-7305.1948.tb00917.x","article-title":"\u201cA mathematical theory of communication\u201d","volume":"27","author":"Shannon","year":"1948","journal-title":"Bell System Technical Journal"},{"key":"2026032712175530700_ref177","doi-asserted-by":"crossref","DOI":"10.1090\/surv\/001","volume-title":"The Problem of Moments","author":"Shohat","year":"1943"},{"issue":"2","key":"2026032712175530700_ref178","doi-asserted-by":"crossref","first-page":"753","DOI":"10.1214\/aos\/1176349952","article-title":"\u201cAn Efron-Stein inequality for nonsymmetric statistics\u201d","volume":"14","author":"Steele","year":"1986","journal-title":"The Annals of Statistics"},{"issue":"1","key":"2026032712175530700_ref179","doi-asserted-by":"crossref","first-page":"1373","DOI":"10.1007\/BF01085007","article-title":"\u201cLectures on the theory of estimation of many parameters\u201d","volume":"34","author":"Stein","year":"1986","journal-title":"Journal of Soviet Mathematics"},{"key":"2026032712175530700_ref180","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-21738-3","volume-title":"Introduction to Numerical Analysis","author":"Stoer","year":"2002","edition":"3"},{"issue":"6","key":"2026032712175530700_ref181","doi-asserted-by":"crossref","first-page":"1348","DOI":"10.1214\/aos\/1176345206","article-title":"\u201cOptimal rates of convergence for nonparametric estimators\u201d","volume":"8","author":"Stone","year":"1980","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref182","doi-asserted-by":"crossref","DOI":"10.1515\/9783110850826","volume-title":"Mathematical Theory of Statistics: Statistical Experiments and Asymptotic Decision Theory","author":"Strasser","year":"1985"},{"issue":"1","key":"2026032712175530700_ref183","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1103\/PhysRevLett.80.197","article-title":"\u201cEntropy and information in neural spike trains\u201d","volume":"80","author":"Strong","year":"1998","journal-title":"Phys. Rev. Lett"},{"key":"2026032712175530700_ref184","volume-title":"Orthogonal Polynomials","author":"Szeg\u00f6","year":"1975","edition":"4"},{"issue":"3","key":"2026032712175530700_ref185","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1093\/biomet\/74.3.445","article-title":"\u201cDid Shakespeare write a newly-discovered poem?\u201d","volume":"74","author":"Thisted","year":"1987","journal-title":"Biometrika"},{"key":"2026032712175530700_ref186","volume-title":"Theory of Approximation of Functions of a Real Variable","author":"Timan","year":"1963"},{"key":"2026032712175530700_ref187","first-page":"473","volume-title":"International Conference on Medical Image Computing and Computer-Assisted Intervention","author":"Tsai","year":"1999"},{"issue":"4","key":"2026032712175530700_ref188","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1016\/j.media.2004.01.003","article-title":"\u201cMutual information in coupled multi-shape model for medical image segmentation\u201d","volume":"8","author":"Tsai","year":"2004","journal-title":"Medical Image Analysis"},{"key":"2026032712175530700_ref189","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511807213","volume-title":"Fundamentals of Wireless Communication","author":"Tse","year":"2005"},{"key":"2026032712175530700_ref190","doi-asserted-by":"crossref","DOI":"10.1007\/b13794","volume-title":"Introduction to Nonparametric Estimation","author":"Tsybakov","year":"2009"},{"key":"2026032712175530700_ref191","volume-title":"Introduction to Mathematical Probability","author":"Uspensky","year":"1937"},{"key":"2026032712175530700_ref192","author":"Valiant","year":"2017"},{"issue":"179","key":"2026032712175530700_ref193","first-page":"1","article-title":"\u201cA CLT and tight lower bounds for estimating entropy\u201d","volume":"17","author":"Valiant","year":"2010","journal-title":"Electronic Colloquium on Computational Complexity (ECCC)"},{"key":"2026032712175530700_ref194","first-page":"685","volume-title":"Proceedings of the 43rd Annual ACM Symposium on Theory of Computing","author":"Valiant","year":"2011"},{"key":"2026032712175530700_ref195","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1109\/FOCS.2011.81","volume-title":"2011 IEEE 52nd Annual Symposium on Foundations of Computer Science (FOCS)","author":"Valiant","year":"2011"},{"key":"2026032712175530700_ref196","first-page":"383","volume-title":"Proceedings of the Fortieth Annual ACM Symposium on Theory of Computing. STOC \u201808","author":"Valiant","year":"2008"},{"issue":"6","key":"2026032712175530700_ref197","doi-asserted-by":"crossref","first-page":"1927","DOI":"10.1137\/080734066","article-title":"\u201cTesting symmetric properties of distributions\u201d","volume":"40","author":"Valiant","year":"2011","journal-title":"SIAM Journal on Computing"},{"key":"2026032712175530700_ref198","first-page":"2157","volume-title":"Proceedings of the Twenty-Seventh Conference on Neural Information Processing Systems","author":"Valiant","year":"2013"},{"key":"2026032712175530700_ref199","volume-title":"Asymptotic Statistics","author":"Vander Vaart","year":"2000"},{"issue":"4","key":"2026032712175530700_ref200","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1016\/j.jcss.2003.11.008","article-title":"\u201cA spectral algorithm for learning mixture models\u201d","volume":"68","author":"Vempala","year":"2004","journal-title":"Journal of Computer and System Sciences"},{"key":"2026032712175530700_ref201","doi-asserted-by":"crossref","DOI":"10.1090\/gsm\/058","volume-title":"Topics in Optimal Transportation","author":"Villani","year":"2003"},{"key":"2026032712175530700_ref202","volume-title":"Optimal Transport: Old and New","author":"Villani","year":"2008"},{"issue":"5","key":"2026032712175530700_ref203","doi-asserted-by":"crossref","first-page":"051139","DOI":"10.1103\/PhysRevE.85.051139","article-title":"\u201cEstimation of the entropy based on its polynomial representation\u201d","volume":"85","author":"Vinck","year":"2012","journal-title":"Physical Review E"},{"issue":"6","key":"2026032712175530700_ref204","doi-asserted-by":"crossref","first-page":"3207","DOI":"10.1109\/TIT.2011.2137210","article-title":"\u201cProbability estimation in the rare-events regime\u201d","volume":"57","author":"Wagner","year":"2011","journal-title":"IEEE Trans. Inf. Theory"},{"key":"2026032712175530700_ref205","first-page":"855","volume-title":"Proceedings of the Thirteenth Conference on Neural Information Processing Systems (NIPS 1999)","author":"Wainwright","year":"2000"},{"key":"2026032712175530700_ref206","volume-title":"Handbook of Semidefinite Programming: Theory, Algorithms, and Applications","author":"Wolkowicz","year":"2012"},{"key":"2026032712175530700_ref207","doi-asserted-by":"crossref","first-page":"620","DOI":"10.1109\/ALLERTON.2010.5706965","volume-title":"2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","author":"Wu","year":"2010"},{"issue":"6","key":"2026032712175530700_ref208","doi-asserted-by":"crossref","first-page":"3702","DOI":"10.1109\/TIT.2016.2548468","article-title":"\u201cMinimax rates of entropy estimation on large alphabets via best polynomial approximation\u201d","volume":"62","author":"Wu","year":"2016","journal-title":"IEEE Transactions on Information Theory"},{"issue":"1","key":"2026032712175530700_ref209","doi-asserted-by":"crossref","first-page":"37","DOI":"10.4171\/msl\/1-1-2","article-title":"\u201cSample complexity of the distinct elements problem\u201d","volume":"1","author":"Wu","year":"2018","journal-title":"Mathematical Statistics and Learning"},{"issue":"2","key":"2026032712175530700_ref210","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1214\/17-AOS1665","article-title":"\u201cChebyshev polynomials, moment matching, and optimal estimation of the unseen\u201d","volume":"47","author":"Wu","year":"2019","journal-title":"The Annals of Statistics"},{"issue":"4","key":"2026032712175530700_ref211","doi-asserted-by":"crossref","first-page":"1981","DOI":"10.1214\/19-AOS1873","article-title":"\u201cOptimal estimation of Gaussian mixtures via denoised method of moments\u201d","volume":"48","author":"Wu","year":"2020","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref212","doi-asserted-by":"publisher","author":"Wu","year":"2020","DOI":"10.1214\/19-AOS1873SUPP"},{"key":"2026032712175530700_ref213","first-page":"2676","volume-title":"Proceedings of the Thirtieth Conference on Neural Information Processing Systems","author":"Xu","year":"2016"},{"issue":"1","key":"2026032712175530700_ref214","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1162\/neco.1996.8.1.129","article-title":"\u201cOn convergence properties of the EM algorithm for Gaussian mixtures\u201d","volume":"8","author":"Xu","year":"1996","journal-title":"Neural Computation"},{"key":"2026032712175530700_ref215","volume-title":"MA thesis","author":"Yang","year":"2016"},{"issue":"5","key":"2026032712175530700_ref216","doi-asserted-by":"crossref","first-page":"1564","DOI":"10.1214\/aos\/1017939142","article-title":"\u201cInformation-theoretic determination of minimax rates of convergence\u201d","volume":"27","author":"Yang","year":"1999","journal-title":"The Annals of Statistics"},{"key":"2026032712175530700_ref217","doi-asserted-by":"crossref","first-page":"423","DOI":"10.1007\/978-1-4612-1880-7_29","volume-title":"Festschrift for Lucien Le Cam:","author":"Yu","year":"1997"},{"key":"2026032712175530700_ref218","first-page":"1297","article-title":"\u201cGeneralized maximum likelihood estimation of normal mixture densities\u201d","volume":"19","author":"Zhang","year":"2009","journal-title":"Statistica Sinica"},{"key":"2026032712175530700_ref219","doi-asserted-by":"crossref","first-page":"13293","DOI":"10.1038\/ncomms13293","article-title":"\u201cQuantifying unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects\u201d","volume":"7","author":"Zou","year":"2016","journal-title":"Nature Communications"}],"container-title":["Foundations and Trends\u00ae in Communications and Information Theory"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.emerald.com\/ftcit\/article-pdf\/17\/4\/402\/11160644\/0100000095en.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/www.emerald.com\/ftcit\/article-pdf\/17\/4\/402\/11160644\/0100000095en.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T14:10:36Z","timestamp":1777471836000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.emerald.com\/ftcit\/article\/17\/4\/402\/1332826\/Polynomial-Methods-in-Statistical-Inference-Theory"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,12]]},"references-count":219,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,10,12]]}},"URL":"https:\/\/doi.org\/10.1561\/0100000095","relation":{},"ISSN":["1567-2190","1567-2328"],"issn-type":[{"value":"1567-2190","type":"print"},{"value":"1567-2328","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,10,12]]}}}