{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T22:12:17Z","timestamp":1777759937321,"version":"3.51.4"},"reference-count":20,"publisher":"MIT Press - Journals","issue":"1","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Neural Computation"],"published-print":{"date-parts":[[2004,1,1]]},"abstract":"<jats:p> From a smooth, strictly convex function \u03a6: R<jats:sup>n<\/jats:sup> \u2192 R, a parametric family of divergence function D<jats:sub>\u03a6<\/jats:sub><jats:sup>(\u03b1)<\/jats:sup> may be introduced: <\/jats:p><jats:p> [Formula: see text] <\/jats:p><jats:p> for x, y, \u03b5 int dom(\u03a6) and for \u03b1 \u03b5 R, with D<jats:sub>\u03a6<\/jats:sub><jats:sup>(\u00b11<\/jats:sup> defined through taking the limit of \u03b1. Each member is shown to induce an \u03b1-independent Riemannian metric, as well as a pair of dual \u03b1-connections, which are generally nonflat, except for \u03b1 = \u00b11. In the latter case, D(\u00b11)<jats:sub>\u03a6<\/jats:sub> reduces to the (nonparametric) Bregman divergence, which is representable using and its convex conjugate \u03a6 * and becomes the canonical divergence for dually flat spaces (Amari, 1982, 1985; Amari &amp; Nagaoka, 2000). This formulation based on convex analysis naturally extends the information-geometric interpretation of divergence functions (Eguchi, 1983) to allow the distinction between two different kinds of duality: referential duality (\u03b1 \uf578-\u03b1) and representational duality (\u03a6 \uf578 \u03a6 *). When applied to (not necessarily normalized) probability densities, the concept of conjugated representations of densities is introduced, so that \u00b1 \u03b1-connections defined on probability densities embody both referential and representational duality and are hence themselves bidual. When restricted to a finite-dimensional affine submanifold, the natural parameters of a certain representation of densities and the expectation parameters under its conjugate representation form biorthogonal coordinates. The alpha representation (indexed by \u03b2 now, \u03b2 \u03b5 [\u22121, 1]) is shown to be the only measure-invariant representation. The resulting two-parameter family of divergence functionals D<jats:sup>(\u03b1, \u03b2)<\/jats:sup>, (\u03b1, \u03b2) \u03b5 [\u22121, 1] \u00d7 [-1, 1] induces identical Fisher information but bidual alpha-connection pairs; it reduces in form to Amari's alpha-divergence family when \u03b1 =\u00b11 or when \u03b2 = 1, but to the family of Jensen difference (Rao, 1987) when \u03b2 = 1. <\/jats:p>","DOI":"10.1162\/08997660460734047","type":"journal-article","created":{"date-parts":[[2004,1,21]],"date-time":"2004-01-21T22:31:06Z","timestamp":1074724266000},"page":"159-195","source":"Crossref","is-referenced-by-count":121,"title":["Divergence Function, Duality, and Convex Analysis"],"prefix":"10.1162","volume":"16","author":[{"given":"Jun","family":"Zhang","sequence":"first","affiliation":[{"name":"Department of Psychology, University of Michigan, Ann Arbor, MI 48109, U. S. A,"}]}],"member":"281","reference":[{"key":"p_1","doi-asserted-by":"publisher","DOI":"10.1207\/s15516709cog0901_7"},{"key":"p_2","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176345779"},{"key":"p_4","doi-asserted-by":"publisher","DOI":"10.1016\/0893-6080(91)90040-C"},{"key":"p_5","doi-asserted-by":"publisher","DOI":"10.1016\/0893-6080(95)00003-8"},{"key":"p_6","doi-asserted-by":"publisher","DOI":"10.1109\/72.125867"},{"key":"p_9","first-page":"184","volume":"02","author":"Bauschke H. H.","year":"2002","journal-title":"CECM Preprint"},{"key":"p_10","first-page":"186","volume":"02","author":"Bauschke H. H.","year":"2002","journal-title":"CECM Preprint"},{"key":"p_11","doi-asserted-by":"publisher","DOI":"10.1016\/0041-5553(67)90040-7"},{"key":"p_13","first-page":"329","volume":"2","author":"Csisz\u00e1r I.","year":"1967","journal-title":"Studia Mathematicarum Hungarica"},{"key":"p_15","doi-asserted-by":"publisher","DOI":"10.1214\/aos\/1176346246"},{"key":"p_16","doi-asserted-by":"publisher","DOI":"10.32917\/hmj\/1206128508"},{"key":"p_21","doi-asserted-by":"publisher","DOI":"10.2748\/tmj\/1178225722"},{"key":"p_25","doi-asserted-by":"publisher","DOI":"10.14492\/hokmj\/1351001292"},{"key":"p_26","doi-asserted-by":"publisher","DOI":"10.32917\/hmj\/1206125160"},{"key":"p_27","doi-asserted-by":"publisher","DOI":"10.32917\/hmj\/1206128255"},{"key":"p_28","doi-asserted-by":"publisher","DOI":"10.1162\/089976602760128045"},{"key":"p_31","first-page":"509","volume":"15","author":"Shima H.","year":"1978","journal-title":"Osaka Journal of Mathematics"},{"key":"p_32","doi-asserted-by":"publisher","DOI":"10.1016\/S0926-2245(96)00057-5"},{"key":"p_34","first-page":"501","volume":"37","author":"Uohashi K.","year":"2000","journal-title":"Osaka Journal of Mathematics"},{"key":"p_35","doi-asserted-by":"publisher","DOI":"10.1007\/BF02309013"}],"container-title":["Neural Computation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mitpressjournals.org\/doi\/pdf\/10.1162\/08997660460734047","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,12]],"date-time":"2021-03-12T21:51:00Z","timestamp":1615585860000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/neco\/article\/16\/1\/159-195\/6802"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,1,1]]},"references-count":20,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2004,1,1]]}},"alternative-id":["10.1162\/08997660460734047"],"URL":"https:\/\/doi.org\/10.1162\/08997660460734047","relation":{},"ISSN":["0899-7667","1530-888X"],"issn-type":[{"value":"0899-7667","type":"print"},{"value":"1530-888X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2004,1,1]]}}}