{"title": "Blossom Tree Graphical Models", "book": "Advances in Neural Information Processing Systems", "page_first": 1458, "page_last": 1465, "abstract": "We combine the ideas behind trees and Gaussian graphical models to form a new nonparametric family of graphical models. Our approach is to attach nonparanormal blossoms\", with arbitrary graphs, to a collection of nonparametric trees. The tree edges are chosen to connect variables that most violate joint Gaussianity. The non-tree edges are partitioned into disjoint groups, and assigned to tree nodes using a nonparametric partial correlation statistic. A nonparanormal blossom is then \"grown\" for each group using established methods based on the graphical lasso. The result is a factorization with respect to the union of the tree branches and blossoms, defining a high-dimensional joint density that can be efficiently estimated and evaluated on test points. Theoretical properties and experiments with simulated and real data demonstrate the effectiveness of blossom trees.\"", "full_text": "BlossomTreeGraphicalModelsZheLiuDepartmentofStatisticsUniversityofChicagoJohnLaffertyDepartmentofStatisticsDepartmentofComputerScienceUniversityofChicagoAbstractWecombinetheideasbehindtreesandGaussiangraphicalmodelstoformanewnonparametricfamilyofgraphicalmodels.Ourapproachistoattachnonpara-normal\u201cblossoms\u201d,witharbitrarygraphs,toacollectionofnonparametrictrees.ThetreeedgesarechosentoconnectvariablesthatmostviolatejointGaussianity.Thenon-treeedgesarepartitionedintodisjointgroups,andassignedtotreenodesusinganonparametricpartialcorrelationstatistic.Anonparanormalblossomisthen\u201cgrown\u201dforeachgroupusingestablishedmethodsbasedonthegraphicallasso.Theresultisafactorizationwithrespecttotheunionofthetreebranchesandblossoms,de\ufb01ningahigh-dimensionaljointdensitythatcanbeef\ufb01cientlyes-timatedandevaluatedontestpoints.Theoreticalpropertiesandexperimentswithsimulatedandrealdatademonstratetheeffectivenessofblossomtrees.1IntroductionLetp\u2217(x)beaprobabilitydensityonRdcorrespondingtoarandomvectorX=(X1,...,Xd).TheundirectedgraphG=(V,E)associatedwithp\u2217hasd=|V|verticescorrespondingtoX1,...,Xd,andmissingedges(i,j)6\u2208EwheneverXiandXjareconditionallyindependentgiventheothervariables.Theundirectedgraphisausefulwayofexploringandmodelingthedistribution.Inthispaperweareconcernedwithbuildinggraphicalmodelsforcontinuousvariables,underweakerassumptionsthanthoseimposedbyexistingmethods.Ifp\u2217(x)>0isstrictlypositive,theHammersley-Cliffordtheoremimpliesthatthedensityhastheformp\u2217(x)\u221dYC\u2208C\u03c8C(xC)=exp XC\u2208CfC(xC)!.(1.1)Inthisexpression,Cdenotesthesetofcliquesinthegraph,and\u03c8C(xC)=exp(fC(xC))>0denotesarbitrarypotentialfunctions.Thisrepresentsaverylargeandrichsetofnonparametricgraphicalmodels.Thefundamentaldif\ufb01cultyisthatitisingeneralintractabletocomputethenormalizingconstant.Acompromisemustbemadetoachievecomputationallytractableinference,typicallyinvolvingstrongassumptionsonthefunctionsfC,onthegraphG={C},orboth.ThedefaultmodelforgraphicalmodelingofcontinuousdataisthemultivariateGaussian.WhentheGaussianhascovariancematrix\u03a3,thegraphisencodedinthesparsitypatternoftheprecisionmatrix\u2126=\u03a3\u22121.Speci\ufb01cally,edge(i,j)ismissingifandonlyif\u2126ij=0.Recentworkhasfocusedonsparseestimatesoftheprecisionmatrix[8,10].Inparticular,anef\ufb01cientalgorithmforcomputingtheestimatorusingagraphicalversionofthelassoisdevelopedin[3].Thenonpara-normal[5],aformofGaussiancopula,weakenstheGaussianassumptionbyimposingGaussianityonthetransformedrandomvectorf(X)=(f1(X1),...,fd(Xd)),whereeachfjisamonotonicfunction.Thisallowsarbitrarysinglevariablemarginalprobabilitydistributionsinthemodel[5].1\fBoththeGaussiangraphicalmodelandthenonparanormalmaintaintractableinferencewithoutplacinglimitationsontheindependencegraph.Buttheyarelimitedintheirabilityto\ufb02exiblymodelthebivariateandhigherordermarginals.Atanotherextreme,forest-structuredgraphicalmodelspermitarbitrarybivariatemarginals,butmaintaintractabilitybyrestrictingtoacyclicgraphs.Annonparametricapproachbasedonforestsandtreesisdevelopedin[7]asanonparametricmethodforestimatingthedensityinhigh-dimensionalsettings.However,theabilitytomodelcomplexindependencegraphsiscompromised.InthispaperwebringtogethertheGaussian,nonparanormal,andforestgraphicalmodels,usingwhatwecallblossomtreegraphicalmodels.Informally,ablossomtreeconsistsofaforestoftrees,andacollectionofsubgraphs\u2013theblossoms\u2014possiblycontainingmanycycles.Thevertexsetsoftheblossomsaredisjoint,andeachblossomcontainsatmostonenodeofatree.Weestimatenonparanormalgraphicalmodelsovertheblossoms,andnonparametricbivariatedensitiesoverthebranches(edges)ofthetrees.Usingthepropertiesofthenonparanormal,thesecomponentscanbecombined,orfactored,togiveavalidjointdensityforX=(X1,...,Xd).ThedetailsofourconstructionaregiveninSection2.Wedevelopanestimationprocedureforblossomtreegraphi-calmodels,includinganalgorithmforselectingtreebranches,partitiontheremainingverticesintopotentialblossoms,andthenestimatingthegraphicalstructuresoftheblossoms.Sinceanobjec-tiveistorelaxtheGaussianassumption,ourcriterionforselectingtreebranchesisdeviationfromGaussianity.Towardthisend,weusethenegentropy,showingthatithasstrongstatisticalpropertiesinhighdimensions.Inordertopartitionthenodesintoblossoms,weemployanonparametricpar-tialcorrelationstatistic.Weuseadata-splittingschemetoselecttheoptimalblossomtreestructurebasedonheld-outrisk.Inthefollowingsection,wepresentthedetailsofourmethod,includingde\ufb01nitionsofblossomtreegraphs,theassociatedfamilyofgraphicalmodels,andourestimationmethods.InSections3and4,wepresentexperimentswithsimulatedandrealdata.Finally,weconcludeinSection5.Statisticalproperties,detailedproofs,andfurtherexperimentalresultsarecollectedinasupplement.2BlossomTreeGraphsandEstimationMethodsTounifytheGaussian,nonparanormalandforestgraphicalmodelswemakethefollowingde\ufb01nition.De\ufb01nition2.1.AblossomtreeonanodesetV={1,2,...,d}isagraphG=(V,E),togetherwithadecompositionoftheedgesetEasE=F\u222a{\u222aB\u2208BB}satisfyingthefollowingproperties:1.Fisacyclic;2.V(B)\u2229V(B0)=\u2205,forB,B0\u2208BwithB6=B0,whereV(B)denotesthevertexsetofB.3.|V(B)\u2229V(F)|\u22641foreachB\u2208B;4.V(F)\u222aSBV(B)=V.ThesubgraphsB\u2208Barecalledblossoms.Theuniquenode\u03c1(B)\u2208V(B)\u2229V(F),whichmaybeempty,iscalledthepediceloftheblossom.ThesetofpedicelsisdenotedP(F)\u2282V(F).Property1saysthatthesetofedgesFformsaunionoftrees\u2014aforest.Property2saysthatdistinctblossomssharenoverticesoredgesincommon.Property3saysthateachblossomisconnectedtoatmostonetreenode.Property4saysthateverynodeinthegraphiseitherinatreeorablossom.Notethattheblossomsarenotrequiredtobeconnected,butmusthaveatmostonevertexincommonwiththeforest\u2014thisisthepedicelnode.2\f(a)blossomtree(b)violation(c)blossomtree(d)violationFigure1:Fourgraphs,twoblossomtrees.Thetreeedgesarecoloredblue,theblossomedgesarecoloredblack,andpedicelsareorange.Graphs(a)and(c)correspondtoblossomtrees.Graphs(b)and(d)violatetherestrictionthateachblossomhasonlyasinglepedicel,orattachmenttoatree.Supposethatp(x)=p(x1,...,xd)isthedensityofadistributionthathasanindependencegraphgivenbyablossomtreeF\u222a{\u222aBB}.Thenfromtheblossomtreepropertieswehavethatp(x)=p(XV(F))YB\u2208Bp(XV(B)|XV(F))(2.1)=p(XV(F))YB\u2208Bp(XV(B)|X\u03c1(B))(2.2)=p(XV(F))YB\u2208Bp(XV(B))p(X\u03c1(B))(2.3)=Y(s,t)\u2208Fp(Xs,Xt)p(Xs)p(Xt)Ys\u2208V(F)p(Xs)YB\u2208Bp(XV(B))p(X\u03c1(B))(2.4)=Y(s,t)\u2208Fp(Xs,Xt)p(Xs)p(Xt)Ys\u2208V(F)\\P(F)p(Xs)YB\u2208Bp(XV(B)).(2.5)The\ufb01rstequalityfollowsfromdisjointnessoftheblossoms.Thesecondequalityfollowsfromtheexistenceofasinglepedicelnodeattachingtheblossomtoatree.Thefourthequalityfollowsfromthestandardfactorizationofforests,andthelastequalityfollowsfromthefactthateachnon-emptypedicelforablossomisunique.Wecallthesetofdistributionsthatfactorinthiswaythefamilyofblossomtreegraphicalmodels.Akeypropertyofthenonparanormal[5]isthatthesinglenodemarginalprobabilitiesp(Xs)arearbitrary.Thispropertyallowsustoformgraphicalmodelswhereeachblossomdistributionsatis\ufb01esXV(B)\u223cNPN(\u00b5B,\u03a3B,fB),whileenforcingthatthesinglenodemarginalofthepedicel\u03c1(B)agreeswiththemarginalsofthisnodede\ufb01nedbytheforest.Thisallowsustode\ufb01neandestimatedistributionsthatareconsistentwiththefactorization(2.5).LetX(1),...,X(n)beni.i.d.Rd-valueddatavectorssampledfromp\u2217(x)whereX(l)=(X(l)1,...,X(l)d).Ourgoalistoderiveamethodforhigh-dimensionalundirectedgraphestima-tionanddensityestimation,usingafamilyofsemiparametricestimatorsbasedontheblossomtreestructure.LetFBdenotetheblossomtreestructureF\u222a{\u222aBB}.Ourestimationprocedureisthefollowing.First,randomlypartitionthedataX(1),...,X(n)intotwosetsD1andD2ofsamplesizen1andn2.Thenapplythefollowingsteps.1.UsingD1,estimatethebivariatedensitiesp\u2217(xi,xj)usingkerneldensityestimation.Also,estimatethecovariance\u03a3ijforeachpairofvariables.ApplyKruskal\u2019salgorithmontheestimatedpairwisenegentropymatrixtoconstructafamilyofforests{bF(k)}withk=0,...,d\u22121edges;2.UsingD1,foreachforestbF(k)obtainedinStep1,buildtheblossomtree-structuredgraphbF(k)bB.TheforeststructurebF(k)ismodeledbynonparametrickerneldensityestimators,whileeachblossombB(k)iismodeledbythegraphicallassoornonparanormal.Afamilyof3\fgraphsisobtainedbycomputingregularizationpathsfortheblossoms,usingthegraphicallasso.3.UsingD2,choosebF(bk)bBfromthisfamilyofblossomtreemodelsthatmaximizestheheld-outlog-likelihood.Thedetailsofeachsteparepresentedbelow.2.1Step1:ConstructAFamilyofForestsIninformationtheoryandstatistics,negentropyisusedasameasureofdistancetonormality.ThenegentropyiszeroforGaussiandensitiesandisalwaysnonnegative.Thenegentropybetweenvari-ablesXiandXjisde\ufb01nedasJ(Xi;Xj)=H(\u03c6(xi,xj))\u2212H(p\u2217(xi,xj)),(2.6)whereH(\u00b7)denotesthedifferentialentropyofadensity,and\u03c6(xi,xj)isanGaussiandensitywiththesamemeanandcovariancematrixasp\u2217(xi,xj).Kruskal\u2019salgorithm[4]isagreedyalgorithmto\ufb01ndamaximumweightspanningtreeofaweightedgraph.Ateachstepitincludesanedgeconnectingthepairofnodeswiththemaximumweightamongallunvisitedpairs,ifdoingsodoesnotformacycle.Thealgorithmalsoresultsinthebestk-edgeweightedforestafterk<dedgeshavebeenincluded.Inoursetting,wede\ufb01netheweightw(i,j)ofnodesiandjasthenegentropybetweenXiandXj,anduseKruskal\u2019salgorithmtobuildthemaximumweightspanningforestbF(k)withkedgeswherek<d.Insuchaway,thepairsofnodesthatarelesslikelytobeabivariateGaussianareincludedintheforestandthenaremodelednonparametrically.Sincethetruedensityp\u2217isunknown,wereplacethepopulationnegentropyJ(Xi;Xj)bytheesti-matebJn1(Xi;Xj)=H(b\u03c6n1(xi,xj))\u2212bH(bpn1(xi,xj)),(2.7)whereb\u03c6n1(xi,xj)isanestimateoftheGaussiandensity\u03c6(xi,xj)forXiandXjusingD1,bpn1(xi,xj)isabivariatekerneldensityestimateforXiandXj,andbH(\u00b7)denotestheempiricaldifferentialentropy.Inparticular,let\u03a3ijbethecovariancematrixofXiandXj.Denoteb\u03a3ijn1astheempiricalcovariancematrixofXiandXjbasedonD1,thentheplug-inestimateH(b\u03c6n1(xi,xj))=1+log(2\u03c0)+12logdet(b\u03a3ijn1).(2.8)LetK(\u00b7)beaunivariatekernelfunction.Thengivenanevaluationpoint(xi,xj),thebivariatekerneldensityestimatefor(Xi,Xj)basedonobservations{X(l)i,X(l)j}l\u2208D1isgivenbybpn1(xi,xj)=1n1Xl\u2208D11h2ih2jK X(l)i\u2212xih2i!K X(l)j\u2212xjh2j!,(2.9)whereh2iandh2jarebandwidthparametersfor(Xi,Xj).TocomputetheempiricaldifferentialentropybH(bpn1(xi,xj)),wenumericallyevaluateatwo-dimensionalintegral.OncetheestimatednegentropymatrixhbJn1(Xi;Xj)id\u00d7disobtained,weapplyKruskal\u2019salgorithmtoconstructafamilyofforests{bF(k)}k=0...d\u22121.2.2Step2:BuildandModeltheBlossomTreeGraphsSupposethatwehaveaforest-structuredgraphFwith|V(F)|<dvertices.Thenforeachremain-ingnon-forestnode,weneedtodeterminewhichblossomitbelongsto.Weexploitthefollowingbasicfact.4\fProposition2.1.SupposethatX\u223cp\u2217isadensityforablossomtreegraphicalmodelwithforestF.Leti6\u2208V(F)ands\u2208V(F).ThennodeiisnotinablossomattachedtotreenodesifandonlyifXi\u22a5\u22a5Xs|Xtforsomenodet\u2208V(F)suchthat(s,t)\u2208E(F).(2.10)Weusethisproperty,togetherwithameasureofpartialcorrelation,inordertopartitionthenon-forestnodesintoblossoms.Partialcorrelationmeasuresthedegreeofassociationbetweentworandomvariables,withtheeffectofasetofcontrollingrandomvariablesremoved.Traditionally,thepartialcorrelationbetweenvariablesXiandXsgivenacontrollingvariableXtisthecorrelationbetweentheresiduals\u0001i\\tand\u0001s\\tresultingfromthelinearregressionofXiwithXtandofXswithXt,respectively.However,iftheunderlyingjointGaussianornonparanormalassumptionisnotsatis\ufb01ed,linearregressioncannotremovealloftheeffectsofthecontrollingvariable.Wethususeanonparametricversionofpartialcorrelation.Following[1],supposeXi=g(Xt)+\u0001i\\tandXs=h(Xt)+\u0001s\\t,forcertainfunctionsgandhsuchthatE(\u0001i\\t|Xt)=0andE(\u0001s\\t|Xt)=0.De\ufb01nethenonparametricpartialcorrelationas\u03c1is\u00b7t=E(\u0001i\\t\u0001s\\t).qE(\u00012i\\t)E(\u00012s\\t).(2.11)Itisshownin[1]thatifXi\u22a5\u22a5Xs|Xt,then\u03c1is\u00b7t=0.Wethusconcludethefollowing.Proposition2.2.If\u03c1is\u00b7t6=0foralltsuchthat(s,t)\u2208E(F),nodeiisinablossomattachedtonodes.Letbgandbhbelocalpolynomialestimatorsofgandh,andb\u0001(l)i\\t=X(l)i\u2212bg(X(l)t),b\u0001(l)s\\t=X(l)s\u2212bh(X(l)t)foranyl\u2208D1,thenanestimateof\u03c1is\u00b7tisgivenbyb\u03c1is\u00b7t=Xl\u2208D1(b\u0001(l)i\\tb\u0001(l)s\\t).sXl\u2208D1(b\u0001(l)i\\t)2Xl\u2208D1(b\u0001(l)s\\t)2.(2.12)BasedonProposition2.2,foreachforestbF(k)obtainedinStep1,wethenassigneachnon-forestnodeitotheblossomwiththepedicelgivenbybsi=argmaxs\u2208V(bF(k))min{t:(s,t)\u2208E(bF(k))}|b\u03c1is\u00b7t|.(2.13)Afteriteratingoverallnon-forestnodes,weobtainablossomtree-structuredgraphbF(k)bB.Thentheforeststructureisnonparametricallymodeledbythebivariateandunivariatekerneldensityes-timations,whileeachblossomismodeledwiththegraphicallassoornonparanormal.Inparticular,whenk=0thatthereisnoforestnode,ourmethodisreducedtomodelingtheentiregraphbythegraphicallassoornonparanormal.Alternativetestingproceduresbasedonnonparametricpartialcorrelationscouldbeadoptedforpar-titioningnodesintoblossoms.However,suchmethodsmayhavelargecomputationalcost,andlowpowerforsmallsamplesizes.Notethatwhileeachnon-forestnodeisassociatedwithapedicelinthisstep,aftergraphestimationfortheblossoms,thenodemaywellbecomedisconnectedfromtheforest.2.3Step3:OptimizetheBlossomTreeGraphsThefullblossomtreegraphbF(d\u22121)bBobtainedinSteps1and2mightresultinover\ufb01ttinginthedensityestimate.Thusweneedtochooseanoptimalgraphwiththenumberofforestedgesk\u2264d\u22121.Besides,thetuningparametersinvolvedinthe\ufb01ttingofeachblossombythegraphicallassoornonparanormalalsoinduceabias-variancetradeoff.5\fTooptimizetheblossomtreestructuresover{bF(k)bB}k=0...d\u22121,wechoosethecomplexityparameterbkastheonethatmaximizesthelog-likelihoodonD2,usingthefactorization(2.5):bk=argmaxk\u2208{0,...,d\u22121}1n2Xl\u2208D2log\uf8eb\uf8edY(i,j)\u2208E(bF(k))bpn1(X(l)i,X(l)j)bpn1(X(l)i)bpn1(X(l)j)\u00b7Ys\u2208V(bF(k))\\P(bF(k))bpn1(X(l)s)kYi=1b\u03c6\u03bb(k)in1(cid:0)X(l)V(bB(k)i)(cid:1)\uf8f6\uf8f8,(2.14)whereb\u03c6\u03bb(k)in1isthedensityestimateforblossomsbythegraphicallassoornonparanormal,withthetuningparameter\u03bb(k)iselectedtomaximizetheheld-outlog-likelihood.Thatis,thecomplexityofeachblossomisalsooptimizedonD2.Thusthe\ufb01nalblossomtreedensityestimatorisgivenbypbF(bk)bB(x)=Y(i,j)\u2208E(bF(bk))bpn1(xi,xj)bpn1(xi)bpn1(xj)Ys\u2208V(bF(bk))\\P(bF(bk))bpn1(X(l)s)bkYi=1b\u03c6\u03bb(bk)in1(xbB(bk)i).(2.15)Inpractice,Step3canbecarriedoutsimultaneouslywithSteps1and2.WheneveranewedgeisaddedtothecurrentforestinKruskal\u2019salgorithm,theblossomsarere-constructedandre-modeled.Thentheheld-outlog-likelihoodoftheobtaineddensityestimatorcanbeimmediatelycomputed.Inaddition,sincetherearenooverlappingnodesbetweendifferentblossoms,thesparsitytuningparametersareselectedseparatelyforeachblossom,whichreducesthecomputationalcostconsid-erably.3AnalysisofSimulatedDataHerewepresentnumericalresultsbasedonsimulations.Wecomparetheblossomtreedensityestimatorwiththegraphicallasso[3]andforestdensityestimator[7].Toevaluatetheperformanceoftheseestimators,wecomputeandcomparethelog-likelihoodofeachmethodonheld-outdata.Wesimulatehigh-dimensionaldatawhichareconsistentwithanundirectedgraph.Wegeneratemul-tivariatenon-GaussiandatausingasequenceofmixturesoftwoGaussiandistributionswithcontrarycorrelationandequalweights.ThenasubsetofvariablesarechosentogeneratetheblossomsthataredistributedasmultivariateGaussians.Indimensionald=80,wesamplen1=n2=400datapointsfromthissyntheticdistribution.Atypicalrunshowingtheheld-outlog-likelihoodandestimatedgraphsisprovidedinFigures2and3.Theterm\u201ctrunk\u201disusedtorepresenttheedgeaddedtotheforestinablossomtreegraph.Wecanseethattheblossomtreedensityestimatorissuperiortoothermethodsintermsofgeneralizationperformance.Inparticular,thegraphicallassoisunabletouncovertheedgesthataregeneratednonparametrically.Thisisexpected,sincedifferentblossomshavezerocorrelationsamongeachotherandarethusregardedasindependentbythealgorithmofgraphicallasso.ForthemodelingofthevariablesthatarecontainedinablossomandarethusmultivariateGaussiandistributed,thereisanef\ufb01ciencylossintheforestdensityestimator,comparedtothegraphicallasso.Thisillustratestheadvantageofblossomtreedensityestimator.AsisseenfromthenumberofselectededgesbyeachmethodshowninFigure2,theblossomtreedensityestimatorselectsagraphwithasimilarsparsitypatternasthetruegraph.4AnalysisofCellSignallingDataWeanalyzea\ufb02owcytometrydatasetond=11proteinsfrom[9].Asubsetofn=853cellswerechosen.Anonparanormaltransformationwasestimatedandtheobservations,foreachvariable,6\f020406080\u2212113\u2212112\u2212111\u2212110\u2212109\u2212108Number of trunksHeld out log\u2212likelihood02040608020304050607080Number of trunksNumber of selected edgesFigure2:Resultsonsimulations.Left:Held-outlog-likelihoodofthegraphicallasso(horizontaldottedline),forestdensityestimator(horizontaldashedline),andblossomtreedensityestimator(circles);Right:Numberofselectededgesbythesemethods.Thehorizontalsolidlineindicatesthenumberofedgesinthetruegraph,andthesolidtriangleindicatesthebestblossomtreegraph.The\ufb01rstcircleforblossomtreereferstothe1-trunkcase.trueglassoforestforest\u2212blossomtrueglassoforestforest\u2212blossomtrueglassoforestforest\u2212blossomtrueglassoforestforest\u2212blossom(a)true(b)glasso(c)forest(d)blossomtreeFigure3:Resultsonsimulations.Graph(a)correspondstothetruegraph.Graphs(b),(c)and(d)correspondtotheestimatedgraphsbythegraphicallasso,forestdensityestimator,andblossomtreedensityestimator,respectively.Thetreeedgesarecoloredred,andtheblossomedgesarecoloredblack.werereplacedbytheirrespectivenormalscores,subjecttoaWinsorizedtruncation[5].Westudytheassociationsamongtheproteinsusingthegraphicallasso,forestdensityestimator,andblossomtreeforestdensityestimator.Themaximumheld-outlog-likelihoodforglasso,forest,andblossomtreeare-14.3,-13.8,and-13.7,respectively.Wecanseethatblossomtreeisslighterbetterthanforestintermsofthegeneralizationperformance,bothofwhichoutperformglasso.ResultsofestimatedgraphsareprovidedinFigures4.Whenthemaximumofheld-outlog-likelihoodcurveisachieved,glassoselects28edges,forestselects7edges,andblossomtreeselects10edges.Thetwographsuncoveredbyforestandblossomtreeagreeonmostedges,althoughthelattercontainscycles.5ConclusionWehaveproposedacombinationoftree-basedgraphicalmodelsandGaussiangraphicalmodelstoformanewnonparametricapproachforhighdimensionaldata.Blossomtreemodelsrelaxthenor-malityassumptionandincreasestatisticalef\ufb01ciencybymodelingtheforestwithnonparametricker-neldensityestimatorsandmodelingeachblossomwiththegraphicallassoornonparanormal.Ourexperimentalresultsindicatethatthismethodcanbeapracticalalternativetostandardapproachestographanddensityestimation.7\f(a)graphreportedin[9](b)glasso(c)forest(d)blossomtreeFigure4:Resultsoncellsignallingdata.Graph(a)referstothe\ufb01ttedgraphreportedin[9].Graphs(b),(c)and(d)correspondtotheestimatedgraphsbythegraphicallasso,forestdensityestimator,andblossomtreedensityestimator,respectively.AcknowledgementsResearchsupportedinpartbyNSFgrantIIS-1116730,AFOSRgrantFA9550-09-1-0373,ONRgrantN000141210762,andanAmazonAWSinEducationMachineLearningResearchgrant.References[1]WicherBergsma.Anoteonthedistributionofthepartialcorrelationcoef\ufb01cientwithnonpara-metricallyestimatedmarginalregressions.arXiv:1101.4616,2011.[2]T.TonyCai,TengyuanLiang,andHarrisonH.Zhou.Lawoflogdeterminantofsamplecovariancematrixandoptimalestimationofdifferentialentropyforhigh-dimensionalgaussiandistributions.arXiv:1309.0482,2013.[3]JeromeH.Friedman,TrevorHastie,andRobertTibshirani.Sparseinversecovarianceestima-tionwiththegraphicallasso.Biostatistics,9(3):432\u2013441,2008.[4]JosephB.Kruskal.Ontheshortestspanningsubtreeofagraphandthetravelingsalesmanproblem.InProceedingsoftheAmericanMathematicalSociety,volume7,pages48\u201350,1956.[5]HanLiu,JohnLafferty,andLarryWasserman.Thenonparanormal:Semiparametricestimationofhighdimensionalundirectedgraphs.JournalofMachineLearningResearch,10:2295\u20132328,2009.[6]HanLiu,LarryWasserman,andJohnD.Lafferty.Exponentialconcentrationformutualinfor-mationestimationwithapplicationtoforests.InAdvancesinNeuralInformationProcessingSystems(NIPS),2012.[7]HanLiu,MinXu,HaijieGu,AnupamGupta,JohnLafferty,andLarryWasserman.Forestdensityestimation.JournalofMachineLearningResearch,12:907\u2013951,2011.[8]NicolaiMeinshausenandPeterB\u00a8uhlmann.Highdimensionalgraphsandvariableselectionwiththelasso.AnnalsofStatistics,34(3),2006.[9]KarenSachs,OmarPerez,DanaPe\u2019er,DouglasA.Lauffenburger,andGarryP.Nolan.Causalprotein-signalingnetworksderivedfrommultiparametersingle-celldata.Science,308(5721):523\u2013529,2003.[10]MingYuanandYiLin.ModelselectionandestimationintheGaussiangraphicalmodel.Biometrika,94(1):19\u201335,2007.8\f", "award": [], "sourceid": 793, "authors": [{"given_name": "Zhe", "family_name": "Liu", "institution": "University of Chicago"}, {"given_name": "John", "family_name": "Lafferty", "institution": "University of Chicago"}]}