38:[["$","audio",null,{"id":"tts"}],["$","$L3d",null,{"paperID":"2003.00113","publisher":"arxiv","paperJSON":{"title":"Structure-Adaptive Sequential Testing for Online False Discovery Rate Control","paperID":"2003.00113","avgLineHeight":21.68,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"style":{"width":"87%"},"width":1562,"height":956,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/0-0.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Keywords: ","element":"span"},{"text":"Alpha–investing; Conditional local false discovery rate; Covariate–assisted infer-","element":"span"}],[{"text":"ence; Structured multiple testing; Time series anomaly detection","element":"span"}]]},{"heading":"1 Introduction","paragraphs":[[{"text":"The online testing problem is concerned with the investigation of a possibly infinite stream of null hypotheses ","element":"span"},{"style":{"height":17.6},"width":251.22,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/1-0.png","element":"img","alt":" {H1, H2, · · · }","inline":true,"padRight":true},{"text":"in an ongoing manner based on sequentially collected data ","element":"span"},{"style":{"height":17.6},"width":262.77,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/1-1.png","element":"img","alt":"{X1, X2, · · · }.","inline":true,"padRight":true},{"text":"At each time point, the investigator must make a real-time decision after ","element":"span"},{"style":{"height":14.62},"width":48.15,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/1-2.png","element":"img","alt":"Xt","inline":true,"padRight":true},{"text":"arrives, without knowing future data ","element":"span"},{"style":{"height":17.6},"width":339.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/1-3.png","element":"img","alt":" {Xt+1, Xt+2, · · · }.","inline":true,"padRight":true},{"text":"The control of multiplicity in sequential testing typically involves imposing serial constraints on error rates over time, which requires that, for example, the family wise error rate (FWER) or false discovery rate (FDR; ","element":"span"},{"href":"#id-0","referenceIndex":4,"text":"Benjamini and Hochberg, ","element":"a"},{"href":"#id-0","referenceIndex":4,"text":"1995) ","element":"a"},{"text":"must fall below a pre–specified level ","element":"span"},{"style":{"height":12.8},"width":145.65,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/1-4.png","element":"img","alt":" α at all","inline":true,"padRight":true},{"text":"decision points.","element":"span"}],[{"text":"The online testing problem may arise from a range of applications. For example, the quality preserving database (QPD) framework ","element":"span"},{"href":"#id-1","referenceIndex":1,"text":"(Aharoni et al., ","element":"a"},{"href":"#id-1","referenceIndex":1,"text":"2010) ","element":"a"},{"text":"$3e","element":"span"}],[{"text":"Large-scale testing under the online setup poses several new issues that are not present in conventional “offline” setup. First, a real–time decision must be made before the next data point arrives. This makes conventional step–wise testing methods no longer applicable. For instance, the well–known Holm’s procedure ","element":"span"},{"href":"#id-2","referenceIndex":11,"text":"(Holm, ","element":"a"},{"href":"#id-2","referenceIndex":11,"text":"1979) ","element":"a"},{"text":"for FWER control and Benjamini– Hochberg’s procedure for FDR control both involve first ordering ","element":"span"},{"style":{"fontStyle":"italic"},"text":"all ","element":"span"},{"text":"observed ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"$3f","element":"span"}],[{"text":"The online FDR control problem has received much recent attention and great progresses have been made. ","element":"span"},{"text":"The alpha-investing (AI) idea ","element":"span"},{"href":"#id-3","referenceIndex":9,"text":"(Foster and Stine, ","element":"a"},{"href":"#id-3","referenceIndex":9,"text":"2008) ","element":"a"},{"text":"and its various generalizations ","element":"span"},{"href":"#id-4","referenceIndex":2,"text":"(Aharoni and Rosset, ","element":"a"},{"href":"#id-4","referenceIndex":2,"text":"2014; ","element":"a"},{"href":"#id-5","referenceIndex":19,"text":"Ramdas et al., ","element":"a"},{"href":"#id-5","referenceIndex":19,"text":"2017; ","element":"a"},{"href":"#id-6","referenceIndex":13,"text":"Javanmard et al., ","element":"a"},{"href":"#id-6","referenceIndex":13,"text":"2018) ","element":"a"},{"text":"have served as the basic framework and proved to be effective. Carefully designed AI rules are capable of handling an infinite stream of hypotheses and incorporating informative domain knowledge into the dynamic decision-making process. Beginning with a pre–specified alpha– wealth, the key idea in AI algorithms is that each rejection gains extra alpha–wealth, which may be subsequently used to make more discoveries at later time points. The generalized AI (GAI) algorithms ","element":"span"},{"href":"#id-4","referenceIndex":2,"text":"(Aharoni and Rosset, ","element":"a"},{"href":"#id-4","referenceIndex":2,"text":"2014; ","element":"a"},{"href":"#id-7","referenceIndex":21,"text":"Robertson and Wason, ","element":"a"},{"href":"#id-7","referenceIndex":21,"text":"2018; ","element":"a"},{"href":"#id-8","referenceIndex":18,"text":"Lynch et al., ","element":"a"},{"href":"#id-8","referenceIndex":18,"text":"2017) ","element":"a"},{"text":"are developed for a wider class of pay-out functions, enabling the construction of new online rules with increased power. The GAI++ framework ","element":"span"},{"href":"#id-5","referenceIndex":19,"text":"(Ramdas et al., ","element":"a"},{"href":"#id-5","referenceIndex":19,"text":"2017) ","element":"a"},{"text":"improves the power of GAI methods uniformly and is capable of dealing with more general settings. The new class of weighted GAI++ methods are flexibly designed to allow “indecisions” and are capable of integrating prior domain knowledge. To alleviate the “piggybacking” and “alpha–death” issues of AI rules, ","element":"span"},{"href":"#id-5","referenceIndex":19,"text":"Ramdas et al. ","element":"a"},{"href":"#id-5","referenceIndex":19,"text":"(2017) ","element":"a"},{"text":"discussed the concept of decaying memory FDR. To effectively incorporate structural information into online inference, the SAFFRON procedure ","element":"span"},{"href":"#id-9","referenceIndex":20,"text":"(Ramdas et al., ","element":"a"},{"href":"#id-9","referenceIndex":20,"text":"2018) ","element":"a"},{"text":"derived a sequence of thresholds that are adaptive to estimated sparsity levels and showed that the power can be much improved.","element":"span"}],[{"text":"This article develops a new class of structure–adaptive sequential testing (SAST) rules for online FDR control with several new features. First, in contrast with existing AI and GAI rules whose building blocks are ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values, the class of SAST rules are built upon the conditional local false discovery rate (Clfdr), which optimally adapts to important local structures in the data stream. Second, the sequential rejection rule based on Clfdr leads to a novel alpha– investing framework that is fundamentally different from that in ","element":"span"},{"href":"#id-3","referenceIndex":9,"text":"Foster and Stine ","element":"a"},{"href":"#id-3","referenceIndex":9,"text":"(2008)","element":"a"},{"text":". The new framework precisely characterizes the tradeoffs between different actions in online decision making, which provides key insights for designing more powerful online FDR rules. The new AI framework also reveals that SAST automatically avoids the “alpha–death” issue in the sense that its operation always reserves budget to reject new hypotheses, and can proceed in an ongoing manner to any time point in the future. Finally, by adaptively learning from past experiences and dynamically allocating the alphawealth, SAST can effectively avoid the “piggybacking” issue and improve its performance as more data are acquired. Our theoretical and numerical results demonstrate that SAST is effective for online FDR control, and achieves substantial power gain over existing methods in many settings.","element":"span"}],[{"text":"The article is organized as follows. ","element":"span"},{"text":"Section 2 first introduces the model and problem formulation, and then develops the oracle SAST procedure for online FDR control by assuming that model parameters are known. Section 3 discusses computational algorithms, proposes the data-driven SAST rule and establishes its theoretical properties. Simulation is conducted in Section 4 to investigate the finite sample performance of SAST and compare it with existing methods. SAST is illustrated in Section 5 through applications for identifying differentially expressed genes and detecting anomalies in time series data. The proofs are provided in the online supplementary material.","element":"span"}]]},{"heading":"2 Oracle and Adaptive Rules for Online FDR Control","paragraphs":[[{"text":"We first describe the model and problem formulation in Section ","element":"span"},{"href":"#id-10","text":"2.1, ","element":"a"},{"text":"then discuss three key elements in the proposed SAST rule in turn: a new test statistic to capture the structural information in the data stream (Sections 2.2 and 2.3); a new alpha–investing framework to characterize the gains and losses in sequential decision making (Section 2.4); and a new adaptive learning algorithm to optimize the alpha–wealth allocation (Sections 2.5).","element":"span"}],[{"id":"id-10","style":{"fontWeight":"bold"},"text":"2.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Model and Problem Formulation","element":"span"}],[{"text":"Denote ","element":"span"},{"style":{"fontStyle":"italic"},"text":"T ","element":"span"},{"text":"a continuous temporal domain and ","element":"span"},{"style":{"height":14},"width":112.74,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-0.png","element":"img","alt":" t ∈ T","inline":true,"padRight":true},{"text":"a time point. Let ","element":"span"},{"style":{"height":14},"width":130.92,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-1.png","element":"img","alt":" T ⊂ T","inline":true,"padRight":true},{"text":"be a discrete, ordered and evenly spaced index set for time labels","element":"span"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-2.png","element":"img","alt":"1","inline":true},{"text":". Suppose we are interested in testing a sequence of null hypotheses ","element":"span"},{"style":{"height":17.6},"width":230.34,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-3.png","element":"img","alt":" {Ht : t ∈ T}","inline":true,"padRight":true},{"text":"based on data stream ","element":"span"},{"style":{"height":17.6},"width":303.71,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-4.png","element":"img","alt":" XXX = (Xt : t ∈ T","inline":true},{"text":"). To describe the true states of nature, define Bernoulli variables ","element":"span"},{"style":{"height":17.6},"width":486.17,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-5.png","element":"img","alt":" θt, where θt = 0/1 if Ht","inline":true,"padRight":true},{"text":"is true/false. Let ","element":"span"},{"style":{"height":17.6},"width":490.47,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-6.png","element":"img","alt":" {πt ≡ P(θt = 1) : t ∈ T }","inline":true,"padRight":true},{"text":"denote the local sparsity levels that may vary over time. The observations can be described using a hierarchical model:","element":"span"}],[{"id":"id-11","style":{"width":"78%"},"width":1390,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-7.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.02},"width":204.36,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-8.png","element":"img","alt":" F0 and F1t","inline":true,"padRight":true},{"text":"are the null and non-null distributions, respectively. Denote ","element":"span"},{"style":{"height":16.4},"width":269.46,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-9.png","element":"img","alt":" f0 and f1t the","inline":true,"padRight":true},{"text":"corresponding density functions. We assume that ","element":"span"},{"style":{"height":14.62},"width":45.06,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-10.png","element":"img","alt":" F0","inline":true,"padRight":true},{"text":"is known and identical for all ","element":"span"},{"style":{"height":16.8},"width":190.15,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-11.png","element":"img","alt":" t ∈ T . By","inline":true,"padRight":true},{"text":"contrast, ","element":"span"},{"style":{"height":16.4},"width":188.8,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-12.png","element":"img","alt":" πt and f1t","inline":true,"padRight":true},{"text":"can vary smoothly in ","element":"span"},{"style":{"height":14},"width":115.95,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/4-13.png","element":"img","alt":" t ∈ T .","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Remark 1. ","element":"span"},{"text":"The inhomogeneity assumption reflects that signals may either vary in strengths or arrive at different rates over time. This structural information can be highly informative. The smoothness assumption makes it possible for pooling information from the observations in the neighborhood of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". We do not impose further assumptions on ","element":"span"},{"style":{"height":15.02},"width":193.01,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-0.png","element":"img","alt":" πt and F1t","inline":true},{"text":", both of which will be estimated non-parametrically.","element":"span"}],[{"text":"Let ","element":"span"},{"style":{"height":18.73},"width":492.9,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-1.png","element":"img","alt":" XXXt = (Xi : i ∈ T; i ≤ t","inline":true},{"text":") be the collection of summary statistics (e.g. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"–values or ","element":"span"},{"style":{"fontStyle":"italic"},"text":"z","element":"span"},{"text":"–values) up to time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". Consider a class of online decision rules ","element":"span"},{"style":{"height":19.53},"width":575.95,"height":48.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-2.png","element":"img","alt":" δδδ = {δt(XXXt) : t ∈ T} ∈ {0, 1}T,","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":18.73},"width":102.16,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-3.png","element":"img","alt":" δt(XXXt","inline":true},{"text":") represents a ","element":"span"},{"style":{"fontStyle":"italic"},"text":"real-time decision ","element":"span"},{"text":"in the sense that ","element":"span"},{"style":{"height":15.02},"width":31.4,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-4.png","element":"img","alt":" δt","inline":true,"padRight":true},{"text":"only depends on information available at time ","element":"span"},{"style":{"height":15.6},"width":177.91,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-5.png","element":"img","alt":" t, with δt","inline":true,"padRight":true},{"text":"= 1 indicating that ","element":"span"},{"style":{"height":14.62},"width":48.27,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-6.png","element":"img","alt":" Ht","inline":true,"padRight":true},{"text":"is rejected and ","element":"span"},{"style":{"height":15.02},"width":31.4,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-7.png","element":"img","alt":" δt","inline":true,"padRight":true},{"text":"= 0 otherwise. Denote ","element":"span"},{"style":{"height":19.13},"width":546.46,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-8.png","element":"img","alt":"δδδt = {δi(XXXi) : i ∈ T; i ≤ t}","inline":true,"padRight":true},{"text":"the collection of decisions up to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". The online FDR problem is","element":"span"}],[{"text":"concerned with the performance of a stream of real–time decisions. For decisions up to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", let","element":"span"}],[{"id":"id-12","style":{"width":"69%"},"width":1241,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-9.png","element":"img"}],[{"text":"where the superscript “t” denotes that the FDR is evaluated at a specific time point. The goal is to construct a real–time decision rule ","element":"span"},{"style":{"height":18.73},"width":393.94,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-10.png","element":"img","alt":" δδδ = {δt(XXXt) : t ∈ T}","inline":true,"padRight":true},{"text":"that controls the FDR","element":"span"},{"style":{"height":15.28},"width":68.9,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-11.png","element":"img","alt":"t at","inline":true,"padRight":true},{"text":"level ","element":"span"},{"style":{"height":13.2},"width":265.56,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-12.png","element":"img","alt":" α for all t ∈ T","inline":true},{"text":". To compare the power of different testing rules, define the average power","element":"span"}],[{"text":"(AP) and missed discovery rate (MDR) as","element":"span"}],[{"id":"id-26","style":{"width":"79%"},"width":1414,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-13.png","element":"img"}],[{"text":"To simplify the discussion, throughout this section we assume that the distributional information such as the non-null proportion ","element":"span"},{"style":{"height":10.22},"width":36.87,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-14.png","element":"img","alt":" πt","inline":true,"padRight":true},{"text":"and density function ","element":"span"},{"style":{"height":16.4},"width":33.36,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/5-15.png","element":"img","alt":" ft","inline":true,"padRight":true},{"text":"in Model ","element":"span"},{"href":"#id-11","text":"2.1 ","element":"a"},{"text":"are known. Section 3 considers the case where model parameters are unknown and discusses in detail related estimation and implementation issues.","element":"span"}],[{"id":"id-43","style":{"fontWeight":"bold"},"text":"2.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"The oracle rule for simultaneous testing","element":"span"}],[{"text":"The goal of this section is to justify the fundamental role of Clfdr as the building block of the proposed online FDR rule.","element":"span"}],[{"text":"The online decision-making process is complicated due to the serial constraints on FDR and absence of future data. To focus on the essential issue, we first consider an ideal setup where a hypothetical oracle observes ","element":"span"},{"style":{"fontStyle":"italic"},"text":"all data in a local neighborhood at once ","element":"span"},{"text":"and makes ","element":"span"},{"style":{"fontStyle":"italic"},"text":"a batch of simultaneous decisions","element":"span"},{"text":". ","element":"span"},{"text":"Let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d ","element":"span"},{"text":"denote the size of a neighborhood. ","element":"span"},{"text":"Consider the collection of hypotheses in a neighborhood prior to ","element":"span"},{"style":{"height":17.6},"width":745.89,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-0.png","element":"img","alt":" t∗ ≥ d: {Hi : t∗ − d + 1 ≤ i ≤ t∗}.","inline":true,"padRight":true},{"text":"Denote the neighborhood ","element":"span"},{"style":{"height":18.4},"width":560.94,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-1.png","element":"img","alt":" Nd(t∗) = {t∗ − d + 1, · · · , t∗}","inline":true,"padRight":true},{"text":"and the simultaneous decisions ","element":"span"},{"style":{"height":13.28},"width":90.89,"height":33.21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-2.png","element":"img","alt":" δδδ∗ =","inline":true},{"style":{"height":18.89},"width":499.66,"height":47.22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-3.png","element":"img","alt":"{δ∗i : i ∈ Nd(t∗)}, where δ∗i ","inline":true,"padRight":true},{"text":"is allowed to depend on the entire ","element":"span"},{"style":{"height":18.4},"width":616.97,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-4.png","element":"img","alt":" d-vector XXX∗ = {Xi : i ∈ Nd(t∗)}.","inline":true,"padRight":true},{"text":"Unlike ","element":"span"},{"href":"#id-12","text":"(2.2)","element":"a"},{"text":", we only require that the FDR is controlled for the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d ","element":"span"},{"text":"simultaneous decisions:","element":"span"}],[{"style":{"width":"70%"},"width":1254,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-5.png","element":"img"}],[{"text":"where the superscript “s” indicates a simultaneous–type FDR concept.","element":"span"}],[{"text":"The simultaneous testing of multiple hypotheses can be conceptualized as a two-stage inferential process: firstly ranking all hypotheses according to a significance index and secondly choosing a cutoff along the ordered sequence. This process can be described by a thresholding","element":"span"}],[{"text":"rule of the form","element":"span"}],[{"style":{"width":"29%"},"width":533,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-6.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":17.6},"width":45.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-7.png","element":"img","alt":" I(·","inline":true},{"text":") is an indicator function, Λ","element":"span"},{"style":{"height":8.4},"width":12,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-8.png","element":"img","alt":"i","inline":true,"padRight":true},{"text":"is the significance index of ","element":"span"},{"style":{"height":15.02},"width":175.88,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-9.png","element":"img","alt":" Hi and c","inline":true,"padRight":true},{"text":"is the cutoff of Λ","element":"span"},{"style":{"height":8.4},"width":12,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-10.png","element":"img","alt":"i","inline":true},{"text":". For example, the BH procedure uses the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-value as the significance index to order the hypotheses, and implements a step-up algorithm to determine a data-driven cutoff ","element":"span"},{"style":{"fontStyle":"italic"},"text":"c","element":"span"},{"text":".","element":"span"}],[{"text":"However, the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-value is inefficient for online FDR analysis as it fails to capture the important structural information in the data stream. We propose to use the conditional local false discovery rate (Clfdr) as the significance index to order the hypotheses:","element":"span"}],[{"style":{"width":"81%"},"width":1443,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-11.png","element":"img"}],[{"text":"Denote Clfdr","element":"span"},{"style":{"height":19.15},"width":291.42,"height":47.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-12.png","element":"img","alt":"(1), · · · , Clfdr(d)","inline":true,"padRight":true},{"text":"the ordered Clfdr values in ","element":"span"},{"style":{"height":20.75},"width":585.89,"height":51.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/6-13.png","element":"img","alt":" Nd(t∗) and H(1), · · · , H(d) the","inline":true,"padRight":true},{"text":"corresponding hypotheses. To determine the cutoff for simultaneous testing, we apply a step-","element":"span"}],[{"text":"wise algorithm","element":"span"}],[{"id":"id-13","style":{"width":"67%"},"width":1210,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/7-0.png","element":"img"}],[{"text":"Then the threshold is ","element":"span"},{"style":{"height":19.15},"width":233.64,"height":47.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/7-1.png","element":"img","alt":" c = Clfdr(k)","inline":true,"padRight":true},{"text":"and we reject ","element":"span"},{"style":{"height":18.75},"width":266.72,"height":46.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/7-2.png","element":"img","alt":" H(1), · · · , H(k)","inline":true},{"text":". The Clfdr rule ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"may be viewed as an oracle rule that sees all data in a local neighborhood at once and then makes simultaneous decisions. In Appendix ","element":"span"},{"text":"C, ","element":"span"},{"text":"we establish the optimality property of the Clfdr rule for simultaneous testing under the “offline” setup. An infinite data stream can be approximately by sequential data points arrived in batches. Intuitively, the Clfdr statistic provides a good building block for developing new online sequential testing rules as it is optimal for simultaneous inference in each batch of data points.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Remark 2. ","element":"span"},{"text":"In the “offline” setup for simultaneous testing with a covariate sequence, which includes the Clfdr rule ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"as a special case, ","element":"span"},{"href":"#id-14","referenceIndex":6,"text":"Cai et al. ","element":"a"},{"href":"#id-14","referenceIndex":6,"text":"(2019) ","element":"a"},{"text":"develops asymptotic optimality theory. We can similarly show that ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"is asymptotically optimal in the sense that it achieves the benchmark of a hypothetical oracle. However, the optimality issue in the online setup, which depends on many other factors such as the optimal allocation of alpha–wealth and prediction of future patterns over time, is still an open issue and requires much research.","element":"span"}],[{"id":"id-54","style":{"fontWeight":"bold"},"text":"2.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Adapting to local structures by Clfdr: an illustration","element":"span"}],[{"text":"The incorporation of structural information and domain knowledge promises to improve the power of existing FDR procedures ","element":"span"},{"href":"#id-15","referenceIndex":10,"text":"(Genovese et al., ","element":"a"},{"href":"#id-15","referenceIndex":10,"text":"2006; ","element":"a"},{"href":"#id-16","referenceIndex":5,"text":"Cai and Sun, ","element":"a"},{"href":"#id-16","referenceIndex":5,"text":"2009; ","element":"a"},{"href":"#id-17","referenceIndex":12,"text":"Hu et al., ","element":"a"},{"href":"#id-17","referenceIndex":12,"text":"2010; ","element":"a"},{"href":"#id-18","referenceIndex":16,"text":"Lei and Fithian, ","element":"a"},{"href":"#id-18","referenceIndex":16,"text":"2018; ","element":"a"},{"href":"#id-14","referenceIndex":6,"text":"Cai et al., ","element":"a"},{"href":"#id-14","referenceIndex":6,"text":"2019)","element":"a"},{"text":". For example, the works by ","element":"span"},{"href":"#id-17","referenceIndex":12,"text":"Hu et al. ","element":"a"},{"href":"#id-17","referenceIndex":12,"text":"(2010)","element":"a"},{"text":", ","element":"span"},{"href":"#id-19","referenceIndex":17,"text":"Li and ","element":"a"},{"href":"#id-19","referenceIndex":17,"text":"Barber ","element":"a"},{"href":"#id-19","referenceIndex":17,"text":"(2019) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-20","referenceIndex":26,"text":"Xia et al. ","element":"a"},{"href":"#id-20","referenceIndex":26,"text":"(2020) ","element":"a"},{"text":"showed that the weighted ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values can be constructed to capture the varying sparsity levels of ordered or grouped hypotheses. In contrast with the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-value, the Clfdr takes into account important structural information such as ","element":"span"},{"style":{"height":16.4},"width":302.98,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/7-3.png","element":"img","alt":" πt and ft, which","inline":true,"padRight":true},{"text":"makes Clfdr an ideal building block for multiple testing with inhomogeneous data streams. We present an example to illustrate the advantage of the Clfdr rule.","element":"span"}],[{"text":"Consider the following situation where the data stream ","element":"span"},{"style":{"height":17.6},"width":391.02,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/7-4.png","element":"img","alt":" {X1, X2, . . . , Xt, . . .}","inline":true,"padRight":true},{"text":"obeys a ran-","element":"span"}],[{"text":"dom mixture model with varying sparsity levels:","element":"span"}],[{"id":"id-21","style":{"width":"67%"},"width":1205,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-0.png","element":"img"}],[{"text":"Model ","element":"span"},{"href":"#id-21","text":"(2.7) ","element":"a"},{"text":"is a special case of Model ","element":"span"},{"href":"#id-11","text":"(2.1)","element":"a"},{"text":": the null and alternative densities are fixed and the dynamic part is fully captured by the varying proportion ","element":"span"},{"style":{"height":10.22},"width":36.88,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-1.png","element":"img","alt":" πt","inline":true},{"text":". The key idea of Clfdr and weighted p-value (in the form of ","element":"span"},{"style":{"height":17.6},"width":310.53,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-2.png","element":"img","alt":" pt/wt, where wt","inline":true,"padRight":true},{"text":"is the weight for ","element":"span"},{"style":{"height":14.62},"width":48.27,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-3.png","element":"img","alt":" Ht","inline":true},{"text":") is to upweight the hypotheses in a local neighborhood where signals appear more frequently (e.g. in clusters).","element":"span"}],[{"text":"To compare the effectiveness of different weighting methods, we simulate a data stream for testing ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"= 5000 hypotheses. The top row in Figure ","element":"span"},{"href":"#id-22","text":"1 ","element":"a"},{"text":"sets ","element":"span"},{"style":{"height":14.22},"width":136.42,"height":35.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-4.png","element":"img","alt":" πt = 0.","inline":true},{"text":"5 in blocks [1001 : 1150], [2001 : 2150], [3001 : 3100] and [4001 : 4150], and ","element":"span"},{"style":{"height":14.22},"width":136.5,"height":35.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-5.png","element":"img","alt":" πt = 0.","inline":true},{"text":"01 elsewhere. We vary ","element":"span"},{"style":{"height":16.4},"width":224.22,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-6.png","element":"img","alt":" µ from 2 to","inline":true,"padRight":true},{"text":"4. The bottom row sets ","element":"span"},{"style":{"height":16.4},"width":382.93,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-7.png","element":"img","alt":" µ = 2.5 and vary πt","inline":true,"padRight":true},{"text":"from 0.2 to 0.9 in the above blocks. The block structure is highly informative and can be exploited by Clfdr and weighted p-values to improve the power. We apply the following methods at FDR level ","element":"span"},{"style":{"height":12},"width":120.08,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-8.png","element":"img","alt":" α = 0.","inline":true},{"text":"05 by assuming that the model parameters in ","element":"span"},{"href":"#id-21","text":"(2.7) ","element":"a"},{"text":"are known: BH ","element":"span"},{"href":"#id-0","referenceIndex":4,"text":"(Benjamini and Hochberg, ","element":"a"},{"href":"#id-0","referenceIndex":4,"text":"1995)","element":"a"},{"text":", the structure–adaptive BH algorithm (SABHA; ","element":"span"},{"href":"#id-19","referenceIndex":17,"text":"Li and Barber, ","element":"a"},{"href":"#id-19","referenceIndex":17,"text":"2019) ","element":"a"},{"text":"using weighted ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values with ","element":"span"},{"style":{"height":17.6},"width":313.65,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-9.png","element":"img","alt":" wt = 1/(1 − πt),","inline":true,"padRight":true},{"text":"the GAP method ","element":"span"},{"href":"#id-20","referenceIndex":26,"text":"(Xia et al., ","element":"a"},{"href":"#id-20","referenceIndex":26,"text":"2020) ","element":"a"},{"text":"using weighted ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values with ","element":"span"},{"style":{"height":17.6},"width":503.99,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-10.png","element":"img","alt":" wt = πt/(1 − πt), and the","inline":true,"padRight":true},{"text":"Clfdr rule ","element":"span"},{"href":"#id-13","text":"(2.6)","element":"a"},{"text":". We can see that all methods control the FDR at the nominal level. In terms of the power, BH can be improved by SABHA and GAP, both of which are dominated by the Clfdr rule. Clfdr captures the varying structure in the data stream more effectively: in addition to varied ","element":"span"},{"style":{"height":10.22},"width":36.88,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-11.png","element":"img","alt":" πt","inline":true},{"text":", it also adapts to ","element":"span"},{"style":{"height":16.4},"width":33.36,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/8-12.png","element":"img","alt":" ft","inline":true},{"text":", leading to further power improvement.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"2.4 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"A new alpha–investing framework","element":"span"}],[{"text":"Existing FDR methods such as the BH and Clfdr procedures are simultaneous inference procedures that involve first ordering the significance indices (","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-value or Clfdr) of all hypotheses and then applying a step-wise algorithm to the ordered sequence to determine the threshold. However, the ranking and thresholding strategy cannot be applied to the online setting where the investigator must make real–time decisions without seeing future observations. This section discusses how to avoid the overflow of the FDR at any given time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"and how to efficiently","element":"span"}],[{"id":"id-22","style":{"width":"92%"},"width":1648,"height":1090,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/9-0.png","element":"img"}],[{"text":"Figure 1: ","element":"figcaption","subtype":"caption"},{"text":"Structure–adaptiveness: Clfdr vs weighted p-values.","element":"figcaption","subtype":"caption"}],[{"text":"allocate the alpha–wealth to increase the power.","element":"span"}],[{"text":"We start with a novel interpretation of the alpha–investing idea by recasting the Clfdr algorithm ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"as a varying–capacity knapsack process. Denote ","element":"span"},{"style":{"height":17.6},"width":523.27,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/9-1.png","element":"img","alt":" Rt ⊂ {H1, H2, · · · , Ht} the","inline":true,"padRight":true},{"text":"collection of rejected hypotheses at time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". The decision process ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"can be conceptualized as a sequence of comparisons of two quantities: the nominal FDR level ","element":"span"},{"style":{"height":8.4},"width":28,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/9-2.png","element":"img","alt":" α","inline":true,"padRight":true},{"text":"and the average of","element":"span"}],[{"text":"the rejected Clfdr values. Specifically, ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"motivates us to consider the constraint","element":"span"}],[{"id":"id-23","style":{"width":"71%"},"width":1264,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/9-3.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"Ave","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"text":") denotes the average of the elements in set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"A","element":"span"},{"text":". The simultaneous testing setup is only concerned with one constraint at the last time point when all data have been observed. By contrast, the online setup poses a series of constraints, e.g. ","element":"span"},{"href":"#id-23","text":"(2.8) ","element":"a"},{"text":"must be fulfilled for every ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"to avoid the overflow of FDR","element":"span"},{"href":"#id-12","style":{"height":19.28},"width":130.49,"height":48.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/9-4.png","element":"img","alt":"t (2.2).","inline":true}],[{"text":"We view ","element":"span"},{"href":"#id-23","text":"(2.8) ","element":"a"},{"text":"as a dynamic decision process resembling a knapsack problem, where ","element":"span"},{"style":{"height":14.62},"width":127.96,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/9-5.png","element":"img","alt":" Ht can","inline":true}],[{"text":"only be rejected when the following constraint is satisfied:","element":"span"}],[{"id":"id-24","style":{"width":"80%"},"width":1438,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-0.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":16.4},"width":344.75,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-1.png","element":"img","alt":" Ct is the capacity","inline":true,"padRight":true},{"text":"(of the knapsack) at time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"with the default choice ","element":"span"},{"style":{"height":15.02},"width":263.34,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-2.png","element":"img","alt":" C1 = 0. The","inline":true,"padRight":true},{"text":"capacity may either expand or shrink over time, depending on the sequential decisions along the data stream. This dynamic process can be described as follows. The initial capacity is ","element":"span"},{"style":{"height":15.02},"width":48.19,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-3.png","element":"img","alt":"C1","inline":true,"padRight":true},{"text":"= 0. Starting from ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"= 1, we reject ","element":"span"},{"style":{"height":14.62},"width":48.27,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-4.png","element":"img","alt":" Ht","inline":true,"padRight":true},{"text":"if ","element":"span"},{"href":"#id-24","text":"(2.9) ","element":"a"},{"text":"is fulfilled. If ","element":"span"},{"style":{"height":14.62},"width":48.27,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-5.png","element":"img","alt":" Ht","inline":true,"padRight":true},{"text":"with Clfdr","element":"span"},{"style":{"height":12.22},"width":100.42,"height":30.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-6.png","element":"img","alt":"t < α","inline":true,"padRight":true},{"text":"is rejected, then the capacity ","element":"span"},{"style":{"height":15.02},"width":43.19,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-7.png","element":"img","alt":" Ct","inline":true,"padRight":true},{"text":"increases by ","element":"span"},{"style":{"height":15.02},"width":192.3,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-8.png","element":"img","alt":" α − Clfdrt","inline":true,"padRight":true},{"text":"(gain); hence we earn bonus room. By contrast, if ","element":"span"},{"style":{"height":14.62},"width":48.27,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-9.png","element":"img","alt":" Ht","inline":true,"padRight":true},{"text":"with Clfdr","element":"span"},{"style":{"height":12.22},"width":100.42,"height":30.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-10.png","element":"img","alt":"t > α","inline":true,"padRight":true},{"text":"is rejected, then ","element":"span"},{"style":{"height":15.02},"width":43.19,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-11.png","element":"img","alt":" Ct","inline":true,"padRight":true},{"text":"decreases by ","element":"span"},{"style":{"height":17.6},"width":322.76,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-12.png","element":"img","alt":" α − Clfdrt (loss).","inline":true}],[{"text":"The decision process ","element":"span"},{"href":"#id-24","text":"(2.9) ","element":"a"},{"text":"provides a new alpha–investing framework that precisely characterizes the gains and losses in sequential testing. In contrast with the alpha–investing framework in ","element":"span"},{"href":"#id-3","referenceIndex":9,"text":"Foster and Stine ","element":"a"},{"href":"#id-3","referenceIndex":9,"text":"(2008)","element":"a"},{"text":", which views each rejection as a gain of extra alpha–wealth, the new characterization ","element":"span"},{"href":"#id-24","text":"(2.9) ","element":"a"},{"text":"reveals that not all rejections are created equal: rejections with small Clfdr will lead to increased alpha–wealth whereas rejections with large Clfdr will lead to decreased alpha–wealth. This view provides key insights for designing more powerful online FDR rules. Moreover, the new AI framework reveals that utilizing Clfdr rules can automatically avoid the “alpha–death” issue. ","element":"span"},{"text":"Specifically, the process ","element":"span"},{"href":"#id-24","text":"(2.9) ","element":"a"},{"text":"can always reject new hypotheses with Clfdr ","element":"span"},{"style":{"height":10.4},"width":77.88,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-13.png","element":"img","alt":" < α","inline":true,"padRight":true},{"text":"regardless of the current budget, and can proceed in an ongoing manner to any time point in the future.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"2.5 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Oracle–assisted adaptive learning and the SAST algorithm","element":"span"}],[{"text":"To efficiently allocate the alpha–wealth, we need to further refine the online algorithm ","element":"span"},{"href":"#id-24","text":"(2.9) ","element":"a"},{"text":"to avoid making imprudent rejections that can potentially eat up all the budget. The specific issue is referred to as “piggybacking” ","element":"span"},{"href":"#id-5","referenceIndex":19,"text":"(Ramdas et al., ","element":"a"},{"href":"#id-5","referenceIndex":19,"text":"2017)","element":"a"},{"text":", which, in a vivid way, describes the phenomenon that a string of bad decisions were made due to previously acquired budget.","element":"span"}],[{"text":"To see the necessity of taking careful actions, suppose that we have accumulated some bonus room over time before observing a very large Clfdr","element":"span"},{"style":{"height":8},"width":12,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-14.png","element":"img","alt":"t","inline":true,"padRight":true},{"text":"satisfying ","element":"span"},{"href":"#id-24","text":"(2.9)","element":"a"},{"text":". Although rejecting ","element":"span"},{"style":{"height":14.62},"width":48.27,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/10-15.png","element":"img","alt":"Ht","inline":true,"padRight":true},{"text":"is an action that obeys the FDR constraint, the action can be unwise since it is possible that we can invest the extra “cost”, Clfdr","element":"span"},{"style":{"height":10.62},"width":92.76,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-0.png","element":"img","alt":"t − α","inline":true},{"text":", to make more discoveries at later time points.","element":"span"}],[{"text":"A practical strategy is to incorporate a “barrier” ","element":"span"},{"style":{"height":11.6},"width":34.59,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-1.png","element":"img","alt":" γt","inline":true,"padRight":true},{"text":"and modify ","element":"span"},{"href":"#id-24","text":"(2.9) ","element":"a"},{"text":"as","element":"span"}],[{"id":"id-27","style":{"width":"77%"},"width":1381,"height":101,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-2.png","element":"img"}],[{"text":"The barrier can effectively prevent “piggybacking” by filtering out large Clfdr","element":"span"},{"style":{"height":15.02},"width":233.3,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-3.png","element":"img","alt":"t and hence","inline":true,"padRight":true},{"text":"saving budget for future.","element":"span"}],[{"text":"The choice of ","element":"span"},{"style":{"height":11.6},"width":34.59,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-4.png","element":"img","alt":" γt","inline":true,"padRight":true},{"text":"depends on the pattern of future hypotheses. However, all online methods must proceed without seeing the future. To resolve the issue, consider the oracle Clfdr rule ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"that sees all data in a local neighborhood at once. If we assume that the hypothesis stream is “locally stable” in its patterns, then ","element":"span"},{"style":{"height":11.6},"width":34.59,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-5.png","element":"img","alt":" γt","inline":true,"padRight":true},{"text":"may be informed by the oracle rule ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"style":{"fontStyle":"italic"},"text":"simultaneously ","element":"span"},{"text":"conducted on a local neighborhood ","element":"span"},{"style":{"height":18.4},"width":482.38,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-6.png","element":"img","alt":" Nd(t) = {t − d − 1, · · · , t}","inline":true},{"text":". The rationale is to use recent past data to get some ideas about the patterns of hypotheses to arrive in the near future. Concretely, we first order ","element":"span"},{"style":{"height":18.4},"width":303.27,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-7.png","element":"img","alt":" {Hi : i ∈ Nd(t)}","inline":true,"padRight":true},{"text":"according to their Clfdr values, then run the “offline” algorithm ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"to set the barrier ","element":"span"},{"style":{"height":19.15},"width":289.53,"height":47.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-8.png","element":"img","alt":" γt = Clfdr(k+1)","inline":true},{"text":". The online algorithm, by acting as if it sees the future, can effectively filter out large Clfdr values and hence avoid inefficient investments. The operation of algorithm ","element":"span"},{"href":"#id-13","text":"(2.6) ","element":"a"},{"text":"also implies that the barrier ","element":"span"},{"style":{"height":11.6},"width":34.59,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-9.png","element":"img","alt":" γt","inline":true,"padRight":true},{"text":"may be either raised or lowered according to the varied ","element":"span"},{"style":{"height":16.4},"width":168.08,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-10.png","element":"img","alt":" πt and ft","inline":true,"padRight":true},{"text":"in the dynamic model, which is desirable in practice for dealing with inhomogeneous data streams. In Section ","element":"span"},{"href":"#id-25","text":"4.3, ","element":"a"},{"text":"we illustrate that the incorporating of the barrier can greatly reduce the MDR ","element":"span"},{"href":"#id-26","text":"(2.3)","element":"a"},{"text":".","element":"span"}],[{"text":"Finally, we present the proposed structure–adaptive sequential testing (SAST) rule (oracle version with known parameters) in Algorithm 1. The SAST algorithm essentially utilizes the sequential constraints ","element":"span"},{"href":"#id-27","text":"(2.10) ","element":"a"},{"text":"with barriers set by the offline algorithm ","element":"span"},{"href":"#id-13","text":"(2.6)","element":"a"},{"text":".","element":"span"}],[{"text":"We can see that Algorithm 1 runs two parallel procedures: an online procedure for making real–time decisions and an “offline” procedure for determining the barrier. Thus the information of every data point has been used twice: first ","element":"span"},{"style":{"height":14.62},"width":48.15,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-11.png","element":"img","alt":" Xt","inline":true,"padRight":true},{"text":"is used for real–time decision–making at time ","element":"span"},{"style":{"height":15.6},"width":191.4,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-12.png","element":"img","alt":" t, then Xt","inline":true,"padRight":true},{"text":"is stored as past data so that we can “learn from experiences” via the offline oracle. The following theorem shows that Algorithm 1 is valid for online FDR control.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Theorem 1. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider the online FDR procedure ","element":"span"},{"style":{"height":17.6},"width":479.55,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/11-13.png","element":"img","alt":" δδδ = (δt : t ∈ T), where δt","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is determined by","element":"span"}],[{"style":{"width":"96%"},"width":1717,"height":578,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-0.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Algorithm 1. Denote ","element":"span"},{"style":{"height":18.73},"width":423.06,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-1.png","element":"img","alt":" δδδt = (δi : i ≤ t; i ∈ T)","inline":true},{"style":{"fontStyle":"italic"},"text":". Assume that the Clfdr values are known. Then we have FDR","element":"span"},{"style":{"height":19.28},"width":445.47,"height":48.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-2.png","element":"img","alt":"t(δδδt) ≤ α, for all t ∈ T.","inline":true}]]},{"heading":"3 Data-Driven SAST and Its Theoretical Properties","paragraphs":[[{"text":"We first develop estimation methodologies and computational algorithms to implement the SAST rule in Section ","element":"span"},{"href":"#id-28","text":"3.1, ","element":"a"},{"text":"then establish the theoretical properties of the data-driven procedure in Section ","element":"span"},{"href":"#id-29","text":"3.2.","element":"a"}],[{"id":"id-28","style":{"fontWeight":"bold"},"text":"3.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Data-driven procedure and computational algorithms","element":"span"}],[{"text":"We assume that the null distribution of ","element":"span"},{"style":{"height":16.4},"width":200.98,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-3.png","element":"img","alt":" z-values f0","inline":true,"padRight":true},{"text":"is known, which is a standard practice in the literature","element":"span"},{"style":{"height":8.4},"width":17,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-4.png","element":"img","alt":"2","inline":true},{"text":". The key quantities remained to be estimated are ","element":"span"},{"style":{"height":17.6},"width":216.85,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-5.png","element":"img","alt":" πt and ft(x","inline":true},{"text":"). In our motivating applications such as queries of QPDs and anomaly detection in high–frequency time series, the databases or servers have already collected large amounts of data at the beginning of the online FDR analysis. Let ","element":"span"},{"style":{"height":17.67},"width":413.62,"height":44.18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-6.png","element":"img","alt":" {X−K0, · · · , X−1, X0}","inline":true,"padRight":true},{"text":"denote the available data and suppose we start online testing at ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"= 1 with a data stream ","element":"span"},{"style":{"height":19.13},"width":274.42,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-7.png","element":"img","alt":" {X1, X2, . . .}3.","inline":true}],[{"text":"The conditional density ","element":"span"},{"style":{"height":16.4},"width":33.36,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/12-8.png","element":"img","alt":" ft","inline":true,"padRight":true},{"text":"can be estimated using standard (one–sided) bivariate kernel","element":"span"}],[{"text":"methods ","element":"span"},{"href":"#id-30","referenceIndex":22,"text":"(Silverman, ","element":"a"},{"href":"#id-30","referenceIndex":22,"text":"1986)","element":"a"},{"text":":","element":"span"}],[{"id":"id-34","style":{"width":"72%"},"width":1295,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-0.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.02},"width":148.44,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-1.png","element":"img","alt":" d ≤ K0","inline":true,"padRight":true},{"text":"is the length of the moving window that includes a pre-specified number of observations, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"K","element":"span"},{"text":"(","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":") is a kernel function, ","element":"span"},{"style":{"height":15.02},"width":173.62,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-2.png","element":"img","alt":" ht and hx","inline":true,"padRight":true},{"text":"are the bandwidths, with ","element":"span"},{"style":{"height":19.13},"width":385.71,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-3.png","element":"img","alt":" Kh(t) = h−1K(t/h).","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Remark 3. ","element":"span"},{"text":"In analysis of large-scale high-frequency time series data such as the NYC taxi","element":"span"}],[{"text":"data (Section ","element":"span"},{"text":"5)","element":"span"},{"text":", we can pre-specify ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d","element":"span"},{"text":", say, to be 1000 to speed up the computation. This virtually has no impact on the estimator ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":16.4},"width":33.36,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-4.png","element":"img","alt":"ft","inline":true,"padRight":true},{"text":"(compared to using all previous data). Otherwise we can always set ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". Note that our estimator has followed the standard practice in density estimation, which does not include ","element":"span"},{"style":{"height":14.62},"width":48.15,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-5.png","element":"img","alt":" Xt","inline":true,"padRight":true},{"text":"when estimating ","element":"span"},{"style":{"height":17.6},"width":289.5,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-6.png","element":"img","alt":" ft(x) at time t.","inline":true}],[{"text":"Next we propose a weighted screening approach to estimate the unknown proportion ","element":"span"},{"style":{"height":17.6},"width":85.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-7.png","element":"img","alt":" {πt :","inline":true},{"style":{"height":17.6},"width":123.04,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-8.png","element":"img","alt":"t ∈ T}","inline":true},{"text":". The key idea is to use a kernel, which weights observations by their distance to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":", to pool information from nearby time points. Let ","element":"span"},{"style":{"height":15.02},"width":37.14,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-9.png","element":"img","alt":" ht","inline":true,"padRight":true},{"text":"be the bandwidth","element":"span"},{"style":{"height":15.13},"width":156.69,"height":37.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-10.png","element":"img","alt":"4 and K","inline":true,"padRight":true},{"text":"a kernel function satisfying","element":"span"},{"style":{"height":20.28},"width":949.64,"height":50.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-11.png","element":"img","alt":"�K(t)dt = 1,�tK(t)dt = 0 and�t2K(t)dt < ∞","inline":true},{"text":". Consider a screening procedure ","element":"span"},{"style":{"height":17.6},"width":925.84,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-12.png","element":"img","alt":"Tt(τ) = {t − d + 1 ≤ i ≤ t − 1 : Pi > τ}, where τ","inline":true,"padRight":true},{"text":"is a pre-specified threshold. We propose the following estimator based on ","element":"span"},{"href":"#id-14","referenceIndex":6,"text":"Cai et al. ","element":"a"},{"href":"#id-14","referenceIndex":6,"text":"(2019)","element":"a"},{"text":":","element":"span"}],[{"id":"id-31","style":{"width":"70%"},"width":1252,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-13.png","element":"img"}],[{"text":"Now we provide some intuitions of the estimator ","element":"span"},{"href":"#id-31","text":"(3.2)","element":"a"},{"text":". First, at time ","element":"span"},{"style":{"height":17.6},"width":349.12,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-14.png","element":"img","alt":" t, define vh(t, i) =","inline":true},{"style":{"height":17.64},"width":303.67,"height":44.09,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-15.png","element":"img","alt":"Kht(|t − i|)/Kht","inline":true},{"text":"(0). We can view ","element":"span"},{"style":{"height":22.8},"width":413.02,"height":57.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-16.png","element":"img","alt":" mt = �t−1i=t−d+1 vh(t, i","inline":true},{"text":") as the “total” number of observations ","element":"span"},{"text":"at time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". Suppose we are interested in counting how many null ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values are greater than ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-17.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"among the ","element":"span"},{"style":{"height":10.62},"width":50.31,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-18.png","element":"img","alt":" mt","inline":true,"padRight":true},{"text":"“observations” at ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". The empirical count is given by ","element":"span"},{"style":{"height":19.95},"width":236.61,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-19.png","element":"img","alt":"�i∈Tτ vh(t, i","inline":true},{"text":"), whereas the ","element":"span"},{"text":"expected count is given by ","element":"span"},{"style":{"height":22.8},"width":667.8,"height":57.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-20.png","element":"img","alt":" {�t−1i=t−d+1 vh(t, i)}{1 − πt}(1 − τ).","inline":true,"padRight":true},{"text":"Equation ","element":"span"},{"href":"#id-31","text":"(3.2) ","element":"a"},{"text":"can be derived ","element":"span"},{"text":"by first setting equal the expected and empirical counts and then solving for ","element":"span"},{"style":{"height":10.22},"width":36.88,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/13-21.png","element":"img","alt":" πt","inline":true},{"text":". In Section","element":"span"}],[{"id":"id-35","style":{"width":"99%"},"width":1775,"height":200,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-0.png","element":"img"}],[{"text":"which always underestimates ","element":"span"},{"style":{"height":10.22},"width":36.88,"height":25.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-1.png","element":"img","alt":" πt","inline":true,"padRight":true},{"text":"and guarantees (conservative) FDR control (Propositions ","element":"span"},{"href":"#id-32","text":"1)","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Remark 4. ","element":"span"},{"text":"There is a bias-variance tradeoff in the choice of ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-2.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"for the proposed estimator ˆ","element":"span"},{"style":{"height":16.25},"width":59.14,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-3.png","element":"img","alt":"πτt .","inline":true}],[{"text":"We shall see that when ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-4.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"increases, the “purity” of the screening subset ","element":"span"},{"style":{"height":17.6},"width":74.83,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-5.png","element":"img","alt":" T (τ","inline":true},{"text":") increases, which decreases the approximation bias of ","element":"span"},{"style":{"height":16.25},"width":44.44,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-6.png","element":"img","alt":" πτt ","inline":true,"padRight":true},{"text":"(desirable). At the same time, when ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-7.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"increases, the ","element":"span"},{"text":"sample size for estimating ","element":"span"},{"style":{"height":16.25},"width":44.44,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-8.png","element":"img","alt":" πτt ","inline":true,"padRight":true},{"text":"will decrease, thereby increasing the variance of the estimator ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":16.25},"width":44.44,"height":40.62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-9.png","element":"img","alt":"πτt ","inline":true,"padRight":true},{"text":"(undesirable). The common choice of ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-10.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"is 0.5. In Section ","element":"span"},{"href":"#id-33","text":"4.1, ","element":"a"},{"text":"we discuss a data–driven ","element":"span"},{"text":"algorithm that chooses ","element":"span"},{"style":{"height":8},"width":23,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-11.png","element":"img","alt":" τ","inline":true,"padRight":true},{"text":"adaptively.","element":"span"}],[{"text":"Combining ","element":"span"},{"href":"#id-34","text":"(3.1) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-31","text":"(3.2)","element":"a"},{"text":", we propose to estimate the Clfdr as","element":"span"}],[{"style":{"width":"72%"},"width":1296,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-12.png","element":"img"}],[{"text":"Our proposed data-driven rule implements Algorithm 1 by substituting ","element":"span"},{"style":{"height":15.02},"width":110.3,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-13.png","element":"img","alt":"�Clfdrt","inline":true,"padRight":true},{"text":"in place of Clfdr","element":"span"},{"style":{"height":8},"width":12,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-14.png","element":"img","alt":"t","inline":true},{"text":". The data-driven algorithm is summarized in Algorithm 2.","element":"span"}],[{"style":{"width":"92%"},"width":1653,"height":794,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/14-15.png","element":"img"}],[{"id":"id-29","style":{"fontWeight":"bold"},"text":"3.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Theoretical properties of data–driven SAST","element":"span"}],[{"text":"This section aims to show that the data–driven SAST procedure is asymptotically valid for online FDR control. Our theoretical analysis is divided into three steps. The first step (Propo-","element":"span"}],[{"text":"sition ","element":"span"},{"href":"#id-32","text":"1) ","element":"a"},{"text":"shows that a hypothetical rule, which substitutes","element":"span"}],[{"id":"id-36","style":{"width":"62%"},"width":1113,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-0.png","element":"img"}],[{"text":"in place of Clfdr","element":"span"},{"style":{"height":8},"width":12,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-1.png","element":"img","alt":"t","inline":true,"padRight":true},{"text":"in Algorithm 1, is conservative for online FDR control.","element":"span"}],[{"id":"id-32","href":"#id-35","style":{"height":17.6},"width":897.11,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-2.png","element":"img","alt":"Proposition 1. Consider πτt defined by (3.3)","inline":true},{"style":{"fontStyle":"italic"},"text":", then we have ","element":"span"},{"style":{"height":16.99},"width":576.64,"height":42.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-3.png","element":"img","alt":" πτt ≤ πt and Clfdrt ≤ Clfdrτt .","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"Hence the hypothetical rule using ","element":"span"},{"href":"#id-36","text":"(3.5) ","element":"a"},{"style":{"fontStyle":"italic"},"text":"is valid (and conservative) for online FDR control.","element":"span"}],[{"text":"The second step (Proposition ","element":"span"},{"href":"#id-37","text":"2) ","element":"a"},{"text":"shows that ","element":"span"},{"style":{"height":15.02},"width":110.3,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-4.png","element":"img","alt":"�Clfdrt","inline":true,"padRight":true},{"text":"is a consistent estimator of Clfdr","element":"span"},{"style":{"height":16.99},"width":32.7,"height":42.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-5.png","element":"img","alt":"τt .","inline":true,"padRight":true},{"text":"We prove the result by appealing to the infill–asymptotics framework ","element":"span"},{"href":"#id-38","referenceIndex":23,"text":"(Stein, ","element":"a"},{"href":"#id-38","referenceIndex":23,"text":"2012)","element":"a"},{"text":", which converts the set of time points ","element":"span"},{"style":{"height":17.6},"width":226.85,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-6.png","element":"img","alt":" {1, 2, · · · , t}","inline":true,"padRight":true},{"text":"on a growing domain to a set of points that lie on a fixed-domain regular grid: ","element":"span"},{"style":{"height":21.29},"width":297.66,"height":53.23,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-7.png","element":"img","alt":" { 1t , 2t , ..., t−1t , 1}","inline":true},{"text":". The discussions in ","element":"span"},{"href":"#id-38","referenceIndex":23,"text":"Stein ","element":"a"},{"href":"#id-38","referenceIndex":23,"text":"(2012) ","element":"a"},{"text":"indicate that ","element":"span"},{"text":"the in-fill model is equivalent to the growing domain model under mild conditions: When ","element":"span"},{"style":{"height":11.6},"width":131.84,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-8.png","element":"img","alt":"t → ∞","inline":true},{"text":", the asymptotic arguments, which respectively correspond to letting the grid become denser and denser in the fixed interval (0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"1] and letting the domain ","element":"span"},{"style":{"height":17.6},"width":226.85,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-9.png","element":"img","alt":" {1, 2, · · · , t}","inline":true,"padRight":true},{"text":"to grow to infinity, can be essentially established in the same manner. We state the fixed domain theory as it naturally connects to the familiar density estimation theory, where the notations and regularity conditions are standard and easy to understand. The growing domain version of the theory is briefly discussed in Appendix ","element":"span"},{"href":"#id-39","text":"B.3.","element":"a"}],[{"text":"We can similarly define the bivariate density estimator and the following conditional proportion estimator:","element":"span"}],[{"id":"id-40","style":{"width":"71%"},"width":1274,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-10.png","element":"img"}],[{"text":"The two estimators ","element":"span"},{"href":"#id-31","text":"(3.2) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-40","text":"(3.6) ","element":"a"},{"text":"are essentially identical (with rescaled bandwidths).","element":"span"}],[{"text":"We state the following regularity conditions. Condition (A1) requires that ","element":"span"},{"style":{"height":17.6},"width":77.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/15-11.png","element":"img","alt":" ft(x","inline":true},{"text":") is smooth in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". Conditions (A2) to (A4) are standard in density estimation theory; see, for example, ","element":"span"},{"href":"#id-41","referenceIndex":25,"text":"(Wand and Jones, ","element":"a"},{"href":"#id-41","referenceIndex":25,"text":"1994)","element":"a"},{"text":".","element":"span"}],[{"style":{"height":17.6},"width":775.22,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-0.png","element":"img","alt":"(A1): For any s ∈ (0, 1] and ϵ > 0, ∃δ","inline":true,"padRight":true},{"text":"such that if ","element":"span"},{"style":{"height":19.6},"width":742.99,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-1.png","element":"img","alt":" |s − s′| ≤ δ, s′ ∈ (0, 1] then�|fs(x) −","inline":true}],[{"style":{"fontStyle":"italic"},"text":"f","element":"span"},{"style":{"height":17.6},"width":234.94,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-2.png","element":"img","alt":"s′(x)|dx < ϵ.","inline":true}],[{"style":{"height":17.6},"width":749.44,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-3.png","element":"img","alt":"(A2): hx → 0, ht → 0 and thxht → ∞.","inline":true}],[{"style":{"width":"50%"},"width":897,"height":60,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-4.png","element":"img"}],[{"style":{"height":17.6},"width":863.01,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-5.png","element":"img","alt":"(A4): dht → ∞ and d ≥ ctht for some c > 0.","inline":true}],[{"id":"id-37","style":{"fontWeight":"bold"},"text":"Proposition 2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose (A1)–(A4) hold, then ","element":"span"},{"style":{"height":20.74},"width":307.73,"height":51.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-6.png","element":"img","alt":"�Clfdrt p−→ Clfdrτt .","inline":true}],[{"text":"In the third step of our theoretical analysis (Theorem ","element":"span"},{"href":"#id-42","text":"2)","element":"a"},{"text":", we establish the asymptotic validity of the data-driven SAST procedure for online FDR control.","element":"span"}],[{"id":"id-42","style":{"fontWeight":"bold"},"text":"Theorem 2. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume the conditions in Proposition ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"hold. Then for any given time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"style":{"fontStyle":"italic"},"text":", the data-driven SAST rule (Algorithm 2) controls the FDR","element":"span"},{"style":{"height":15.28},"width":208.38,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-7.png","element":"img","alt":"t at level α","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"asymptotically.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"3.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Theory for data streams with fixed distributions","element":"span"}],[{"text":"SAST learns from past decisions and improves its performance over time through the assistance from an offline oracle. The barrier ","element":"span"},{"style":{"height":11.6},"width":34.59,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-8.png","element":"img","alt":" γt","inline":true,"padRight":true},{"text":"would become more informative as more tests are conducted. Specifically, the initial barrier is set to be ","element":"span"},{"style":{"height":12},"width":226.24,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-9.png","element":"img","alt":" α at time t","inline":true,"padRight":true},{"text":"= 1, which is very conservative. In the special case when the mixture model has fixed ","element":"span"},{"style":{"height":16.4},"width":176.64,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-10.png","element":"img","alt":" πt and ft","inline":true,"padRight":true},{"text":"over time, we can show that the barrier ","element":"span"},{"style":{"height":11.6},"width":34.59,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-11.png","element":"img","alt":" γt","inline":true,"padRight":true},{"text":"would converge to ","element":"span"},{"style":{"height":16},"width":310.73,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-12.png","element":"img","alt":" γOR, where γOR","inline":true,"padRight":true},{"text":"is the optimal threshold of the “offline” oracle procedure in Section ","element":"span"},{"href":"#id-43","text":"2.2. ","element":"a"},{"text":"Hence, provided that the capacity allows, the operation of ","element":"span"},{"href":"#id-27","text":"(2.10) ","element":"a"},{"text":"implies that SAST behaves like an oracle that sees all data points (including future ones). Our numerical results show that the FDR levels of SAST are conservative at the beginning but the FDR becomes closer to ","element":"span"},{"style":{"height":8.4},"width":28,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-13.png","element":"img","alt":" α","inline":true,"padRight":true},{"text":"as we sequentially update the barrier with information from more time points.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Theorem 3. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Assume conditions from Theorem ","element":"span"},{"href":"#id-42","style":{"fontStyle":"italic"},"text":"2 ","element":"a"},{"style":{"fontStyle":"italic"},"text":"holds. Then under the model with ","element":"span"},{"style":{"height":11.02},"width":128.39,"height":27.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-14.png","element":"img","alt":" πt ≡ π","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"and ","element":"span"},{"style":{"height":16.4},"width":118.77,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-15.png","element":"img","alt":" ft ≡ f","inline":true},{"style":{"fontStyle":"italic"},"text":", the data-driven barrier ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":16},"width":655.06,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/16-16.png","element":"img","alt":"γt → γOR when t → ∞, where γOR","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is the optimal threshold of the oracle FDR procedure for simultaneous testing defined in Section ","element":"span"},{"href":"#id-43","style":{"fontStyle":"italic"},"text":"2.2.","element":"a"}]]},{"heading":"4 Simulation","paragraphs":[[{"text":"In this section, we first provide some details in implementation. Simulation studies are conducted in Section 4.2 to compare the oracle and data-driven SAST procedures with other existing online FDR rules. Section 4.3 presents an example to illustrate the merit of including a barrier in online sequential testing.","element":"span"}],[{"id":"id-33","style":{"fontWeight":"bold"},"text":"4.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Implementation Details","element":"span"}],[{"text":"In our simulation, the conditional density function ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":17.6},"width":77.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-0.png","element":"img","alt":"ft(x","inline":true},{"text":") is estimated using R function ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"density","element":"span"},{"text":", where the bandwidths ","element":"span"},{"style":{"height":15.02},"width":189.74,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-1.png","element":"img","alt":" hx and ht","inline":true,"padRight":true},{"text":"are chosen based on ","element":"span"},{"href":"#id-30","referenceIndex":22,"text":"Silverman ","element":"a"},{"href":"#id-30","referenceIndex":22,"text":"(1986)","element":"a"},{"text":". A key step in the SAST algorithm is to estimate ˆ","element":"span"},{"style":{"height":12.33},"width":44.44,"height":30.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-2.png","element":"img","alt":"πτ","inline":true},{"text":". We propose to choose a data-driven ","element":"span"},{"style":{"height":10.3},"width":76.18,"height":25.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-3.png","element":"img","alt":" τBH","inline":true,"padRight":true},{"text":"by running BH at ","element":"span"},{"style":{"height":12},"width":121.96,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-4.png","element":"img","alt":" α = 0.","inline":true},{"text":"5. Roughly speaking, in the subset ","element":"span"},{"text":"˜","element":"span"},{"style":{"height":17.6},"width":921.32,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-5.png","element":"img","alt":"Tt(τBH) = {t − d + 1 ≤ i ≤ t − 1 : Pi < τ}, 50%","inline":true,"padRight":true},{"text":"of the cases come from the null (e.g. the expected proportion of false positives made by BH). It is anticipated that in the remaining set ","element":"span"},{"style":{"height":17.6},"width":961.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-6.png","element":"img","alt":" Tt(τ) = {t − d + 1 ≤ i ≤ t − 1 : Pi > τBH}, which","inline":true,"padRight":true},{"text":"is used to construct our estimator, majority of the cases should come from the null. This data-driven scheme ensures a small bias in approximation, while maintaining a larger sample size compared to the standard choice of ","element":"span"},{"style":{"height":12.4},"width":149.95,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-7.png","element":"img","alt":" τ = 0.5.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"4.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Comparisons of online FDRs and MDRs","element":"span"}],[{"text":"We compare the proposed SAST procedure with its competitors for online FDR control. The following methods are included in the comparison:","element":"span"}],[{"text":"• ","element":"span"},{"text":"SAST with known ","element":"span"},{"style":{"height":16.4},"width":171.86,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/17-8.png","element":"img","alt":" πt and ft","inline":true,"padRight":true},{"text":"(SAST.OR, Algorithm 1)","element":"span"}],[{"text":"• ","element":"span"},{"text":"SAST with estimated model parameters (SAST.DD, Algorithm 2)","element":"span"}],[{"text":"• ","element":"span"},{"text":"LOND: the method proposed by Javanmard, A. and Montanari, A. (2016).","element":"span"}],[{"text":"• ","element":"span"},{"text":"LORD++: the GAI++ rule proposed by ","element":"span"},{"href":"#id-5","referenceIndex":19,"text":"Ramdas et al. ","element":"a"},{"href":"#id-5","referenceIndex":19,"text":"(2017)","element":"a"},{"text":".","element":"span"}],[{"text":"For the general simulation setup, we choose ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"= 5000 and the pre-specified FDR level","element":"span"}],[{"style":{"height":12},"width":120.08,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-0.png","element":"img","alt":"α = 0.","inline":true},{"text":"05. The data are simulated from the following model:","element":"span"}],[{"style":{"width":"35%"},"width":635,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-1.png","element":"img"}],[{"text":"For the data–driven method, we need an initial burn–in period. In simulation we generate 500 data points prior to ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"= 1 to form an initial density estimate. The varying density and proportion estimates are updated every 200 time points. The following simulation settings are considered:","element":"span"}],[{"style":{"width":"96%"},"width":1722,"height":217,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-2.png","element":"img"}],[{"text":"2. ","element":"span"},{"style":{"height":16.4},"width":1022.81,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-3.png","element":"img","alt":" Constant Pattern: πt = 0.05, t = 1, · · · , m. Vary µ","inline":true,"padRight":true},{"text":"from 2 to 4.2 with step size 0","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"2.","element":"span"}],[{"text":"3. ","element":"span"},{"style":{"height":16.4},"width":493.52,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-4.png","element":"img","alt":" Linear Pattern: Vary πt","inline":true,"padRight":true},{"text":"linearly from 0 to 0.5. Vary ","element":"span"},{"style":{"height":12},"width":26,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-5.png","element":"img","alt":" µ","inline":true,"padRight":true},{"text":"from 2 to 4.2 with step size 0","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"5.","element":"span"}],[{"text":"4. ","element":"span"},{"style":{"height":21.29},"width":764.21,"height":53.22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-6.png","element":"img","alt":" Sine Pattern: πt = (sin 2πtm + 1)/4, πt","inline":true,"padRight":true},{"text":"ranges between 0 to 0.5, vary ","element":"span"},{"style":{"height":12},"width":26,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-7.png","element":"img","alt":" µ","inline":true,"padRight":true},{"text":"from 2 to 4.2","element":"span"}],[{"style":{"width":"18%"},"width":338,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-8.png","element":"img"}],[{"text":"We apply different methods at ","element":"span"},{"style":{"height":12},"width":120.07,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-9.png","element":"img","alt":" α = 0.","inline":true},{"text":"05. The empirical FDR and MDR levels are evaluated using the average of the false discovery proportions and missed discovery proportions from 1000 replications. To investigate the performance of different methods in the online setting, we display the empirical FDR","element":"span"},{"style":{"height":15.28},"width":234.64,"height":38.2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-10.png","element":"img","alt":"t and MDRt ","inline":true,"padRight":true},{"text":"levels at various time points, where the intermediate evaluation points ranges from 1500 to 5000 with step size 500. ","element":"span"},{"text":"The results for block and constant patterns are summarized in Figure ","element":"span"},{"href":"#id-44","text":"2, ","element":"a"},{"text":"and the results for the linear and sine patterns are summarized in Figure ","element":"span"},{"href":"#id-45","text":"3.","element":"a"}],[{"text":"The following observations can be made from the simulation results.","element":"span"}],[{"style":{"width":"97%"},"width":1743,"height":220,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/18-11.png","element":"img"}],[{"id":"id-44","style":{"width":"100%"},"width":1786,"height":1776,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/19-0.png","element":"img"}],[{"text":"Figure 2: Simulation results for Settings 1 and 2: signal proportions are varied in a block fashion and kept constant respectively. Various signal strengths are investigated as well. Our data-driven and oracle procedures provide significantly more power while controlling FDR under the nominal level in comparison with others.","element":"figcaption","subtype":"caption"}],[{"id":"id-45","style":{"width":"100%"},"width":1786,"height":1776,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/20-0.png","element":"img"}],[{"text":"Figure 3: Simulation results for Settings 3 and 4: signal proportions are varied in linear and sine patterns, respectively. Our data-driven and oracle procedures provide significantly more power while controlling FDR under the nominal level in comparison with others.","element":"figcaption","subtype":"caption"}],[{"style":{"width":"98%"},"width":1748,"height":1155,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/21-0.png","element":"img"}],[{"id":"id-25","style":{"fontWeight":"bold"},"text":"4.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Effects of the barrier","element":"span"}],[{"text":"This section presents a toy example to illustrate that the barrier, which aims to prevent the “piggybacking” issue ","element":"span"},{"href":"#id-5","referenceIndex":19,"text":"(Ramdas et al., ","element":"a"},{"href":"#id-5","referenceIndex":19,"text":"2017)","element":"a"},{"text":", can greatly reduce the MDR by allocating existing alpha–wealth in a more cost–effective way. Consider the previous block structured setting (Setting 1 in Section 4.2). Figure ","element":"span"},{"href":"#id-46","text":"4 ","element":"a"},{"text":"shows the FDR and MDR comparisons for the following methods at FDR level ","element":"span"},{"style":{"height":12},"width":130.51,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/21-1.png","element":"img","alt":" α = 0.","inline":true},{"text":"05: (i) oracle SAST rule (OR); (ii) oracle SAST rule with no barrier (OR nob); (iii) data-driven SAST rule with estimated parameters (DD, Section 3); (iv) data-driven SAST rule with no barrier (DD nob).","element":"span"}],[{"text":"We can see from the comparison that although the FDR levels between the two oracle methods are roughly the same, the MDR levels are greatly reduced by incorporating the barrier (hence the alpha–wealth is invested more efficiently). The same patterns can be observed for the two data-driven procedures.","element":"span"}],[{"id":"id-46","style":{"width":"100%"},"width":1791,"height":586,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/22-0.png","element":"img"}],[{"text":"Figure 4: The incorporation of the barrier greatly reduces the MDR levels.","element":"figcaption","subtype":"caption"}]]},{"heading":"5 Applications","paragraphs":[[{"text":"Online FDR rules are useful for a wide range of scenarios. ","element":"span"},{"text":"We discuss two applications, respectively for anomaly detection in large–scale time series data and genotype discovery under the QPD framework.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"5.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Time series anomaly detection","element":"span"}],[{"text":"The NYC taxi dataset can be downloaded from the Numenta Anomaly Benchmark (NAB) repository ","element":"span"},{"href":"#id-47","referenceIndex":3,"text":"(Ahmad et al., ","element":"a"},{"href":"#id-47","referenceIndex":3,"text":"2017)","element":"a"},{"text":", which contains useful tools and datasets for evaluating algorithms for anomaly detection in streaming, real–time applications. The dataset records the counts of NYC taxi passengers every 30 minutes from July 1, 2014 to January 31, 2015, during which period five known anomalies had occurred (the NYC marathon, Thanksgiving, Christmas, New Years day and a snow storm). In Figure ","element":"span"},{"href":"#id-48","text":"5, ","element":"a"},{"text":"we plot the time series, with the known anomalous intervals displayed in red rectangles.","element":"span"}],[{"text":"We formulate the anomaly detection problem as an online sequential multiple testing problem. The basic setup can be described as follows. The null hypothesis ","element":"span"},{"style":{"height":14.62},"width":48.28,"height":36.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/22-1.png","element":"img","alt":" Ht","inline":true,"padRight":true},{"text":"corresponds to no anomaly at time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". We claim that an anomaly occurs at ","element":"span"},{"style":{"height":15.02},"width":117.02,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/22-2.png","element":"img","alt":" t if Ht","inline":true,"padRight":true},{"text":"is rejected. A rejection within the red intervals is considered to be a true discovery.","element":"span"}],[{"text":"The application of online FDR rules requires summarizing the stream of counts data as a sequence of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values or CLfdr statistics. However, directly calculating the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values based","element":"span"}],[{"id":"id-48","style":{"width":"99%"},"width":1777,"height":889,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/23-0.png","element":"img"}],[{"text":"Figure 5: NYC Taxi passenger count time series from July 1st 2014 to Jan 31st 2015. Blue lines are Loess smoothed time series indicating the overall trend change.","element":"figcaption","subtype":"caption"}],[{"text":"on this dataset would be problematic as the data demonstrate strong trend and seasonality patterns. We first use the R package ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"stlplus ","element":"span"},{"text":"to carry out an STL decomposition (Seasonal Trend decomposition using Loess smoothing; ","element":"span"},{"href":"#id-49","referenceIndex":7,"text":"Cleveland et al., ","element":"a"},{"href":"#id-49","referenceIndex":7,"text":"1990) ","element":"a"},{"text":"to remove the seasonal and trend components. The residuals, displayed in the top 3 rows of Figure ","element":"span"},{"href":"#id-50","text":"6, ","element":"a"},{"text":"are standardized and modeled using a two-component mixture ","element":"span"},{"href":"#id-11","text":"(2.1)","element":"a"},{"text":". However, as can be seen from the histogram at the bottom of Figure ","element":"span"},{"href":"#id-50","text":"6, ","element":"a"},{"text":"the null distribution is approximately normal but deviates from a standard normal. Following the method in ","element":"span"},{"href":"#id-51","referenceIndex":14,"text":"Jin and Cai ","element":"a"},{"href":"#id-51","referenceIndex":14,"text":"(2007)","element":"a"},{"text":", we estimate the empirical null distribution as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"N","element":"span"},{"text":"(0","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"028","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"text":"0","element":"span"},{"style":{"fontStyle":"italic"},"text":".","element":"span"},{"text":"618). We apply the BH (pretending all observations are seen at once), LOND, LORD++ and SAST.DD at FDR level 0.0001. For the SAST.DD method, the neighborhood size ","element":"span"},{"style":{"fontStyle":"italic"},"text":"d ","element":"span"},{"text":"and initial burn-in period are both chosen to be 500. In calculating the Clfdr, ","element":"span"},{"style":{"height":17.6},"width":82.26,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/23-1.png","element":"img","alt":" f0(x","inline":true},{"text":") is taken as the density of the estimated empirical null ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":14.62},"width":45.06,"height":36.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/23-2.png","element":"img","alt":"F0","inline":true},{"text":". Moreover, the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values are obtained by the formula ","element":"span"},{"style":{"height":21.21},"width":479.19,"height":53.03,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/23-3.png","element":"img","alt":" Pi = 2 ˆF0(−|Zi|), where z","inline":true},{"text":"-scores are computed based on the residuals. Figure ","element":"span"},{"href":"#id-52","text":"7 ","element":"a"},{"text":"summarizes the anomaly points detected by different methods.","element":"span"}],[{"text":"We can see that for the several anomaly time periods labeled, SAST can detect more points than other methods. Table ","element":"span"},{"href":"#id-53","text":"1 ","element":"a"},{"text":"summarizes the total number of rejections within the labeled time windows. It may appear counter-intuitive that SAST, being an online procedure, rejects","element":"span"}],[{"id":"id-53","style":{"width":"64%"},"width":1153,"height":290,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/24-0.png","element":"img"}],[{"text":"Table 1: Number of discoveries made by various online and offline FDR procedures for the NYC taxi dataset, nominal FDR level at 0","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":".","element":"figcaption","subtype":"caption"},{"text":"0001.","element":"figcaption","subtype":"caption"}],[{"text":"more null hypotheses than the offline BH procedure. The reason is that the anomalies tend to appear in clusters. This structural information is captured by the Clfdr statistic, which forms the building block of SAST and leads to improved power in detecting structured signals (Section ","element":"span"},{"href":"#id-54","text":"2.3)","element":"a"},{"text":".","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"5.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"IMPC dataset Genotype Discovery","element":"span"}],[{"text":"In this section, we demonstrate the SAST procedure on a real dataset from the International Mouse Phenotyping Consortium (IMPC). This dataset, which has been analyzed in ","element":"span"},{"href":"#id-55","referenceIndex":15,"text":"Karp et al. ","element":"a"},{"href":"#id-55","referenceIndex":15,"text":"(2017)","element":"a"},{"text":", involves a large study to functionally annotate every protein coding gene by exploring the impact of gene knockouts. This dataset and resulting family of hypotheses are constantly growing as new results come in. ","element":"span"},{"href":"#id-55","referenceIndex":15,"text":"Karp et al. ","element":"a"},{"href":"#id-55","referenceIndex":15,"text":"(2017) ","element":"a"},{"text":"tested both the roles of genotype and sex as modifiers of genotype effects, resulting in two sets of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values: one set for testing genotype effects, and the other for sexual dimorphism. This dataset has been widely used for comparing online FDR algorithms. Currently it is available as one of the application datasets in the R-package ","element":"span"},{"style":{"fontFamily":"monospace"},"text":"OnlineFDR ","element":"span"},{"text":"that implements methods such as LORD, LOND and LORD++. In order to implement our proposed SAST procedure, we need the original ","element":"span"},{"style":{"fontStyle":"italic"},"text":"z","element":"span"},{"text":"-scores instead of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values. However, the directions of effects cannot be determined based on ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-value alone. Hence, we transform the ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values into ","element":"span"},{"style":{"fontStyle":"italic"},"text":"z","element":"span"},{"text":"-scores by introducing a Bernoulli random variable to ensure asymptotic symmetry around 0: ","element":"span"},{"style":{"height":19.13},"width":1122.35,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/24-1.png","element":"img","alt":" z = XΦ−1(p/2) − (1 − X)Φ−1(p/2), where X ∼ Ber(0.5)5.","inline":true}],[{"text":"Table ","element":"span"},{"href":"#id-56","text":"2 ","element":"a"},{"text":"summarizes the total number of discoveries made by each method. We can see that SAST makes more discoveries than other alpha–investing methods. Similar to the analysis","element":"span"}],[{"id":"id-50","style":{"width":"97%"},"width":1729,"height":1582,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/25-0.png","element":"img"}],[{"text":"Figure 6: Top three rows: Time series of remainder component from STL decomposition with the known anomaly regions marked in red rectangles. Bottom row: Histogram of the remainder term from STL decomposition, the red curve indicates the estimated empirical null distribution ","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":"N","element":"figcaption","subtype":"caption"},{"text":"(0","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":".","element":"figcaption","subtype":"caption"},{"text":"028","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":", ","element":"figcaption","subtype":"caption"},{"text":"0","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":".","element":"figcaption","subtype":"caption"},{"text":"618).","element":"figcaption","subtype":"caption"}],[{"id":"id-52","style":{"width":"99%"},"width":1777,"height":1784,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/26-0.png","element":"img"}],[{"text":"Figure 7: Anomaly points detected by various algorithms, our data-driven SAST procedure detects the most anomaly points within the labeled window marked by red rectangles. Nominal significance level chosen as 0.0001.","element":"figcaption","subtype":"caption"}],[{"text":"in Section ","element":"span"},{"text":"5, ","element":"span"},{"text":"SAST rejects more hypotheses than the offline BH procedure. ","element":"span"},{"text":"One possible explanation is that Clfdr is more powerful than ","element":"span"},{"style":{"fontStyle":"italic"},"text":"p","element":"span"},{"text":"-values since it captures useful structural information in the data stream.","element":"span"}],[{"id":"id-56","style":{"width":"60%"},"width":1079,"height":337,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/27-0.png","element":"img"}],[{"text":"Table 2: Number of discoveries made by various online and offline FDR procedures for the IMPC dataset on Genotypes, nominal FDR level at 0","element":"figcaption","subtype":"caption"},{"style":{"fontStyle":"italic"},"text":".","element":"figcaption","subtype":"caption"},{"text":"05.","element":"figcaption","subtype":"caption"}]]},{"heading":"References","paragraphs":[[{"id":"id-1","text":"Aharoni, E., Neuvirth, H., and Rosset, S. (2010). The quality preserving database: A com- ","element":"span"},{"text":"putational framework for encouraging collaboration, enhancing power and controlling false discovery. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"IEEE/ACM transactions on computational biology and bioinformatics","element":"span"},{"text":", 8(5):1431– 1437.","element":"span"}],[{"id":"id-4","text":"Aharoni, E. and Rosset, S. (2014). Generalized ","element":"span"},{"style":{"height":8.4},"width":28,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/27-1.png","element":"img","alt":" α","inline":true},{"text":"-investing: definitions, optimality results and application to public databases. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Journal of the Royal Statistical Society: Series B (Statistical Methodology)","element":"span"},{"text":", 76(4):771–794.","element":"span"}],[{"id":"id-47","text":"Ahmad, S., Lavin, A., Purdy, S., and Agha, Z. (2017). ","element":"span"},{"text":"Unsupervised real-time anomaly detection for streaming data. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Neurocomputing","element":"span"},{"text":", 262:134–147.","element":"span"}],[{"id":"id-0","text":"Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: a practical and ","element":"span"},{"text":"powerful approach to multiple testing. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J. Roy. Statist. Soc. B","element":"span"},{"text":", ","element":"span"},{"style":{"fontWeight":"bold"},"text":"57","element":"span"},{"text":":289–300.","element":"span"}],[{"id":"id-16","text":"Cai, T. T. and Sun, W. (2009). Simultaneous testing of grouped hypotheses: Finding needles ","element":"span"},{"text":"in multiple haystacks. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J. Amer. Statist. Assoc.","element":"span"},{"text":", 104:1467–1481.","element":"span"}],[{"id":"id-14","text":"Cai, T. T., Sun, W., and Wang, W. (2019). CARS: Covariate assisted ranking and screening ","element":"span"},{"text":"for large-scale two-sample inference (with discussion). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J. Roy. Statist. Soc. B","element":"span"},{"text":", 81(2):187–234.","element":"span"}],[{"id":"id-49","text":"Cleveland, R. B., Cleveland, W. S., McRae, J. E., and Terpenning, I. (1990). Stl: a seasonal- ","element":"span"},{"text":"trend decomposition. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Journal of official statistics","element":"span"},{"text":", 6(1):3–73.","element":"span"}],[{"text":"Efron, B. (2004). Large-scale simultaneous hypothesis testing: The choice of a null hypothesis. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Journal of the American Statistical Association","element":"span"},{"text":", 99(465):96–104.","element":"span"}],[{"id":"id-3","text":"Foster, D. P. and Stine, R. A. (2008). ","element":"span"},{"style":{"height":8.4},"width":28,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/28-0.png","element":"img","alt":"α","inline":true},{"text":"-investing: a procedure for sequential control of expected false discoveries. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Journal of the Royal Statistical Society: Series B (Statistical Methodology)","element":"span"},{"text":", 70(2):429–444.","element":"span"}],[{"id":"id-15","text":"Genovese, C. R., Roeder, K., and Wasserman, L. (2006). False discovery control with p-value ","element":"span"},{"text":"weighting. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Biometrika","element":"span"},{"text":", 93(3):509–524.","element":"span"}],[{"id":"id-2","text":"Holm, S. (1979). A simple sequentially rejective multiple test procedure. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Scandinavian journal of statistics","element":"span"},{"text":", pages 65–70.","element":"span"}],[{"id":"id-17","text":"Hu, J. X., Zhao, H., and Zhou, H. H. (2010). False discovery rate control with groups. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Journal of the American Statistical Association","element":"span"},{"text":", 105(491):1215–1227.","element":"span"}],[{"id":"id-6","text":"Javanmard, A., Montanari, A., et al. (2018). Online rules for control of false discovery rate ","element":"span"},{"text":"and false discovery exceedance. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"The Annals of statistics","element":"span"},{"text":", 46(2):526–554.","element":"span"}],[{"id":"id-51","text":"Jin, J. and Cai, T. T. (2007). Estimating the null and the proportional of nonnull effects in ","element":"span"},{"text":"large-scale multiple comparisons. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J. Amer. Statist. Assoc.","element":"span"},{"text":", ","element":"span"},{"style":{"fontWeight":"bold"},"text":"102","element":"span"},{"text":":495–506.","element":"span"}],[{"id":"id-55","text":"Karp, N. A., Mason, J., Beaudet, A. L., Benjamini, Y., Bower, L., Braun, R. E., Brown, ","element":"span"},{"text":"S. D., Chesler, E. J., Dickinson, M. E., Flenniken, A. M., et al. (2017). Prevalence of sexual dimorphism in mammalian phenotypic traits. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Nature communications","element":"span"},{"text":", 8:15475.","element":"span"}],[{"id":"id-18","text":"Lei, L. and Fithian, W. (2018). Adapt: an interactive procedure for multiple testing with side ","element":"span"},{"text":"information. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J. Roy. Statist. Soc. B","element":"span"},{"text":", 80(4):649–679.","element":"span"}],[{"id":"id-19","text":"Li, A. and Barber, R. F. (2019). ","element":"span"},{"text":"Multiple testing with the structure-adaptive benjamini– hochberg algorithm. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Journal of the Royal Statistical Society: Series B (Statistical Methodology)","element":"span"},{"text":", 81(1):45–74.","element":"span"}],[{"id":"id-8","text":"Lynch, G., Guo, W., Sarkar, S. K., Finner, H., et al. (2017). The control of the false discovery ","element":"span"},{"text":"rate in fixed sequence multiple testing. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Electronic Journal of Statistics","element":"span"},{"text":", 11(2):4649–4673.","element":"span"}],[{"id":"id-5","text":"Ramdas, A., Yang, F., Wainwright, M. J., and Jordan, M. I. (2017). Online control of the ","element":"span"},{"text":"false discovery rate with decaying memory. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Advances In Neural Information Processing Systems","element":"span"},{"text":", pages 5650–5659.","element":"span"}],[{"id":"id-9","text":"Ramdas, A., Zrnic, T., Wainwright, M., and Jordan, M. (2018). Saffron: an adaptive algo- ","element":"span"},{"text":"rithm for online control of the false discovery rate. In ","element":"span"},{"style":{"fontStyle":"italic"},"text":"International Conference on Machine Learning","element":"span"},{"text":", pages 4286–4294.","element":"span"}],[{"id":"id-7","text":"Robertson, D. S. and Wason, J. (2018). Online control of the false discovery rate in biomedical ","element":"span"},{"text":"research. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"arXiv preprint arXiv:1809.07292","element":"span"},{"text":".","element":"span"}],[{"id":"id-30","text":"Silverman, B. W. (1986). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Density estimation for statistics and data analysis","element":"span"},{"text":", volume 26. CRC press.","element":"span"}],[{"id":"id-38","text":"Stein, M. L. (2012). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Interpolation of spatial data: some theory for kriging","element":"span"},{"text":". Springer Science & Business Media.","element":"span"}],[{"id":"id-58","text":"Sun, W. and Cai, T. T. (2007). Oracle and adaptive compound decision rules for false discovery ","element":"span"},{"text":"rate control. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J. Amer. Statist. Assoc.","element":"span"},{"text":", ","element":"span"},{"style":{"fontWeight":"bold"},"text":"102","element":"span"},{"text":":901–912.","element":"span"}],[{"id":"id-41","text":"Wand, M. P. and Jones, M. C. (1994). ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Kernel smoothing","element":"span"},{"text":". Chapman and Hall/CRC.","element":"span"}],[{"id":"id-20","text":"Xia, Y., Cai, T. T., and Sun, W. (2020+). Gap: A general framework for information pooling ","element":"span"},{"text":"in two-sample sparse inference. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"J. Am. Statist. Assoc., to appear.","element":"span"}],[{"style":{"width":"92%"},"width":1648,"height":205,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-0.png","element":"img"}],[{"text":"This supplement contains the proofs of main theorems (Section A), other theoretical results (Section B), and optimality theory on simultaneous testing (Section C).","element":"span"}]]},{"heading":"A Proof of main theorems","paragraphs":[[{"style":{"fontWeight":"bold"},"text":"A.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Theorem 1","element":"span"}],[{"text":"Note that the Clfdr is defined as Clfdr","element":"span"},{"style":{"height":17.6},"width":299.74,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-1.png","element":"img","alt":"i = P(θi = 0|Xi","inline":true},{"text":"). Then by the definition of FDR and double expectation theorem, we have:","element":"span"}],[{"style":{"width":"42%"},"width":752,"height":135,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-2.png","element":"img"}],[{"text":"By construction of the decision rule, (","element":"span"},{"style":{"height":21.49},"width":545.33,"height":53.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-3.png","element":"img","alt":"|Rt| ∨ 1)−1 �i∈Rt Clfdri ≤ α","inline":true,"padRight":true},{"text":"for all realization of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"X","element":"span"},{"style":{"fontStyle":"italic"},"text":"XX","element":"span"},{"text":". It follows that FDR","element":"span"},{"style":{"height":13.82},"width":112.48,"height":34.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-4.png","element":"img","alt":"t ≤ α.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"A.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Theorem ","element":"span"},{"href":"#id-42","style":{"fontWeight":"bold"},"text":"2","element":"a"}],[{"text":"We need the following lemma:","element":"span"}],[{"style":{"width":"3%"},"width":62,"height":6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-5.png","element":"img"}],[{"id":"id-57","style":{"height":20.82},"width":708.45,"height":52.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-6.png","element":"img","alt":"Lemma 1. Suppose an p−→ 0 and |an|","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is bounded for all ","element":"span"},{"style":{"height":23.27},"width":240.65,"height":58.18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-7.png","element":"img","alt":" n, then limn→∞","inline":true}],[{"text":"The proof of lemma ","element":"span"},{"href":"#id-57","text":"1 ","element":"a"},{"text":"is elementary thus omitted. ","element":"span"},{"text":"By definition of our algorithm, if","element":"span"}],[{"style":{"width":"88%"},"width":1572,"height":409,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/30-8.png","element":"img"}],[{"text":"Note that Clfdr","element":"span"},{"style":{"height":8.4},"width":12,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-0.png","element":"img","alt":"i","inline":true,"padRight":true},{"text":"is a random variable from random mixture model ","element":"span"},{"href":"#id-11","text":"(2.1) ","element":"a"},{"text":"with a non-vanishing","element":"span"}],[{"text":"proportion of nonzero signals, we have","element":"span"}],[{"style":{"width":"38%"},"width":686,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-1.png","element":"img"}],[{"text":"for every ","element":"span"},{"style":{"height":19.2},"width":1594.48,"height":48.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-2.png","element":"img","alt":" M. We have P {�∞i=1 I(Clfdrτi < α − ϵ) < ∞} = 0. Now, �∞i=1 I( �Clfdri ≤ α) < ∞","inline":true,"padRight":true},{"text":"would imply ","element":"span"},{"style":{"height":17.6},"width":371.3,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-3.png","element":"img","alt":" | �Clfdri−Clfdrτi | > ϵ","inline":true,"padRight":true},{"text":"infinitely many times. By Proposition ","element":"span"},{"href":"#id-37","text":"2, ","element":"a"},{"style":{"height":17.6},"width":384.85,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-4.png","element":"img","alt":" P(| �Clfdri−Clfdrτi | >","inline":true},{"style":{"height":17.6},"width":91.86,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-5.png","element":"img","alt":"ϵ) →","inline":true,"padRight":true},{"text":"0. It follows that ","element":"span"},{"style":{"height":32},"width":973.79,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-6.png","element":"img","alt":" P��∞i=1 I( �Clfdri ≤ α) < ∞�= 0, hence |Rt| → ∞","inline":true},{"text":". By Proposition ","element":"span"},{"href":"#id-37","text":"2","element":"a"}],[{"style":{"width":"71%"},"width":1277,"height":250,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-7.png","element":"img"}],[{"text":"Finally, the operation of Algorithm 2 implies that","element":"span"}],[{"style":{"width":"64%"},"width":1150,"height":135,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-8.png","element":"img"}],[{"text":"FDR(","element":"span"},{"style":{"height":17.6},"width":159.23,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-9.png","element":"img","alt":"δδδ) = EX","inline":true}],[{"style":{"width":"64%"},"width":1150,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"A.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of theorem 3","element":"span"}],[{"text":"Note when both ","element":"span"},{"style":{"height":16.4},"width":166.18,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-11.png","element":"img","alt":" ft and πt","inline":true,"padRight":true},{"text":"are fixed over time, the Clfdr statistic reduces to Lfdr","element":"span"},{"style":{"height":27.65},"width":283.64,"height":69.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-12.png","element":"img","alt":"i := (1−π)f0(xi)f(xi) .","inline":true,"padRight":true},{"text":"The optimal threshold in the offline simultaneous testing setup would be independent of time ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"and the chosen neighborhood. The oracle offline rule coincides with the oracle procedure described in Section 3.2 of ","element":"span"},{"href":"#id-58","referenceIndex":24,"text":"Sun and Cai ","element":"a"},{"href":"#id-58","referenceIndex":24,"text":"(2007)","element":"a"},{"text":".","element":"span"}],[{"text":"We now introduce some notations:","element":"span"}],[{"text":"• ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":24.4},"width":872.76,"height":61.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-13.png","element":"img","alt":"U t(γ) = t−1 �ti=1( �Clfdr(i) − α)I �{Clfdr(i) < γ}","inline":true}],[{"text":"• ","element":"span"},{"style":{"height":24.53},"width":884.59,"height":61.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-14.png","element":"img","alt":" U t(γ) = t−1 �ti=1(Clfdrτ(i) − α)I{Clfdrτ(i) < γ}.","inline":true}],[{"text":"• ","element":"span"},{"style":{"height":18.73},"width":764.69,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-15.png","element":"img","alt":" U t∞(γ) = E{(Clfdrτ − α)I{Clfdrτ < γ}}.","inline":true}],[{"text":"• ","element":"span"},{"style":{"height":18.74},"width":625.48,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-16.png","element":"img","alt":" γ∞ = sup{γ ∈ (0, 1), Ut∞(γ) ≤ 0}","inline":true,"padRight":true},{"text":"is the “ideal” threshold.","element":"span"}],[{"text":"Note that ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":14.73},"width":46.55,"height":36.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-17.png","element":"img","alt":"U t ","inline":true,"padRight":true},{"text":"is discrete. To facilitate the theoretical analysis, we define, for ","element":"span"},{"style":{"height":19.15},"width":267.51,"height":47.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/31-18.png","element":"img","alt":"�Clfdr(i) < γ <","inline":true}],[{"style":{"width":"81%"},"width":1456,"height":277,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-0.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":23.56},"width":312.5,"height":58.91,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-1.png","element":"img","alt":"U ti = ˆU t( �Clfdr(i)","inline":true},{"text":"). It is easy to verify that ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":19.51},"width":55.8,"height":48.77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-2.png","element":"img","alt":"U tC ","inline":true,"padRight":true},{"text":"is continuous and monotone. Hence its ","element":"span"},{"text":"inverse ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":22.39},"width":99.54,"height":55.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-3.png","element":"img","alt":"U t,−1C","inline":true,"padRight":true},{"text":"is well defined, continuous and monotone.","element":"span"}],[{"style":{"width":"97%"},"width":1735,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-4.png","element":"img"}],[{"style":{"height":20.82},"width":795.34,"height":52.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-5.png","element":"img","alt":"Proof of (i). Note that U t(γ) p−→ U t∞(γ","inline":true},{"text":") by the WLLN, so that we only need to establish ","element":"span"},{"text":"that ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":20.82},"width":265.36,"height":52.06,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-6.png","element":"img","alt":"U t(γ) p−→ U t(γ","inline":true},{"text":"). We need to following lemma:","element":"span"}],[{"id":"id-59","style":{"width":"100%"},"width":1781,"height":163,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-7.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"Proof of Lemma ","element":"span"},{"href":"#id-59","style":{"fontWeight":"bold"},"text":"2","element":"a"},{"text":". Using the definitions of ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":15.02},"width":175.83,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-8.png","element":"img","alt":"Vi and Vi","inline":true},{"text":", we can show that","element":"span"}],[{"style":{"width":"107%"},"width":1914,"height":193,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-9.png","element":"img"}],[{"text":"Let us refer to the three sums on the right hand as ","element":"span"},{"style":{"fontStyle":"italic"},"text":"I","element":"span"},{"text":", ","element":"span"},{"style":{"fontStyle":"italic"},"text":"II","element":"span"},{"text":", and ","element":"span"},{"style":{"fontStyle":"italic"},"text":"III ","element":"span"},{"text":"respectively. By step 2 in","element":"span"}],[{"text":"the proof of Theorem 2, ","element":"span"},{"style":{"fontStyle":"italic"},"text":"I ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"o","element":"span"},{"text":"(1). Then let ","element":"span"},{"style":{"height":10.4},"width":66.46,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-10.png","element":"img","alt":" ε >","inline":true,"padRight":true},{"text":"0, and consider that","element":"span"}],[{"style":{"width":"102%"},"width":1823,"height":186,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-11.png","element":"img"}],[{"text":"The first term on the right hand is vanishingly small as ","element":"span"},{"style":{"height":15.02},"width":398.94,"height":37.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-12.png","element":"img","alt":" ε → 0 because �Clfdri","inline":true,"padRight":true},{"text":"is a continuous random variable. The second term converges to 0 by Proposition ","element":"span"},{"href":"#id-37","text":"2. ","element":"a"},{"text":"Noting that 0 ","element":"span"},{"style":{"height":15.6},"width":249.89,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/32-13.png","element":"img","alt":" ≤ �Clfdri ≤ 1,","inline":true,"padRight":true},{"text":"we conclude ","element":"span"},{"style":{"fontStyle":"italic"},"text":"II ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"o","element":"span"},{"text":"(1). In a similar fashion, we can show that ","element":"span"},{"style":{"fontStyle":"italic"},"text":"III ","element":"span"},{"text":"= ","element":"span"},{"style":{"fontStyle":"italic"},"text":"o","element":"span"},{"text":"(1), thus proving the lemma.","element":"span"}],[{"style":{"width":"79%"},"width":1420,"height":227,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-0.png","element":"img"}],[{"text":"It follows that","element":"span"}],[{"style":{"width":"59%"},"width":1066,"height":393,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-1.png","element":"img"}],[{"text":"By Proposition ","element":"span"},{"href":"#id-37","text":"2, ","element":"a"},{"style":{"height":20.8},"width":234.5,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-2.png","element":"img","alt":" E�t−1St�→","inline":true,"padRight":true},{"text":"0, applying Chebyshev’s inequality, we obtain","element":"span"}],[{"style":{"width":"22%"},"width":409,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-3.png","element":"img"}],[{"text":"establishing (i).","element":"span"}],[{"style":{"height":21.98},"width":522.85,"height":54.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-4.png","element":"img","alt":"Proof of (ii). Since ˆU tC ","inline":true,"padRight":true},{"text":"is continuous, for any ","element":"span"},{"style":{"height":10.4},"width":78.94,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-5.png","element":"img","alt":" ϵ >","inline":true,"padRight":true},{"text":"0, we can find ","element":"span"},{"style":{"height":13.6},"width":84.48,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-6.png","element":"img","alt":" η >","inline":true,"padRight":true},{"text":"0 such that ","element":"span"},{"style":{"height":32.7},"width":964.64,"height":81.74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-7.png","element":"img","alt":"�� ˆU t,−1C (0) − ˆU t,−1C �ˆU tC (γ∞)�� < ε if�� ˆU tC (γ∞)�� < η","inline":true},{"text":". It follows that","element":"span"}],[{"style":{"width":"65%"},"width":1170,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-8.png","element":"img"}],[{"text":"Proposition ","element":"span"},{"href":"#id-37","text":"2 ","element":"a"},{"text":"and the WLLN imply that ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":21.6},"width":682.79,"height":53.99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-9.png","element":"img","alt":"U tC(γ) p→ U t∞(γ). Note that U t∞ (γ∞","inline":true},{"text":") = 0, then,","element":"span"}],[{"style":{"width":"25%"},"width":452,"height":82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-10.png","element":"img"}],[{"text":"Hence, we have","element":"span"}],[{"style":{"width":"68%"},"width":1224,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/33-11.png","element":"img"}],[{"text":"completing the proof of (ii).","element":"span"}]]},{"heading":"B Proof of propositions","paragraphs":[[{"style":{"fontWeight":"bold"},"text":"B.1 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-32","style":{"fontWeight":"bold"},"text":"1","element":"a"}],[{"text":"Let ","element":"span"},{"style":{"height":17.6},"width":648.45,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-0.png","element":"img","alt":" Aτ = {x : P0(x) > τ}, where P0(x","inline":true},{"text":") is the p-value of ","element":"span"},{"style":{"fontStyle":"italic"},"text":"x","element":"span"},{"text":". Then","element":"span"}],[{"style":{"width":"66%"},"width":1180,"height":311,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-1.png","element":"img"}],[{"text":"Hence ","element":"span"},{"style":{"height":19.13},"width":1002.28,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-2.png","element":"img","alt":" πτt = 1 − (1 − τ)−1P(Pt > τ) ≤ 1 − (1 − πt) = πt.","inline":true,"padRight":true},{"text":"By definition of Clfdr","element":"span"},{"style":{"height":15.6},"width":201.74,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-3.png","element":"img","alt":"t, we have","inline":true,"padRight":true},{"text":"Clfdr","element":"span"},{"style":{"height":16.99},"width":203.41,"height":42.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-4.png","element":"img","alt":"τt ≥ Clfdrt.","inline":true}],[{"style":{"width":"95%"},"width":1708,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-5.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"fontStyle":"italic"},"text":"R ","element":"span"},{"text":"be the index set of hypotheses rejected by ","element":"span"},{"style":{"height":18.06},"width":75.64,"height":45.14,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-6.png","element":"img","alt":" δδδτOR","inline":true},{"text":". The FDR of ","element":"span"},{"style":{"height":18.06},"width":121.01,"height":45.14,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-7.png","element":"img","alt":" δδδτOR is","inline":true}],[{"style":{"width":"73%"},"width":1305,"height":710,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-8.png","element":"img"}],[{"text":"The last inequality is due to the definition of ","element":"span"},{"style":{"height":18.05},"width":75.64,"height":45.14,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-9.png","element":"img","alt":" δδδτOR ","inline":true,"padRight":true},{"text":"which guarantees that","element":"span"}],[{"style":{"width":"24%"},"width":428,"height":115,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/34-10.png","element":"img"}],[{"style":{"fontWeight":"bold"},"text":"B.2 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Proposition ","element":"span"},{"href":"#id-37","style":{"fontWeight":"bold"},"text":"2","element":"a"}],[{"text":"Under the in-fill model, we write","element":"span"}],[{"style":{"width":"48%"},"width":859,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-0.png","element":"img"}],[{"text":"We first state 3 lemmas that will be proved in turn.","element":"span"}],[{"id":"id-60","style":{"fontWeight":"bold"},"text":"Lemma 3. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under the assumption of Proposition ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"2, ","element":"a"},{"style":{"height":35.25},"width":548.73,"height":88.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-1.png","element":"img","alt":" E� �ˆft(x) − ft(x)�2dx → 0.","inline":true}],[{"id":"id-61","style":{"fontWeight":"bold"},"text":"Lemma 4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Under the assumptions of Proposition ","element":"span"},{"href":"#id-37","style":{"fontStyle":"italic"},"text":"2,","element":"a"}],[{"style":{"width":"47%"},"width":852,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-2.png","element":"img"}],[{"id":"id-62","style":{"height":21.4},"width":619.69,"height":53.51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-3.png","element":"img","alt":"Lemma 5. Let ˆπτt , ˆft(x), and ˆf0 ","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"be estimates such that ","element":"span"},{"style":{"height":21.4},"width":722.82,"height":53.51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-4.png","element":"img","alt":" E∥ˆπτt −πτt ∥2 → 0, E∥ ˆft(x)−ft(x)∥2 →","inline":true,"padRight":true},{"text":"0","element":"span"},{"style":{"fontStyle":"italic"},"text":", ","element":"span"},{"style":{"height":21.41},"width":1001.57,"height":53.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-5.png","element":"img","alt":" E∥ ˆf0 − f0∥2 → 0, and then E∥ �Clfdrt − Clfdrτt ∥2 → 0.","inline":true}],[{"text":"By Lemma ","element":"span"},{"href":"#id-60","text":"3 ","element":"a"},{"text":"and Lemma ","element":"span"},{"href":"#id-61","text":"4, ","element":"a"},{"text":"together with the fact that ","element":"span"},{"style":{"height":16.4},"width":38.36,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-6.png","element":"img","alt":" f0","inline":true,"padRight":true},{"text":"is known, it follows from Lemma ","element":"span"},{"href":"#id-62","text":"5 ","element":"a"},{"text":"that ","element":"span"},{"style":{"height":19.13},"width":479.48,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-7.png","element":"img","alt":" E∥ �Clfdrt − Clfdrτt ∥2 → 0.","inline":true,"padRight":true},{"text":"Since convergence in second order mean implies convergence","element":"span"}],[{"text":"in probability, we have","element":"span"}],[{"style":{"width":"17%"},"width":313,"height":59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-8.png","element":"img"}],[{"id":"id-39","style":{"fontWeight":"bold"},"text":"B.3 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Growing domain version of Proposition ","element":"span"},{"href":"#id-37","style":{"fontWeight":"bold"},"text":"2","element":"a"}],[{"text":"In the growing domain framework, Proposition ","element":"span"},{"href":"#id-37","text":"2 ","element":"a"},{"text":"takes the following form:","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 3. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Suppose:","element":"span"}],[{"style":{"width":"98%"},"width":1752,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-9.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"T","element":"span"},{"style":{"height":19.6},"width":636.05,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-10.png","element":"img","alt":", we have�|fi(x) − fj(x)|dx < ϵ.","inline":true}],[{"style":{"width":"51%"},"width":922,"height":305,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/35-11.png","element":"img"}],[{"text":"The proof follows the same line as the proof of proposition ","element":"span"},{"href":"#id-37","text":"2, ","element":"a"},{"text":"thus omitted.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"B.4 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of Lemma ","element":"span"},{"href":"#id-60","style":{"fontWeight":"bold"},"text":"3","element":"a"}],[{"text":"We first compute ","element":"span"},{"style":{"height":21.41},"width":260.4,"height":53.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-0.png","element":"img","alt":" E ˆft(x) − ft(x","inline":true},{"text":"). Note that ","element":"span"},{"style":{"height":19.6},"width":893.74,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-1.png","element":"img","alt":" EKhx(Xj − x) =�K(z)fj(x − hxz)dz. Using","inline":true}],[{"text":"Taylor expansion, we have","element":"span"}],[{"style":{"width":"58%"},"width":1035,"height":89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-2.png","element":"img"}],[{"text":"It follows that","element":"span"}],[{"style":{"width":"76%"},"width":1364,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-3.png","element":"img"}],[{"text":"Let ","element":"span"},{"style":{"height":32},"width":1516.65,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-4.png","element":"img","alt":" A = �t−1j=t−d+1�Kht(1 − j/t) 12h2xf′′j (x)�z2K(z)dz + Kht(1 − j/t)o(h2x)�. Then","inline":true}],[{"style":{"width":"91%"},"width":1622,"height":339,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-5.png","element":"img"}],[{"text":"To see why the last expression goes to 0, note that for any ","element":"span"},{"style":{"height":14.8},"width":109.71,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-6.png","element":"img","alt":" ϵ > 0,","inline":true,"padRight":true},{"text":"by Assumption (A1), we can","element":"span"}],[{"text":"take ","element":"span"},{"style":{"height":12.8},"width":20,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-7.png","element":"img","alt":" δ","inline":true,"padRight":true},{"text":"such that for all ","element":"span"},{"style":{"height":19.6},"width":830.11,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-8.png","element":"img","alt":" i > (1 − δ)t,�|fi(x) − ft(x)|dx < ϵ. Hence,","inline":true}],[{"style":{"width":"99%"},"width":1778,"height":444,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-9.png","element":"img"}],[{"text":"Note that ","element":"span"},{"style":{"height":15.02},"width":99.65,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-10.png","element":"img","alt":" ht →","inline":true,"padRight":true},{"text":"0, we conclude that ","element":"span"},{"style":{"height":32},"width":1062.24,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-11.png","element":"img","alt":"�⌊(1−δ)t⌋j=t−d+1 Kht(1 − j/t) = O�� 1δ Kht(x)dx�→ 0. Also,","inline":true,"padRight":true},{"text":"since ","element":"span"},{"style":{"height":24},"width":1387.5,"height":60.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/36-12.png","element":"img","alt":" dht → ∞ as t → ∞, we have �t−1j=t−d+1 Kht(1 − j/t) ≥ c′h−1t for some c′.","inline":true}],[{"style":{"width":"24%"},"width":431,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-0.png","element":"img"}],[{"text":"Thus","element":"span"}],[{"style":{"width":"70%"},"width":1262,"height":243,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-1.png","element":"img"}],[{"text":"It follows from the boundedness of ","element":"span"},{"style":{"height":21.41},"width":353.14,"height":53.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-2.png","element":"img","alt":" E ˆft and ft(x) that","inline":true}],[{"id":"id-63","style":{"width":"64%"},"width":1142,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-3.png","element":"img"}],[{"text":"Next we compute ","element":"span"},{"style":{"height":32},"width":234.46,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-4.png","element":"img","alt":" Var�ˆft(x)�:","inline":true}],[{"style":{"width":"72%"},"width":1292,"height":414,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-5.png","element":"img"}],[{"text":"Some additional calculations give","element":"span"}],[{"style":{"width":"65%"},"width":1161,"height":525,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-6.png","element":"img"}],[{"text":"Therefore, by assumption (A3) and (A4),","element":"span"}],[{"id":"id-64","style":{"width":"69%"},"width":1229,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-7.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":22.55},"width":1075.06,"height":56.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-8.png","element":"img","alt":" E�{ ˆft(x)−ft(x)}2 =�{E ˆft(x)−ft(x)}2 +Var{ ˆft(x)}dx,","inline":true,"padRight":true},{"href":"#id-63","text":"(B.2) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-64","text":"(B.3) ","element":"a"},{"text":"together imply that ","element":"span"},{"style":{"height":22.55},"width":471.98,"height":56.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/37-9.png","element":"img","alt":" E�{ ˆft(x) − ft(x)}2 → 0.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"B.5 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of lemma ","element":"span"},{"href":"#id-61","style":{"fontWeight":"bold"},"text":"4","element":"a"}],[{"text":"Define ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":17.6},"width":243.31,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-0.png","element":"img","alt":"P(Pt > τ) :=","inline":true}],[{"id":"id-66","style":{"width":"77%"},"width":1380,"height":143,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-1.png","element":"img"}],[{"text":"We first rewrite the term","element":"span"}],[{"id":"id-67","style":{"width":"99%"},"width":1775,"height":255,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-2.png","element":"img"}],[{"style":{"height":17.6},"width":197.15,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-3.png","element":"img","alt":"P(x) > τ}","inline":true,"padRight":true},{"text":"use the definition of ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":16.4},"width":201.04,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-4.png","element":"img","alt":"ft we have","inline":true}],[{"style":{"width":"83%"},"width":1483,"height":138,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-5.png","element":"img"}],[{"text":"To show the lemma, it is sufficient to show","element":"span"}],[{"id":"id-65","style":{"width":"74%"},"width":1332,"height":107,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-6.png","element":"img"}],[{"text":"To see why ","element":"span"},{"href":"#id-65","text":"(B.6) ","element":"a"},{"text":"implies ","element":"span"},{"href":"#id-66","text":"(B.4)","element":"a"},{"text":", note that ","element":"span"},{"href":"#id-65","text":"(B.6) ","element":"a"},{"text":"implies","element":"span"}],[{"style":{"width":"97%"},"width":1739,"height":180,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-7.png","element":"img"}],[{"text":"Next note that","element":"span"}],[{"style":{"width":"97%"},"width":1734,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/38-8.png","element":"img"}],[{"style":{"width":"86%"},"width":1534,"height":358,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-0.png","element":"img"}],[{"text":"By (A4), we have","element":"span"},{"style":{"height":35.25},"width":405.16,"height":88.13,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-1.png","element":"img","alt":"�� 10 Kht(x)dx�2 ≥ c","inline":true,"padRight":true},{"text":"for some constant ","element":"span"},{"style":{"height":16},"width":617.54,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-2.png","element":"img","alt":" c > 0. Now tht → ∞, implies","inline":true}],[{"style":{"width":"41%"},"width":746,"height":99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-3.png","element":"img"}],[{"text":"By Chebyshev’s inequality,","element":"span"}],[{"style":{"width":"86%"},"width":1542,"height":128,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-4.png","element":"img"}],[{"text":"Combining ","element":"span"},{"href":"#id-65","text":"(B.7) ","element":"a"},{"text":", ","element":"span"},{"href":"#id-67","text":"(B.5)","element":"a"},{"text":", (A1) and (A2),","element":"span"}],[{"style":{"width":"67%"},"width":1194,"height":127,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-5.png","element":"img"}],[{"text":"Therefore ","element":"span"},{"href":"#id-66","text":"(B.4) ","element":"a"},{"text":"follows.","element":"span"}],[{"text":"We now show ","element":"span"},{"href":"#id-65","text":"(B.6)","element":"a"},{"text":". Let ","element":"span"},{"style":{"height":17.6},"width":297.41,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-6.png","element":"img","alt":" ϵ = √hx. Write","inline":true}],[{"style":{"width":"103%"},"width":1835,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-7.png","element":"img"}],[{"text":"Use the normal tail bound,","element":"span"}],[{"style":{"width":"62%"},"width":1113,"height":197,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/39-8.png","element":"img"}],[{"text":"Define ","element":"span"},{"style":{"height":18.22},"width":554.93,"height":45.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-0.png","element":"img","alt":" Aτ = {xj : P(xj) > τ}, let fj","inline":true,"padRight":true},{"text":"be the density function for ","element":"span"},{"style":{"height":17.02},"width":51.15,"height":42.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-1.png","element":"img","alt":" Xj","inline":true},{"text":". Note that","element":"span"}],[{"style":{"width":"86%"},"width":1546,"height":477,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-2.png","element":"img"}],[{"text":"Hence ","element":"span"},{"href":"#id-65","text":"(B.6) ","element":"a"},{"text":"is proved. The lemma follows.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"B.6 ","element":"span"},{"style":{"fontWeight":"bold"},"text":"Proof of lemma ","element":"span"},{"href":"#id-62","style":{"fontWeight":"bold"},"text":"5","element":"a"}],[{"text":"Note that ","element":"span"},{"style":{"height":17.6},"width":77.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-3.png","element":"img","alt":" ft(x","inline":true},{"text":") is continuous and positive on the real line, then there exists ","element":"span"},{"style":{"height":17.6},"width":289.51,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-4.png","element":"img","alt":" K1 = [−M, M]","inline":true,"padRight":true},{"text":"such that ","element":"span"},{"style":{"height":17.88},"width":524.15,"height":44.69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-5.png","element":"img","alt":" P(x ∈ Kc1) → 0 as M → ∞.","inline":true}],[{"style":{"width":"75%"},"width":1353,"height":213,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-6.png","element":"img"}],[{"text":"we claim that ","element":"span"},{"style":{"height":21},"width":164.04,"height":52.51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-7.png","element":"img","alt":" ft and ˆft","inline":true,"padRight":true},{"text":"are bounded below by a positive number for large ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t ","element":"span"},{"text":"except for an event that has a low probability. Similar arguments can be applied to the upper bound of ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":16.4},"width":180.22,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-8.png","element":"img","alt":"ft and ft,","inline":true,"padRight":true},{"text":"as well as the cases for ","element":"span"},{"style":{"height":21.01},"width":170.43,"height":52.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-9.png","element":"img","alt":" f0 and ˆf0","inline":true},{"text":". Therefore, we conclude that ","element":"span"},{"style":{"height":21.01},"width":317.23,"height":52.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-10.png","element":"img","alt":" f0, ˆf0 , ft, and ˆft","inline":true,"padRight":true},{"text":"are all bounded in the interval [","element":"span"},{"style":{"height":17.6},"width":640.41,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-11.png","element":"img","alt":"la, lb], 0 < la < lb < ∞ for large t","inline":true,"padRight":true},{"text":"except for an event ","element":"span"},{"style":{"height":15.42},"width":48.73,"height":38.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-12.png","element":"img","alt":" Aϵ","inline":true,"padRight":true},{"text":"that has probability tends to 0. Hence 0 ","element":"span"},{"style":{"height":24.39},"width":1380.38,"height":60.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-13.png","element":"img","alt":" < la < infz∈Aϵ min{f0, ˆf0, ft, ˆft} < supz∈Acϵ max{f0, ˆf0, ft, ˆft} < lb < ∞.","inline":true}],[{"text":"Next note that","element":"span"}],[{"style":{"width":"80%"},"width":1430,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-14.png","element":"img"}],[{"text":"we conclude that","element":"span"}],[{"style":{"width":"77%"},"width":1387,"height":88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/40-15.png","element":"img"}],[{"style":{"width":"93%"},"width":1659,"height":216,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-0.png","element":"img"}],[{"text":"According to the assumptions, we further have that for a given ","element":"span"},{"style":{"height":10.4},"width":67.51,"height":26,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-1.png","element":"img","alt":" ϵ >","inline":true,"padRight":true},{"text":"0, there exists ","element":"span"},{"style":{"height":15.13},"width":162.83,"height":37.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-2.png","element":"img","alt":" M ∈ Z+","inline":true,"padRight":true},{"text":"such that we can find ","element":"span"},{"style":{"height":17.6},"width":372.3,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-3.png","element":"img","alt":" Aϵ, P(Aϵ) < ϵ/(4L","inline":true},{"text":"), and at the same time ","element":"span"},{"style":{"height":19.13},"width":464.66,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-4.png","element":"img","alt":" E∥ˆπτt − πτt ∥2 < ϵ/(4c1),","inline":true},{"style":{"height":21.41},"width":1304.04,"height":53.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-5.png","element":"img","alt":"E∥ ˆft − ft∥2 < ϵ/(4c2), and E∥ ˆf0 − f0∥2 < ϵ/(4c3) for all t ≥ M.","inline":true,"padRight":true},{"text":"Consequently, we have ","element":"span"},{"style":{"height":23.39},"width":660.17,"height":58.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-6.png","element":"img","alt":"E∥ �Clfdrτt − Clfdrτt ∥2 < ϵ for t ≥ M","inline":true},{"text":", and the desired result follows.","element":"span"}]]},{"heading":"C Optimality of the Clfdr rule in simultaneous testing","paragraphs":[[{"text":"The optimality of the Clfdr rule in simultaneous testing is summarized in the following proposition. The idea in the proof essentially follows that in ","element":"span"},{"href":"#id-14","referenceIndex":6,"text":"Cai et al. ","element":"a"},{"href":"#id-14","referenceIndex":6,"text":"(2019)","element":"a"},{"text":". We provide it here for completeness.","element":"span"}],[{"style":{"fontWeight":"bold"},"text":"Proposition 4. ","element":"span"},{"style":{"fontStyle":"italic"},"text":"Consider a class of decision rules ","element":"span"},{"style":{"height":17.6},"width":787.15,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-7.png","element":"img","alt":" δδδ(γ) = {I(CLfdri < γ) : 1 ≤ i ≤ m} for","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"simultaneous testing of hypotheses ","element":"span"},{"style":{"height":18.4},"width":307.62,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-8.png","element":"img","alt":" {Hi : i ∈ Nd(t)}","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"in the neighborhood of ","element":"span"},{"style":{"height":17.6},"width":349.48,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-9.png","element":"img","alt":" t. Denote QOR(γ)","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"the marginal FDR of ","element":"span"},{"style":{"height":17.6},"width":389.26,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-10.png","element":"img","alt":" δδδ(γ). If α < QOR(1)","inline":true},{"style":{"fontStyle":"italic"},"text":", then the oracle threshold ","element":"span"},{"style":{"height":17.6},"width":487.14,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-11.png","element":"img","alt":" γOR := sup{γ : QOR(γ) ≤","inline":true},{"style":{"height":17.6},"width":50.08,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-12.png","element":"img","alt":"α}","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"exists and is unique. Define the oracle rule ","element":"span"},{"style":{"height":17.6},"width":867.05,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-13.png","element":"img","alt":" δδδOR = {I(CLfdri ≤ γOR) : 1 ≤ i ≤ m}. Then","inline":true}],[{"style":{"height":15.59},"width":75.64,"height":38.97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-14.png","element":"img","alt":"δδδOR","inline":true,"padRight":true},{"style":{"fontStyle":"italic"},"text":"is optimal for simultaneous testing in the sense that","element":"span"}],[{"style":{"width":"84%"},"width":1499,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-15.png","element":"img"}],[{"style":{"fontStyle":"italic"},"text":"Proof. ","element":"span"},{"text":"The proof has two parts. In (a), we establish two properties of the testing rule that thresholds the Clfdr at an arbitrary ","element":"span"},{"style":{"height":17.6},"width":568.78,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-16.png","element":"img","alt":" γ, {I(Clfdri < γ) : 1 ≤ i ≤ m}","inline":true},{"text":". We show that it produces mFDR ","element":"span"},{"style":{"height":16},"width":252.17,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-17.png","element":"img","alt":" < γ for all γ","inline":true,"padRight":true},{"text":"and that its mFDR is monotonic in ","element":"span"},{"style":{"fontStyle":"italic"},"text":"t","element":"span"},{"text":". In (b) we show that when the threshold is ","element":"span"},{"style":{"height":11.6},"width":75.21,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-18.png","element":"img","alt":" γOR","inline":true},{"text":", the testing rule, ","element":"span"},{"style":{"height":15.9},"width":75.41,"height":39.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-19.png","element":"img","alt":" δOR","inline":true},{"text":", exactly attains the mFDR level and is optimal amongst all valid testing procedures controls mFDR at level ","element":"span"},{"style":{"height":8.4},"width":40.08,"height":21,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-20.png","element":"img","alt":" α.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Part(a). ","element":"span"},{"text":"For the testing rule ","element":"span"},{"style":{"height":18.62},"width":890.14,"height":46.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/41-21.png","element":"img","alt":" {I(Clfdri < γ) : 1 ≤ i ≤ m}, let QOR(γ) = αγ","inline":true},{"text":". We first show","element":"span"}],[{"id":"id-68","text":"that ","element":"span"},{"style":{"height":15.02},"width":131.44,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-0.png","element":"img","alt":" αγ < γ","inline":true},{"text":". Since Clfdr","element":"span"},{"style":{"height":17.6},"width":523.45,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-1.png","element":"img","alt":"i = P(θi = 0|Xi = xi), then","inline":true}],[{"style":{"width":"87%"},"width":1564,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-2.png","element":"img"}],[{"text":"where notation ","element":"span"},{"style":{"fontStyle":"italic"},"text":"E ","element":"span"},{"text":"is the expected value taken over (","element":"span"},{"style":{"height":15.6},"width":88.14,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-3.png","element":"img","alt":"X, θ","inline":true},{"text":"), notation ","element":"span"},{"style":{"height":14.74},"width":64.21,"height":36.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-4.png","element":"img","alt":" EX","inline":true,"padRight":true},{"text":"is the expectation taken over the distribution of (","element":"span"},{"style":{"height":19.95},"width":272.6,"height":49.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-5.png","element":"img","alt":"X), and Eθ|X","inline":true,"padRight":true},{"text":"is the expectation taken over ","element":"span"},{"style":{"height":12},"width":25,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-6.png","element":"img","alt":" θ","inline":true},{"text":", holding (","element":"span"},{"style":{"fontStyle":"italic","fontWeight":"bold"},"text":"X","element":"span"},{"text":")","element":"span"}],[{"id":"id-69","text":"fixed. We use ","element":"span"},{"href":"#id-68","text":"(C.8) ","element":"a"},{"text":"in the definition of ","element":"span"},{"style":{"height":17.6},"width":273.89,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-7.png","element":"img","alt":" QOR(γ) to get","inline":true}],[{"style":{"width":"72%"},"width":1282,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-8.png","element":"img"}],[{"text":"The equality above implies that ","element":"span"},{"style":{"height":15.02},"width":140,"height":37.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-9.png","element":"img","alt":" αγ < γ","inline":true},{"text":". To see this, consider that all potentially non–zero terms arise when Clfdr","element":"span"},{"style":{"height":14.8},"width":118.58,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-10.png","element":"img","alt":"i ≤ γ","inline":true},{"text":", and when this is the case, either (i) ","element":"span"},{"style":{"height":17.6},"width":420.51,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-11.png","element":"img","alt":" α ≤ Clfdri < γ, (ii)","inline":true,"padRight":true},{"text":"Clfdr","element":"span"},{"style":{"height":14.8},"width":181.97,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-12.png","element":"img","alt":"i ≤ α < γ","inline":true},{"text":", or (iii) Clfdr","element":"span"},{"style":{"height":14.8},"width":182.9,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-13.png","element":"img","alt":"i < γ ≤ α","inline":true},{"text":". Notice (i) produces zero or positive terms on the LHS of ","element":"span"},{"href":"#id-69","text":"(C.9)","element":"a"},{"text":", (ii) produces zero or negative terms, and (iii) produces negative terms. If ","element":"span"},{"style":{"height":16.62},"width":151.4,"height":41.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-14.png","element":"img","alt":" αγ ≥ γ,","inline":true,"padRight":true},{"text":"then only (iii) is possible, which contradicts the RHS. Thus, the testing rule is valid.","element":"span"}],[{"text":"Next, we show that ","element":"span"},{"style":{"height":17.6},"width":129.81,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-15.png","element":"img","alt":" QOR(γ","inline":true},{"text":") is nondecreasing in ","element":"span"},{"style":{"height":11.6},"width":24,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-16.png","element":"img","alt":" γ","inline":true},{"text":". That is, letting ","element":"span"},{"style":{"height":18.22},"width":431.52,"height":45.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-17.png","element":"img","alt":" Q(γj) = αj, if γ1 < γ2,","inline":true,"padRight":true},{"text":"then ","element":"span"},{"style":{"height":16.62},"width":193.54,"height":41.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-18.png","element":"img","alt":" αγ1 ≤ αγ2","inline":true},{"text":". We argue by contradiction. Suppose that ","element":"span"},{"style":{"height":16},"width":414.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-19.png","element":"img","alt":" γ1 < γ2 but α1 > α2","inline":true},{"text":". First, it cannot be that ","element":"span"},{"style":{"height":17.6},"width":243.54,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-20.png","element":"img","alt":" I(Clfdri < γ2","inline":true},{"text":") = 0 for all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"i","element":"span"},{"text":", because that implies ","element":"span"},{"style":{"height":10.62},"width":149.94,"height":26.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-21.png","element":"img","alt":" α1 = α2","inline":true,"padRight":true},{"text":"(both equal 0). Next,","element":"span"}],[{"text":"since ","element":"span"},{"style":{"height":13.2},"width":153.22,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-22.png","element":"img","alt":" γ1 < γ2,","inline":true}],[{"style":{"width":"97%"},"width":1729,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-23.png","element":"img"}],[{"text":"and rewrite (Clfdr","element":"span"},{"style":{"height":17.6},"width":1435.92,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-24.png","element":"img","alt":"i −α2)I(Clfdri < γ1) = (Clfdri −α1)I(Clfdri < γ1)+(α1 −α2)I(Clfdri < γ1).","inline":true}],[{"text":"If ","element":"span"},{"style":{"height":15.6},"width":263.14,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-25.png","element":"img","alt":" α2 < α1, then","inline":true}],[{"style":{"width":"96%"},"width":1723,"height":143,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-26.png","element":"img"}],[{"text":"It follows that","element":"span"}],[{"id":"id-70","style":{"width":"43%"},"width":766,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/42-27.png","element":"img"}],[{"text":"To see this, consider the expectation of the sum over ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"tests for the three RHS terms of ","element":"span"},{"href":"#id-70","text":"(C.10)","element":"a"},{"text":", which we reference as (i), (ii), and (iii) respectively. First, (i) is zero because of ","element":"span"},{"href":"#id-69","text":"(C.9)","element":"a"},{"text":". Then for each Clfdr","element":"span"},{"style":{"height":13.2},"width":116.14,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-0.png","element":"img","alt":"i < γ2","inline":true},{"text":", either (ii) is positive because ","element":"span"},{"style":{"height":12.22},"width":154.78,"height":30.55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-1.png","element":"img","alt":" α2 < α1","inline":true},{"text":", or (iii) is positive because","element":"span"}],[{"style":{"fontStyle":"italic"},"text":"α","element":"span"},{"style":{"height":13.2},"width":130.63,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-2.png","element":"img","alt":"1 < γ1.","inline":true}],[{"text":"However, ","element":"span"},{"href":"#id-69","text":"(C.9) ","element":"a"},{"text":"establishes that ","element":"span"},{"style":{"height":18.8},"width":691.44,"height":47.01,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-3.png","element":"img","alt":" E {�mi=1(Clfdri − α2)I(Clfdri < γ2)}","inline":true,"padRight":true},{"text":"= 0, leading to a ","element":"span"},{"text":"contradiction. Hence, ","element":"span"},{"style":{"height":12.22},"width":163.87,"height":30.54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-4.png","element":"img","alt":" α1 < α2.","inline":true}],[{"style":{"fontWeight":"bold"},"text":"Part(b). ","element":"span"},{"text":"The oracle threshold is defined as ","element":"span"},{"style":{"height":20.19},"width":720.18,"height":50.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-5.png","element":"img","alt":" γOR = supγ{γ ∈ (0, 1) : QOR(γ) ≤ α}","inline":true},{"text":". First, let ¯","element":"span"},{"style":{"height":16},"width":178.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-6.png","element":"img","alt":"α = QOR","inline":true},{"text":"(1), which represents the largest mFDR level that the oracle testing procedure can be. By part (a), ","element":"span"},{"style":{"height":17.6},"width":181.06,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-7.png","element":"img","alt":" QOR(γOR","inline":true},{"text":") is non–decreasing. Via the squeeze theorem, for all ","element":"span"},{"style":{"height":15.6},"width":220.91,"height":39,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-8.png","element":"img","alt":" α < ¯α, this","inline":true,"padRight":true},{"text":"implies that ","element":"span"},{"style":{"height":17.6},"width":297.97,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-9.png","element":"img","alt":" QOR(γOR) = α.","inline":true}],[{"text":"Next, consider the power of ","element":"span"},{"style":{"height":17.6},"width":707.23,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-10.png","element":"img","alt":" δOR = {I(Clfdri < γOR) : 1 ≤ i ≤ m}","inline":true,"padRight":true},{"text":"compared to that of an arbitrary decision rule ","element":"span"},{"style":{"height":19.13},"width":312.01,"height":47.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-11.png","element":"img","alt":" d∗ = (d1∗, . . . , dm∗ ","inline":true,"padRight":true},{"text":") such that ","element":"span"},{"style":{"height":17.6},"width":308.9,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-12.png","element":"img","alt":" mFDR(d∗) ≤ α","inline":true},{"text":". Using the previous result","element":"span"}],[{"text":"from part(a), it follows that","element":"span"}],[{"style":{"width":"74%"},"width":1322,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-13.png","element":"img"}],[{"text":"Take the difference of the two expressions to obtain","element":"span"}],[{"id":"id-72","style":{"width":"69%"},"width":1232,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-14.png","element":"img"}],[{"text":"Next apply a transformation ","element":"span"},{"style":{"height":19.91},"width":679.77,"height":49.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-15.png","element":"img","alt":" f(x) = (x − α)/(1 − x) to each δiOR","inline":true},{"text":". Note that because ","element":"span"},{"style":{"height":17.6},"width":142.69,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-16.png","element":"img","alt":" f′(x) =","inline":true,"padRight":true},{"text":"(1","element":"span"},{"style":{"height":19.13},"width":415.47,"height":47.83,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-17.png","element":"img","alt":"−α)/(1−x)2 > 0, f(x","inline":true},{"text":") is monotonically increasing. Then order is preserved: if Clfdr","element":"span"},{"style":{"height":13.2},"width":146.95,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-18.png","element":"img","alt":"i < γOR","inline":true,"padRight":true},{"text":"then ","element":"span"},{"style":{"height":17.6},"width":358.22,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-19.png","element":"img","alt":" f(Clfdri) < f(γOR","inline":true},{"text":") and likewise for Clfdr","element":"span"},{"style":{"height":19.79},"width":156.89,"height":49.47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-20.png","element":"img","alt":"ii > γOR","inline":true},{"text":". This means we can rewrite ","element":"span"},{"style":{"height":19.91},"width":124.83,"height":49.77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-21.png","element":"img","alt":" δiOR =","inline":true},{"style":{"height":17.6},"width":1372.23,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-22.png","element":"img","alt":"I [{(Clfdri − α)/(1 − Clfdri)} < γOR], where γOR = (γOR − α)/(1 − γOR","inline":true},{"text":"). It will be useful to note that, from part (a), we have ","element":"span"},{"style":{"height":15.1},"width":212.18,"height":37.75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-23.png","element":"img","alt":" α < λOR <","inline":true,"padRight":true},{"text":"1, which implies that ","element":"span"},{"style":{"height":15.2},"width":168.94,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-24.png","element":"img","alt":" γOR > 0.","inline":true}],[{"id":"id-71","text":"Then,","element":"span"}],[{"style":{"width":"99%"},"width":1776,"height":222,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/43-25.png","element":"img"}],[{"style":{"width":"99%"},"width":1779,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-0.png","element":"img"}],[{"style":{"height":17.6},"width":646.95,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-1.png","element":"img","alt":"{(Clfdri − α)/(1 − Clfdri)} ≥ γOR","inline":true},{"text":". For both cases,","element":"span"}],[{"style":{"width":"51%"},"width":917,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-2.png","element":"img"}],[{"text":"Summing over all ","element":"span"},{"style":{"fontStyle":"italic"},"text":"m ","element":"span"},{"text":"terms and taking the expectation yields ","element":"span"},{"href":"#id-71","text":"(C.12)","element":"a"},{"text":".","element":"span"}],[{"text":"Combine ","element":"span"},{"href":"#id-72","text":"(C.11) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-71","text":"(C.12) ","element":"a"},{"text":"to obtain","element":"span"}],[{"style":{"width":"80%"},"width":1429,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-3.png","element":"img"}],[{"text":"Finally, since ","element":"span"},{"style":{"height":13.2},"width":123.06,"height":33,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-4.png","element":"img","alt":" γOR >","inline":true,"padRight":true},{"text":"0, it follows that ","element":"span"},{"style":{"height":20.8},"width":666.09,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-5.png","element":"img","alt":" E��mi=1(δiOR − di∗)(Clfdri − α)�>","inline":true,"padRight":true},{"text":"0. After distributing the (","element":"span"},{"style":{"height":19.91},"width":169.72,"height":49.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-6.png","element":"img","alt":"δiOR − di∗","inline":true},{"text":") term and separating the expectations for the sums of the two decision rules, ","element":"span"},{"text":"we apply the definition of ","element":"span"},{"style":{"height":20.8},"width":683.92,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-7.png","element":"img","alt":" ETP(δ) = E��mi=1 δi (Clfdri − α)�","inline":true},{"text":"to conclude that ","element":"span"},{"style":{"height":17.6},"width":257.56,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-8.png","element":"img","alt":" ETP(δOR) ≥","inline":true}],[{"style":{"fontStyle":"italic"},"text":"ETP","element":"span"},{"text":"(","element":"span"},{"style":{"height":17.6},"width":74.51,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2003.00113/images/44-9.png","element":"img","alt":"d∗).","inline":true}]]}],"_version":"3.3.2"},"paperNode":"$28:props:children:props:children:0:props:product"}]]