36:[["$","audio",null,{"id":"tts"}],["$","$L3b",null,{"paperID":"2002.01800","publisher":"arxiv","paperJSON":{"title":"Sharpe Ratio Analysis in High Dimensions: Residual-Based Nodewise Regression in Factor Models","paperID":"2002.01800","avgLineHeight":17.88,"imgScale":4,"sections":[{"heading":"Abstract","paragraphs":[[{"style":{"width":"89%"},"width":1677,"height":690,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/0-0.png","element":"img"}]]},{"heading":"1 Introduction","paragraphs":[[{"text":"One of the key issues in finance, especially in empirical asset pricing, is the trade-off between the returns and the risk of a portfolio. One important way to quantify such trade-off is via the Sharpe Ratio.","element":"span"}],[{"text":"We contribute to this literature by studying the case when the number of assets, namely ","element":"span"},{"text":"p","element":"span"},{"text":", grows with the time span of the portfolio, ","element":"span"},{"text":"n","element":"span"},{"text":". To obtain the Sharpe Ratio, and also its maximum, we make use of the asset return’s precision matrix. In order to get an estimate of the precision matrix for asset returns in a large portfolio, we propose that an approximate factor model governs the dynamics of excess returns. Hence, asset returns (excess returns over a risk-free asset) can be explained by an increasing but known number of factors with unknown idiosyncratic errors entering the linear relation in an additive way. One major difference with the previous literature is that, in our case, the precision matrix has to be sparse. Therefore, this is a hybrid method that combines factor models with high-dimensional econometrics.","element":"span"}],[{"text":"The first step in getting the Sharpe Ratio and its maximum involves the estimation of the precision matrix of the idiosyncratic terms (errors). Estimating the such precision matrix is not an easy task, and the simple nodewise regression idea as in ","element":"span"},{"href":"#id-0","text":"Meinshausen and B¨uhlmann ","element":"a"},{"href":"#id-0","text":"(2006) ","element":"a"},{"text":"is not feasible. Therefore, we provide a simple, feasible residual-based nodewise regression method to estimate the precision matrix of errors in a factor model setup even if ","element":"span"},{"text":"p > n","element":"span"},{"text":". This feasible residual-based nodewise regression is a new idea, and it is shown to be consistently estimating the precision matrix of the errors which is our first contribution. Next, we obtain consistent estimators to the precision matrix of asset returns, even if ","element":"span"},{"text":"p > n","element":"span"},{"text":", which is our second technical contribution. Although, we focus on factor models in asset pricing, our methodology can be applied to any situation where the interest is the precision matrix of the errors of a linear regression model.","element":"span"}],[{"text":"Next, by using the precision matrix estimator for returns we can link our technical analysis to the financial econometrics literature. We make three contributions towards Sharpe Ratio analysis. First, we consider the Sharpe Ratios in the global minimum-variance portfolio and Markowitz mean-variance portfolio. We develop consistent estimators even if ","element":"span"},{"text":"p > n","element":"span"},{"text":", and both dimensions diverge. Second, we consider the rate of convergence and consistency of the maximum Sharpe Ratio when the portfolio weights are normalized to one. Recently, ","element":"span"},{"href":"#id-1","text":"Maller and Turkington ","element":"a"},{"href":"#id-1","text":"(2002)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-2","text":"Maller et al. ","element":"a"},{"href":"#id-2","text":"(2016) ","element":"a"},{"text":"analyze the limit with a fixed number of assets and extend that approach to a large number of assets, but a number less than the time span of the portfolio. Their papers make a key discovery: in the case of weight constraints (summing to one), the formula for the maximum Sharpe Ratio depends on a technical term, unlike the unconstrained maximum Sharpe Ratio case. Practitioners could obtain the minimum Sharpe Ratio instead of the maximum if they are using the unconstrained formula. Our paper extends their paper by analyzing two issues. First, the case if ","element":"span"},{"text":"p > n","element":"span"},{"text":", with both quantities growing to infinity, and second, by handling the uncertainty created by this technical term, which we can estimate and use to obtain a new constrained and consistent maximum Sharpe Ratio. The assumption of constant loadings in the factor model is clearly a constraint for portfolio analysis over longer horizons. However, the setup where ","element":"span"},{"text":"p > n ","element":"span"},{"text":"provides the statistical tools for us to analyze portfolios in short horizons and small samples as high-dimensional asymptotics can be seen as a good approximation for situations when ","element":"span"},{"text":"n ","element":"span"},{"text":"is small but ","element":"span"},{"text":"p ","element":"span"},{"text":"is large compared to ","element":"span"},{"text":"n","element":"span"},{"text":". Third, only in the case of ","element":"span"},{"text":"p << n","element":"span"},{"text":", we obtain the consistency of our nodewise-based maximum-out-of-sample Sharpe Ratio estimate, with both ","element":"span"},{"text":"p, n ","element":"span"},{"text":"growing to infinity and ","element":"span"},{"style":{"height":16},"width":114.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/1-0.png","element":"img","alt":" p/n →","inline":true,"padRight":true},{"text":"0. We also provide an analysis of the Sharpe Ratio with only portfolio weights estimated in the formula. In that way, we can see the effect of estimated portfolio on getting the optimal Sharpe Ratio. Our analysis shows this is possible when ","element":"span"},{"text":"p < n ","element":"span"},{"text":"only.","element":"span"}],[{"text":"1.1 ","element":"span"},{"text":"The Sparsity of the Precision Matrix","element":"span"}],[{"text":"There are several reasons motivating the assumption of sparsity of the precision matrix of the errors from the factor model. In technical terms, this is a convenient and widely used asymptotic tool when we want to consider high dimensional problems when ","element":"span"},{"text":"p > n","element":"span"},{"text":". The sparsity assumption on the precision matrix of errors gives rise to a direct way of estimating the precision matrix for the returns via Sherman-Morrison-Woodbury formula. We solve two technical issues with this assumption. First, consistent estimation of the precision matrix of returns is possible, yielding consistent estimation of the Sharpe Ratio and it’s maximum, even in constrained case. Also, as far as we know, in the case of ","element":"span"},{"text":"p > n","element":"span"},{"text":", we do not know any other consistent estimation results for global minimum variance and Markowitz portfolios, as well as the constrained maximum Sharpe Ratio in the literature.","element":"span"}],[{"text":"The sparsity assumption on the precision matrix of the errors from a factor model can be also justified in situations of interest in the empirical finance literature. First, even though we do not assume normality of the errors here, in this particular case the conditional independence of two errors given all the other errors, is represented by a zero entry in the precision matrix of errors. This is explained in p.1436-1439 of ","element":"span"},{"href":"#id-0","text":"Meinshausen and B¨uhlmann ","element":"a"},{"href":"#id-0","text":"(2006)","element":"a"},{"text":". So, in the case of normally distributed data, sparsity can be thought as a conditional independence restriction. When the errors follow an elliptical distribution, conditional uncorrelatedness of two errors amount to a zero cell in the precision matrix as discussed in Section 2.4 of ","element":"span"},{"href":"#id-3","text":"Fan et al. ","element":"a"},{"href":"#id-3","text":"(2018)","element":"a"},{"text":". The authors claim that sparse precision matrix may be more useful when we estimate a network of stocks, by taking out common factors from returns and analyzing the conditional independence among idiosyncratic components (errors). Finally, there are a number of recent papers in the literature showing that after removing common factors, the covariance matrix of the errors is “almost” block diagonal, yielding a sparse precision matrix; see, for example, ","element":"span"},{"href":"#id-4","text":"Fan et al. ","element":"a"},{"href":"#id-4","text":"(2016) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-5","text":"Brito et al. ","element":"a"},{"href":"#id-5","text":"(2018)","element":"a"},{"text":". When the covariance matrix is block-diagonal, the precision matrix can be computed by inverting the estimated covariance matrix, which in turn can be consistently estimated by several different methods. However, even in this case, there are potential benefits of estimating the precision matrix directly as shown in our simulations and empirical exercise; see also ","element":"span"},{"href":"#id-6","text":"Senneret et al. ","element":"a"},{"href":"#id-6","text":"(2016)","element":"a"},{"text":".","element":"span"}],[{"text":"1.2 ","element":"span"},{"text":"A Brief Review of the Literature and Main Takeaways","element":"span"}],[{"text":"In terms of the literature on nodewise regression and related methods, the most relevant papers are as follows. ","element":"span"},{"href":"#id-0","text":"Meinshausen and B¨uhlmann ","element":"a"},{"href":"#id-0","text":"(2006) ","element":"a"},{"text":"establish the nodewise regression approach and provide an optimality result when data are normally distributed. ","element":"span"},{"href":"#id-7","text":"Chang et al. ","element":"a"},{"href":"#id-7","text":"(2018) ","element":"a"},{"text":"extend the nodewise regression method to time-series data and build confidence intervals for the elements in the precision matrix. However, the goal of ","element":"span"},{"href":"#id-7","text":"Chang et al. ","element":"a"},{"href":"#id-7","text":"(2018) ","element":"a"},{"text":"only centers on the elements of the precision matrix, and there is no connection to factor models. Furthermore, their results are based on the precision matrix of observed data and not on the residuals of a first-stage estimator. Finally, the authors do not consider the case of maximum Sharpe Ratio, and it is not clear if their results are directly applicable to financial applications. ","element":"span"},{"href":"#id-8","text":"Caner and Kock ","element":"a"},{"href":"#id-8","text":"(2018) ","element":"a"},{"text":"establish uniform confidence intervals in the case of high-dimensional parameters in heteroskedastic setups using nodewise regression, but, as in the previous paper, there is no connection to factor models in empirical finance. ","element":"span"},{"href":"#id-9","text":"Callot et al. ","element":"a"},{"href":"#id-9","text":"(2021) ","element":"a"},{"text":"provide the variance, the risk, and the weight estimation of a portfolio via nodewise regression. They take the nodewise regression directly from ","element":"span"},{"href":"#id-0","text":"Meinshausen and B¨uhlmann ","element":"a"},{"href":"#id-0","text":"(2006) ","element":"a"},{"text":"and apply it to returns. However, they assume that the precision matrix of returns is sparse. Hence, it is more restrictive and less realistic than the method we propose. We combine factor models with the sparsity of the precision matrix of errors. As a consequence, our method is much more connected to typical empirical asset pricing models. Furthermore, we do not impose any sparsity on the precision matrix of returns. ","element":"span"},{"href":"#id-9","text":"Callot et al. ","element":"a"},{"href":"#id-9","text":"(2021) ","element":"a"},{"text":"also has no proofs about the estimation of the Sharpe Ratio.","element":"span"}],[{"text":"In terms of recent contributions to the literature on factor models and sparse regression, we highlight ","element":"span"},{"href":"#id-10","text":"Fan et al. ","element":"a"},{"href":"#id-10","text":"(2021)","element":"a"},{"text":". The authors consider the combination of factor models and sparse regression in a very general setting. More specifically, they analyze a panel data model with a factor structure and idiosyncratic terms that are sparsely related. They also provide an inference procedure designed to test hypotheses on the entries of the covariance matrix of the residuals of pre-estimated models, including principal component regressions. ","element":"span"},{"text":"Our paper differs from theirs in several directions. ","element":"span"},{"text":"First, ","element":"span"},{"href":"#id-10","text":"Fan et al. ","element":"a"},{"href":"#id-10","text":"(2021) ","element":"a"},{"text":"considers only the covariance matrix and not the precision matrix. ","element":"span"},{"text":"Second, their approach is not based on nodewise regressions. Finally, Sharpe Ratio estimation and portfolio allocation are not considered. A seminal paper is by ","element":"span"},{"href":"#id-11","text":"Gagliardini et al. ","element":"a"},{"href":"#id-11","text":"(2016)","element":"a"},{"text":", where they analyze time-varying risk premia in large portfolios with factor models. They develop a structural model, and can tie that to factor models, and after that, they can estimate time-varying risk-premia. One of their main assumptions is that the maximum eigenvalue of covariance matrix of errors in the factor structure can diverge. Also, they assume sparsity of covariance matrix of errors and observed factors in the factor model. We also use diverging eigenvalue assumption in Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i) in our paper, as well as an increasing number of factors here, but with the assumption of sparsity on the precision matrix of errors. ","element":"span"},{"href":"#id-13","text":"Gagliardini et al. ","element":"a"},{"href":"#id-13","text":"(2019) ","element":"a"},{"text":"develop a diagnostic test for omitted factors in factor models. They rely on residuals rather than errors for their tests. As clear in their analysis, working with residuals pose major difficulties. We also face the similar difficulty in our paper. Then, ","element":"span"},{"href":"#id-14","text":"Gagliardini et al. ","element":"a"},{"href":"#id-14","text":"(2020) ","element":"a"},{"text":"analyze large conditional factor models. They analyze conditional risk premia even when the number of assets dominate the ","element":"span"},{"href":"#id-3","text":"time span ","element":"a"},{"href":"#id-3","text":"of the ","element":"a"},{"text":"portfolio.","element":"span"}],[{"text":"In a recent paper, ","element":"span"},{"href":"#id-3","text":"Fan et al. ","element":"a"},{"href":"#id-3","text":"(2018) ","element":"a"},{"text":"use sparse precision matrix estimation with hidden factors. Their approach uses a Dantzig based constrained estimator for precision matrix. The main differences are that the type of estimator depends on magnitude of coefficients in the precision matrix, with larger coefficients, and that the rate of estimation slows down considerably as seen in their equation (2.12)-result 2. Also, they assume bounded-finite ","element":"span"},{"style":{"height":13.1},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/3-0.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"matrix norm, which is restrictive. We allow diverging matrix ","element":"span"},{"style":{"height":13.1},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/3-1.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"norm. Also they do not apply their results to Sharpe Ratio analysis in high dimensions as we do.","element":"span"}],[{"text":"Recently, important contributions have been obtained in this area by using shrinkage and factor models.","element":"span"}],[{"href":"#id-15","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-15","text":"(2017) ","element":"a"},{"text":"propose a nonlinear shrinkage estimator in which small eigenvalues of the sample covariance matrix are increased and large eigenvalues are decreased by a shrinkage formula. Their main contribution is the optimal shrinkage function, which they find by minimizing a loss function. The maximum out-of-sample Sharpe Ratio is an inverse function of this loss. ","element":"span"},{"text":"Their results cover the independent and identically distributed case and when ","element":"span"},{"style":{"height":16},"width":116.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/4-0.png","element":"img","alt":" p/n →","inline":true,"padRight":true},{"text":"(0","element":"span"},{"text":", ","element":"span"},{"text":"1) ","element":"span"},{"style":{"height":10},"width":27,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/4-1.png","element":"img","alt":" ∪","inline":true,"padRight":true},{"text":"(1","element":"span"},{"style":{"height":12.4},"width":88.96,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/4-2.png","element":"img","alt":", +∞","inline":true},{"text":"). For the analysis of mean-variance efficiency, ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019) ","element":"a"},{"text":"make a novel contribution in which they take a constrained optimization, maximize returns subject to the risk of the portfolio, and show that it is equivalent to an unconstrained objective function, where they minimize a scaled return of the portfolio error by choosing optimal weights. To obtain these weights, they use lasso regression and assume a sparse number of nonzero weights of the portfolio, and they analyze ","element":"span"},{"style":{"height":16},"width":120.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/4-3.png","element":"img","alt":" p/n →","inline":true,"padRight":true},{"text":"(0","element":"span"},{"text":", ","element":"span"},{"text":"1). They show that their method maximizes the expected return of the portfolio and satisfies the risk constraint. Their paper is an important result on its own. One key paper in the literature is by ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011) ","element":"a"},{"text":"which assumes an approximate factor model, but, on the other hand, the authors assume conditional sparsity-diagonality of the covariance matrix of errors. ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011) ","element":"a"},{"text":"show for the first time how to build a precision matrix of returns in a large portfolio via factor models. Therefore, it is a key paper in the high-dimensional econometrics literature.","element":"span"}],[{"text":"Regarding other papers, Ledoit and Wolf (2003,2004) propose a linear shrinkage estimator of the covariance matrix and apply it to portfolio optimization. ","element":"span"},{"href":"#id-15","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-15","text":"(2017) ","element":"a"},{"text":"shows that nonlinear shrinkage performs better in out-of-sample forecasts. ","element":"span"},{"href":"#id-18","text":"Lai et al. ","element":"a"},{"href":"#id-18","text":"(2011)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-19","text":"Garlappi et al. ","element":"a"},{"href":"#id-19","text":"(2007) ","element":"a"},{"text":"approach the same problem from a Bayesian perspective by aiming to maximize a utility function tied to portfolio optimization. Another avenue of the literature improves the performance of the portfolios by introducing constraints on the weights. This type of literature is in the case of the global minimum-variance portfolio. Examples of works investigating this problem include ","element":"span"},{"href":"#id-20","text":"Jagannathan and Ma ","element":"a"},{"href":"#id-20","text":"(2003) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-21","text":"Fan et al. ","element":"a"},{"href":"#id-21","text":"(2012)","element":"a"},{"text":". We also see a combination of different portfolios proposed by ","element":"span"},{"href":"#id-22","text":"Kan and Zhou ","element":"a"},{"href":"#id-22","text":"(2007)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-23","text":"Tu and Zhou ","element":"a"},{"href":"#id-23","text":"(2011)","element":"a"},{"text":". Very recently, ","element":"span"},{"href":"#id-24","text":"Ding et al. ","element":"a"},{"href":"#id-24","text":"(2021) ","element":"a"},{"text":"extended factor models to assumptions that are more consistent with principal components analysis. They provide consistent estimation of the risk of the portfolio under the sparsity of covariance of errors with a fixed number of factors. ","element":"span"},{"href":"#id-25","referenceIndex":3,"text":"Barras et al. ","element":"a"},{"href":"#id-25","referenceIndex":3,"text":"(2021)","element":"a"},{"text":", ","element":"span"},{"href":"#id-26","text":"Brodie et al. ","element":"a"},{"href":"#id-26","text":"(2009)","element":"a"},{"text":", ","element":"span"},{"href":"#id-27","text":"Chamberlain and Rothschild ","element":"a"},{"href":"#id-27","text":"(1983)","element":"a"},{"text":", ","element":"span"},{"href":"#id-28","text":"DeMiguel et al. ","element":"a"},{"href":"#id-28","text":"(2009)","element":"a"},{"text":", ","element":"span"},{"href":"#id-29","text":"Fan et al. ","element":"a"},{"href":"#id-29","text":"(2015) ","element":"a"},{"text":"analyze the mutual fund industry, sparsely constructed Markowitz portfolio, arbitrage and factor models in large portfolios, sparsely constructed mean-variance portfolios, and risks of large portfolios, respectively.","element":"span"}],[{"text":"1.3 ","element":"span"},{"text":"Organization of the Paper","element":"span"}],[{"text":"This paper is organized as follows. Section ","element":"span"},{"text":"2 ","element":"span"},{"text":"considers our assumptions and feasible precision matrix estimation for errors. Section ","element":"span"},{"text":"3 ","element":"span"},{"text":"provides the feasible precision matrix estimate for asset returns. Section ","element":"span"},{"text":"4 ","element":"span"},{"text":"analyzes consistency of the Sharpe Ratio in a portfolio with large number of assets in three different scenarios. Section ","element":"span"},{"href":"#id-30","text":"5 ","element":"a"},{"text":"provides simulations that compare several methods. Section ","element":"span"},{"text":"6 ","element":"span"},{"text":"presents an out-of-sample forecasting exercise. The main proofs are in the Supplement A, common proofs used for Theorems 3-8 are in Supplement B, Supplement C contains proofs related to section 4.4, and the Supplement D has a proof of mean-variance efficiency of a large portfolio in case of out-of-sample context, and some extra simulation results.","element":"span"}],[{"text":"1.4 ","element":"span"},{"text":"Notation","element":"span"}],[{"text":"Let ","element":"span"},{"style":{"height":16},"width":320,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-0.png","element":"img","alt":" ∥ν∥l1, ∥ν∥l2, ∥ν∥∞","inline":true,"padRight":true},{"text":"be the ","element":"span"},{"style":{"height":14},"width":151.64,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-1.png","element":"img","alt":" l1, l2, l∞,","inline":true,"padRight":true},{"text":"norms of a generic vector ","element":"span"},{"style":{"height":7.6},"width":25,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-2.png","element":"img","alt":" ν","inline":true},{"text":". ","element":"span"},{"text":"Let ","element":"span"},{"style":{"height":17.39},"width":84.32,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-3.png","element":"img","alt":" ∥v∥2n","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.16},"width":212.8,"height":45.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-4.png","element":"img","alt":" n−1 �nt=1 v2t","inline":true,"padRight":true},{"text":"which ","element":"span"},{"text":"is the prediction norm for an ","element":"span"},{"style":{"height":8},"width":67.48,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-5.png","element":"img","alt":" n ×","inline":true,"padRight":true},{"text":"1 vector ","element":"span"},{"text":"v","element":"span"},{"text":". ","element":"span"},{"text":"Let Eigmin(","element":"span"},{"text":"A","element":"span"},{"text":") represents the minimum eigenvalue of a matrix ","element":"span"},{"text":"A","element":"span"},{"text":", and Eigmax(","element":"span"},{"text":"A","element":"span"},{"text":") represent the maximum eigenvalue of the matrix ","element":"span"},{"text":"A","element":"span"},{"text":". For a generic matrix ","element":"span"},{"text":"A","element":"span"},{"text":", let ","element":"span"},{"style":{"height":16},"width":351.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-6.png","element":"img","alt":"∥A∥l1, ∥A∥l∞, ∥A∥l2","inline":true},{"text":", be the ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-7.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"induced matrix norm (i.e. maximum absolute column sum norm), ","element":"span"},{"style":{"height":13.1},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-8.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"induced matrix norm (i.e. maximum absolute row sum norm), spectral matrix norm, respectively. ","element":"span"},{"style":{"height":16},"width":106.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-9.png","element":"img","alt":" ∥A∥∞","inline":true,"padRight":true},{"text":"is maximum absolute value of element of a matrix, and also a norm (but not a matrix norm). Matrix norms have the additional desirable feature of submultiplicativity property. For further information on matrix norms, see p.341 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013)","element":"a"},{"text":".","element":"span"}]]},{"heading":"2 Factor Model and Feasible Nodewise Regression","paragraphs":[[{"text":"We start with the following model for the ","element":"span"},{"text":"j","element":"span"},{"text":"th asset return (excess asset return) at time ","element":"span"},{"style":{"height":14.7},"width":90.24,"height":36.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-10.png","element":"img","alt":" t, yj,t","inline":true},{"text":", for ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-11.png","element":"img","alt":", · · · , p","inline":true},{"text":",","element":"span"}],[{"text":"and time periods ","element":"span"},{"text":"t ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":119.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-12.png","element":"img","alt":", · · · , n","inline":true},{"text":", such that","element":"span"}],[{"id":"id-32","style":{"width":"58%"},"width":1086,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-13.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.5},"width":33.64,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-14.png","element":"img","alt":" bj","inline":true,"padRight":true},{"text":"is a ","element":"span"},{"style":{"height":10.8},"width":74.2,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-15.png","element":"img","alt":" K ×","inline":true,"padRight":true},{"text":"1 vector of factor loadings, ","element":"span"},{"style":{"height":14.64},"width":39.36,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-16.png","element":"img","alt":" f t","inline":true,"padRight":true},{"text":"is the ","element":"span"},{"style":{"height":10.8},"width":74.2,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-17.png","element":"img","alt":" K ×","inline":true,"padRight":true},{"text":"1 vector of common factors to all assets’ returns, and ","element":"span"},{"style":{"height":11.5},"width":57.6,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-18.png","element":"img","alt":" uj,t","inline":true,"padRight":true},{"text":"is the scalar error (idiosyncratic) term for asset return ","element":"span"},{"text":"j ","element":"span"},{"text":"at time ","element":"span"},{"text":"t","element":"span"},{"text":". All the factors are assumed to be observed. This model is used by ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":". From this point on, when asset return is mentioned, it should be understood as excess asset return.","element":"span"}],[{"text":"For the ","element":"span"},{"text":"j","element":"span"},{"text":"th asset return we can rewrite ","element":"span"},{"href":"#id-32","text":"(1) ","element":"a"},{"text":"in the vector form, for ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-19.png","element":"img","alt":", · · · , p","inline":true},{"text":":","element":"span"}],[{"id":"id-34","style":{"width":"57%"},"width":1075,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-20.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"X ","element":"span"},{"text":"= (","element":"span"},{"style":{"height":14.83},"width":187.52,"height":37.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-21.png","element":"img","alt":"f 1, · · · , f n","inline":true},{"text":") is a ","element":"span"},{"style":{"height":10.8},"width":108.48,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-22.png","element":"img","alt":" K × n","inline":true,"padRight":true},{"text":"matrix, and ","element":"span"},{"style":{"height":13.63},"width":37.96,"height":34.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-23.png","element":"img","alt":" yj","inline":true,"padRight":true},{"text":"= (","element":"span"},{"style":{"height":16.7},"width":248.72,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-24.png","element":"img","alt":"yj,1, · · · , yj,n)′","inline":true,"padRight":true},{"text":"is a ","element":"span"},{"style":{"height":8},"width":63.16,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-25.png","element":"img","alt":" n ×","inline":true,"padRight":true},{"text":"1 vector of returns of the ","element":"span"},{"text":"j","element":"span"},{"text":"th asset. We can also express the same relation in a matrix form as follows:","element":"span"}],[{"id":"id-44","style":{"width":"56%"},"width":1064,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-26.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"Y ","element":"span"},{"text":"is a ","element":"span"},{"style":{"height":11.2},"width":93.6,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-27.png","element":"img","alt":" p × n","inline":true,"padRight":true},{"text":"matrix, ","element":"span"},{"text":"B ","element":"span"},{"text":"is a ","element":"span"},{"style":{"height":14},"width":105.6,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-28.png","element":"img","alt":" p × K","inline":true,"padRight":true},{"text":"matrix, and ","element":"span"},{"text":"U ","element":"span"},{"text":"is a ","element":"span"},{"style":{"height":11.2},"width":93.6,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-29.png","element":"img","alt":" p × n","inline":true,"padRight":true},{"text":"matrix. ","element":"span"},{"style":{"height":7.6},"width":16,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-30.png","element":"img","alt":"1","inline":true,"padRight":true},{"text":"Define the covariance matrix of the ","element":"span"},{"style":{"height":11.2},"width":59.8,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-31.png","element":"img","alt":" p ×","inline":true,"padRight":true},{"text":"1 vector of errors ","element":"span"},{"style":{"height":9.5},"width":39.36,"height":23.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-32.png","element":"img","alt":" ut","inline":true,"padRight":true},{"text":":= (","element":"span"},{"style":{"height":16.7},"width":404.24,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-33.png","element":"img","alt":"u1,t, · · · , uj,t, · · · , up,t)′","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-34.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":19.2},"width":149,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/5-35.png","element":"img","alt":" E�utu′t�","inline":true},{"text":".","element":"span"}],[{"text":"We take ","element":"span"},{"style":{"height":16.03},"width":223.36,"height":40.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-0.png","element":"img","alt":" {(f t, ut)}nt=1","inline":true,"padRight":true},{"text":"to be a strictly stationary, ergodic, and strong mixing sequence of random variables. ","element":"span"},{"text":"Also, let ","element":"span"},{"style":{"height":17.39},"width":169.76,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-1.png","element":"img","alt":" F0−∞, F∞n","inline":true,"padRight":true},{"text":"be the ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-2.png","element":"img","alt":" Σ","inline":true},{"text":"- algebras generated by ","element":"span"},{"style":{"height":16.03},"width":170.72,"height":40.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-3.png","element":"img","alt":" {(f t, ut)}","inline":true},{"text":", for ","element":"span"},{"style":{"height":12.8},"width":180.28,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-4.png","element":"img","alt":" −∞ < t ≤","inline":true,"padRight":true},{"text":"0, and ","element":"span"},{"style":{"height":12.8},"width":184.96,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-5.png","element":"img","alt":" n ≤ t < ∞","inline":true},{"text":", respectively. ","element":"span"},{"text":"Denote the strong mixing coefficient as ","element":"span"},{"style":{"height":16},"width":64.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-6.png","element":"img","alt":" α(n","inline":true},{"text":") := sup","element":"span"},{"style":{"height":19.49},"width":647.48,"height":48.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-7.png","element":"img","alt":"A∈F 0−∞,B∈F ∞n |P(A)P(B) − P(A ∩ B)|.","inline":true}],[{"text":"In Assumption ","element":"span"},{"href":"#id-12","text":"7 ","element":"a"},{"text":"below, we assume that maximum eigenvalue of ","element":"span"},{"style":{"height":13.11},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-8.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"can grow with sample size, this is due to ","element":"span"},{"style":{"height":13.11},"width":52.64,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-9.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"being a ","element":"span"},{"style":{"height":11.2},"width":88.72,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-10.png","element":"img","alt":" p × p","inline":true,"padRight":true},{"text":"matrix where ","element":"span"},{"text":"p ","element":"span"},{"text":"may grow with ","element":"span"},{"text":"n","element":"span"},{"text":". We will assume sparsity for the precision matrix of errors ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-11.png","element":"img","alt":" Ω","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.54},"width":74.08,"height":46.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-12.png","element":"img","alt":" Σ−1n","inline":true,"padRight":true},{"text":", but we do not subscript ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-13.png","element":"img","alt":" Ω","inline":true,"padRight":true},{"text":"with ","element":"span"},{"text":"n ","element":"span"},{"text":"to avoid cumbersome notation. Each row of ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-14.png","element":"img","alt":" Ω","inline":true,"padRight":true},{"text":"will","element":"span"}],[{"style":{"width":"96%"},"width":1802,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-15.png","element":"img"}],[{"text":"where Ω","element":"span"},{"style":{"height":10},"width":32.56,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-16.png","element":"img","alt":"j,l","inline":true,"padRight":true},{"text":"represents the ","element":"span"},{"text":"l","element":"span"},{"text":"th element in the ","element":"span"},{"text":"j","element":"span"},{"text":"th row of ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-17.png","element":"img","alt":" Ω","inline":true},{"text":". Let ","element":"span"},{"style":{"height":17.23},"width":40.88,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-18.png","element":"img","alt":" Scj","inline":true,"padRight":true},{"text":"represents the index set of all zero elements ","element":"span"},{"text":"in the ","element":"span"},{"text":"j","element":"span"},{"text":"th row of ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-19.png","element":"img","alt":" Ω","inline":true},{"text":". Define the cardinality of the non-zero cells in the ","element":"span"},{"text":"j","element":"span"},{"text":"th row of the precision matrix as ","element":"span"},{"style":{"height":11.51},"width":31.72,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-20.png","element":"img","alt":"sj","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":16.71},"width":63.32,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-21.png","element":"img","alt":" |Sj|","inline":true},{"text":", which can be nondecreasing in ","element":"span"},{"text":"n","element":"span"},{"text":", but we do not subscript that with ","element":"span"},{"text":"n","element":"span"},{"text":". Denote the maximum number of nonzero elements across all rows ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-22.png","element":"img","alt":", · · · , p","inline":true,"padRight":true},{"text":"of the precision matrix ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-23.png","element":"img","alt":" Ω","inline":true,"padRight":true},{"text":"as ¯","element":"span"},{"text":"s ","element":"span"},{"text":":= max","element":"span"},{"style":{"height":11.51},"width":137.32,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-24.png","element":"img","alt":"1≤j≤p sj","inline":true},{"text":", which is nondecreasing in ","element":"span"},{"text":"n","element":"span"},{"text":".","element":"span"}],[{"text":"This last definition plays a key role in analysis of the rate of convergence of estimation errors. Note that, just to be clear, when ","element":"span"},{"style":{"height":8.8},"width":135.52,"height":22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-25.png","element":"img","alt":" n → ∞","inline":true},{"text":", we allow ","element":"span"},{"style":{"height":14},"width":307.84,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-26.png","element":"img","alt":" p → ∞, K → ∞","inline":true},{"text":", and ¯","element":"span"},{"style":{"height":8.8},"width":130.24,"height":22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-27.png","element":"img","alt":"s → ∞","inline":true},{"text":". As in the literature, we do not subscript them by ","element":"span"},{"text":"n","element":"span"},{"text":". Also, we allow for ","element":"span"},{"text":"p > n","element":"span"},{"text":", when ","element":"span"},{"style":{"height":11.6},"width":285.28,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-28.png","element":"img","alt":" n → ∞, p → ∞","inline":true},{"text":", and ","element":"span"},{"style":{"height":16},"width":175.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-29.png","element":"img","alt":" p/n → ∞","inline":true,"padRight":true},{"text":"in our analysis in Theorems 1-7, which can be considered ultra-high dimensional portfolio analysis. For future references, we","element":"span"}],[{"text":"denote all of the asset returns except the ","element":"span"},{"text":"j","element":"span"},{"text":"th one as","element":"span"}],[{"id":"id-35","style":{"width":"60%"},"width":1126,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-30.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.5},"width":74.92,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-31.png","element":"img","alt":" Y −j","inline":true},{"text":", of dimension (","element":"span"},{"style":{"height":10},"width":60.28,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-32.png","element":"img","alt":"p −","inline":true,"padRight":true},{"text":"1) ","element":"span"},{"style":{"height":8},"width":63.84,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-33.png","element":"img","alt":" × n","inline":true},{"text":", is the ","element":"span"},{"text":"Y ","element":"span"},{"text":"matrix without the ","element":"span"},{"text":"j","element":"span"},{"text":"th row, ","element":"span"},{"style":{"height":15.5},"width":74.44,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-34.png","element":"img","alt":" B−j","inline":true,"padRight":true},{"text":"is the (","element":"span"},{"style":{"height":10},"width":60.28,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-35.png","element":"img","alt":"p −","inline":true,"padRight":true},{"text":"1) ","element":"span"},{"style":{"height":10.8},"width":75.84,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-36.png","element":"img","alt":" × K","inline":true,"padRight":true},{"text":"matrix which is ","element":"span"},{"text":"B ","element":"span"},{"text":"without the ","element":"span"},{"text":"j","element":"span"},{"text":"th row, and ","element":"span"},{"style":{"height":15.5},"width":74.44,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-37.png","element":"img","alt":" U −j","inline":true,"padRight":true},{"text":"is the (","element":"span"},{"style":{"height":10},"width":58.36,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-38.png","element":"img","alt":"p −","inline":true,"padRight":true},{"text":"1) ","element":"span"},{"style":{"height":8},"width":62.4,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-39.png","element":"img","alt":" × n","inline":true,"padRight":true},{"text":"matrix given by ","element":"span"},{"text":"U ","element":"span"},{"text":"matrix without the ","element":"span"},{"text":"j","element":"span"},{"text":"th row.","element":"span"}],[{"text":"It has been well established in the literature that in case of known ","element":"span"},{"style":{"height":15.51},"width":105.64,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-40.png","element":"img","alt":" Uj, γj","inline":true},{"text":", which is essential input in nodewise regression, can be recovered with the following lasso problem, with a sequence ","element":"span"},{"style":{"height":13.11},"width":93.88,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-41.png","element":"img","alt":" λn >","inline":true,"padRight":true},{"text":"0, for all","element":"span"}],[{"id":"id-33","style":{"width":"99%"},"width":1869,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-42.png","element":"img"}],[{"text":"The main issue with ","element":"span"},{"href":"#id-33","text":"(5) ","element":"a"},{"text":"is, unlike nodewise regression in ","element":"span"},{"href":"#id-8","text":"Caner and Kock ","element":"a"},{"href":"#id-8","text":"(2018)","element":"a"},{"text":", it is infeasible due to error terms regressed on each other. We now show how to turn this to feasible regression and still consistently estimate ","element":"span"},{"style":{"height":13.44},"width":38.92,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-43.png","element":"img","alt":" γj","inline":true},{"text":".","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":189,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/6-44.png","element":"img"}],[{"text":"By equation ","element":"span"},{"href":"#id-34","text":"(2) ","element":"a"},{"text":"we can define the OLS residual as","element":"span"}],[{"id":"id-36","style":{"width":"70%"},"width":1323,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-0.png","element":"img"}],[{"text":"where ","element":"span"},{"text":"X ","element":"span"},{"text":"is a ","element":"span"},{"style":{"height":10.8},"width":108.96,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-1.png","element":"img","alt":" K × n","inline":true,"padRight":true},{"text":"matrix and","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":147,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-2.png","element":"img"}],[{"text":"Define the residuals by transposing ","element":"span"},{"href":"#id-35","text":"(4) ","element":"a"},{"text":"such that","element":"span"}],[{"id":"id-37","style":{"width":"74%"},"width":1387,"height":229,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-3.png","element":"img"}],[{"text":"Note that ","element":"span"},{"style":{"height":18.14},"width":74.44,"height":45.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-4.png","element":"img","alt":"�U′−j","inline":true,"padRight":true},{"text":"is a ","element":"span"},{"style":{"height":16},"width":149.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-5.png","element":"img","alt":" n × (p −","inline":true,"padRight":true},{"text":"1) matrix ","element":"span"},{"style":{"height":13.1},"width":76.92,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-6.png","element":"img","alt":" M X","inline":true,"padRight":true},{"text":"is a ","element":"span"},{"style":{"height":8},"width":97.44,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-7.png","element":"img","alt":" n × n","inline":true,"padRight":true},{"text":"matrix, and ","element":"span"},{"style":{"height":17.23},"width":74.44,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-8.png","element":"img","alt":" U ′−j","inline":true,"padRight":true},{"text":"is a ","element":"span"},{"style":{"height":16},"width":149.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-9.png","element":"img","alt":" n × (p −","inline":true,"padRight":true},{"text":"1) matrix. Next, use ","element":"span"},{"href":"#id-36","text":"(7) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-37","text":"(9)","element":"a"},{"text":":","element":"span"}],[{"id":"id-75","style":{"width":"58%"},"width":1100,"height":62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-10.png","element":"img"}],[{"text":"where","element":"span"}],[{"style":{"width":"99%"},"width":1871,"height":142,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-11.png","element":"img"}],[{"text":"of the residuals affect the consistent estimation of ","element":"span"},{"style":{"height":13.63},"width":38.92,"height":34.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-12.png","element":"img","alt":" γj","inline":true},{"text":". We define a feasible nodewise estimator","element":"span"}],[{"id":"id-54","style":{"width":"99%"},"width":1868,"height":371,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-13.png","element":"img"}],[{"text":"Now, to form the ","element":"span"},{"text":"j","element":"span"},{"text":"th row of ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-14.png","element":"img","alt":"�Ω","inline":true},{"text":", set the ","element":"span"},{"text":"j","element":"span"},{"text":"th element in the ","element":"span"},{"text":"j","element":"span"},{"text":"th row as","element":"span"}],[{"style":{"width":"57%"},"width":1079,"height":268,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-15.png","element":"img"}],[{"text":"We want to show that for each ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-16.png","element":"img","alt":", · · · , p","inline":true},{"text":", ","element":"span"},{"style":{"height":17.95},"width":47.12,"height":44.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-17.png","element":"img","alt":"�Ω′j","inline":true,"padRight":true},{"text":"is consistent. We can write ","element":"span"},{"style":{"height":17.95},"width":47.12,"height":44.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-18.png","element":"img","alt":" �Ω′j","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":19.79},"width":109.6,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-19.png","element":"img","alt":" �C′j/�τ2j","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"height":17.95},"width":49.52,"height":44.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-20.png","element":"img","alt":" �C′j","inline":true,"padRight":true},{"text":"being an ","element":"span"},{"text":"1 ","element":"span"},{"style":{"height":11.2},"width":61.84,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-21.png","element":"img","alt":" × p","inline":true,"padRight":true},{"text":"matrix of ones in ","element":"span"},{"text":"j","element":"span"},{"text":"th cell and ","element":"span"},{"style":{"height":14.11},"width":71.12,"height":35.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/7-22.png","element":"img","alt":" −�γ′j","inline":true,"padRight":true},{"text":"in the other cells.","element":"span"}],[{"text":"2.1 ","element":"span"},{"text":"Assumptions and a Key Result","element":"span"}],[{"text":"In this part, we provide the assumptions that will be needed for consistency for the ","element":"span"},{"text":"j","element":"span"},{"text":"th row of the precision matrix estimator. Let ","element":"span"},{"style":{"height":11.51},"width":57.6,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-0.png","element":"img","alt":" uj,t","inline":true,"padRight":true},{"text":"be the ","element":"span"},{"text":"j ","element":"span"},{"text":"the element of the ","element":"span"},{"style":{"height":11.2},"width":61.24,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-1.png","element":"img","alt":" p ×","inline":true,"padRight":true},{"text":"1 vector ","element":"span"},{"style":{"height":9.51},"width":39.36,"height":23.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-2.png","element":"img","alt":" ut","inline":true},{"text":". Similarly, ","element":"span"},{"style":{"height":11.91},"width":86.88,"height":29.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-3.png","element":"img","alt":" u−j,t","inline":true,"padRight":true},{"text":"is the (","element":"span"},{"style":{"height":10},"width":61.72,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-4.png","element":"img","alt":"p −","inline":true,"padRight":true},{"text":"1) ","element":"span"},{"style":{"height":8},"width":31,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-5.png","element":"img","alt":" ×","inline":true,"padRight":true},{"text":"1 vector of errors in ","element":"span"},{"text":"t","element":"span"},{"text":"th time period, except the ","element":"span"},{"text":"j","element":"span"},{"text":"th term in ","element":"span"},{"style":{"height":9.5},"width":38.88,"height":23.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-6.png","element":"img","alt":" ut","inline":true},{"text":". Define ","element":"span"},{"style":{"height":11.5},"width":54.24,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-7.png","element":"img","alt":" ηj,t","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":13.82},"width":235.72,"height":34.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-8.png","element":"img","alt":" uj,t − u′−j,tγj","inline":true},{"text":".","element":"span"}],[{"id":"id-38","text":"Assumption 1. ","element":"span"},{"text":"(i). ","element":"span"},{"style":{"height":16.03},"width":286.72,"height":40.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-9.png","element":"img","alt":" {ut}nt=1, {f t}nt=1","inline":true,"padRight":true},{"text":"are sequences of (strictly) stationary and ergodic random variables. ","element":"span"},{"text":"Furthermore, ","element":"span"},{"style":{"height":16},"width":286.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-10.png","element":"img","alt":" {ut}nt=1, {f t}nt=1","inline":true,"padRight":true},{"text":"are independent. ","element":"span"},{"style":{"height":9.5},"width":39.36,"height":23.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-11.png","element":"img","alt":" ut","inline":true,"padRight":true},{"text":"is a (","element":"span"},{"style":{"height":11.2},"width":61.24,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-12.png","element":"img","alt":"p ×","inline":true,"padRight":true},{"text":"1","element":"span"},{"text":") zero mean random vector with covariance ","element":"span"},{"text":"matrix ","element":"span"},{"style":{"height":16},"width":177.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-13.png","element":"img","alt":" Σn (p × p","inline":true},{"text":"). ","element":"span"},{"text":"Eigmin(","element":"span"},{"style":{"height":16},"width":184.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-14.png","element":"img","alt":"Σn) ≥ c >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":", with ","element":"span"},{"text":"c ","element":"span"},{"text":"a positive constant, and ","element":"span"},{"text":"max","element":"span"},{"style":{"height":28.8},"width":415.84,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-15.png","element":"img","alt":"1≤j≤p E�u2j,t�≤ C < ∞","inline":true},{"text":". (ii). For the strong mixing variables ","element":"span"},{"style":{"height":16},"width":243.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-16.png","element":"img","alt":" f t, ut: α(t) ≤","inline":true,"padRight":true},{"text":"exp(","element":"span"},{"style":{"height":10.8},"width":105.68,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-17.png","element":"img","alt":"−Ctr0","inline":true},{"text":")","element":"span"},{"text":", for a positive constant ","element":"span"},{"style":{"height":11.1},"width":78.04,"height":27.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-18.png","element":"img","alt":" r0 >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":".","element":"span"}],[{"id":"id-40","text":"Assumption 2. ","element":"span"},{"text":"There exists positive constants ","element":"span"},{"style":{"height":12},"width":202.84,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-19.png","element":"img","alt":" r1, r2, r3 >","inline":true,"padRight":true},{"text":"0 ","element":"span"},{"text":"and another set of positive constants ","element":"span"},{"style":{"height":14},"width":108.16,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-20.png","element":"img","alt":" B1, b2","inline":true},{"text":",","element":"span"}],[{"style":{"width":"100%"},"width":1873,"height":598,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-21.png","element":"img"}],[{"text":"3","element":"span"},{"style":{"height":18.54},"width":257.28,"height":46.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-22.png","element":"img","alt":"r−13 + r−10 > 1.","inline":true}],[{"id":"id-39","style":{"width":"95%"},"width":1791,"height":143,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-23.png","element":"img"}],[{"id":"id-41","text":"Assumption 4. ","element":"span"},{"text":"(i). ","element":"span"},{"text":"Eigmin[cov(","element":"span"},{"style":{"height":16},"width":185.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-24.png","element":"img","alt":"f t)] ≥ c >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":", with ","element":"span"},{"text":"cov(","element":"span"},{"style":{"height":14.64},"width":39.36,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-25.png","element":"img","alt":"f t","inline":true},{"text":") ","element":"span"},{"text":"being the covariance matrix of the factors ","element":"span"},{"style":{"height":14.64},"width":38.88,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-26.png","element":"img","alt":" f t","inline":true},{"text":", ","element":"span"},{"text":"t ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":119.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-27.png","element":"img","alt":", · · · , n","inline":true},{"text":". (ii). ","element":"span"},{"text":"max","element":"span"},{"style":{"height":19.2},"width":422.08,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-28.png","element":"img","alt":"1≤k≤K E�f 2kt�≤ C < ∞","inline":true},{"text":", ","element":"span"},{"text":"min","element":"span"},{"style":{"height":19.2},"width":344.44,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-29.png","element":"img","alt":"1≤k≤k E�f 2kt�≥ c >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":". (iii). ","element":"span"},{"text":"max","element":"span"},{"style":{"height":28.8},"width":363.64,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-30.png","element":"img","alt":"1≤j≤p E�η2j,t�≤ C <","inline":true}],[{"id":"id-42","style":{"width":"80%"},"width":1516,"height":136,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-31.png","element":"img"}],[{"text":"Note that Assumptions ","element":"span"},{"href":"#id-38","text":"1-","element":"a"},{"href":"#id-39","text":"3 ","element":"a"},{"text":"are standard assumptions and are used in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011) ","element":"a"},{"text":"as well. Also, we","element":"span"}],[{"style":{"width":"99%"},"width":1870,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/8-32.png","element":"img"}],[{"text":"that, Stationary GARCH models with finite second moments and continuous error distributions, as well as causal ARMA processes with continuous error distributions, and a certain class of stationary Markov chains satisfy our Assumptions ","element":"span"},{"href":"#id-38","text":"1-","element":"a"},{"href":"#id-40","text":"2 ","element":"a"},{"text":"and are discussed in p.61 of ","element":"span"},{"href":"#id-7","text":"Chang et al. ","element":"a"},{"href":"#id-7","text":"(2018)","element":"a"},{"text":". ","element":"span"},{"href":"#id-7","text":"Chang et al. ","element":"a"},{"href":"#id-7","text":"(2018) ","element":"a"},{"text":"also uses similar assumptions.","element":"span"}],[{"text":"Assumption ","element":"span"},{"href":"#id-41","text":"4(","element":"a"},{"text":"i)-(ii) is also used in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":", and the nodewise error assumption ","element":"span"},{"href":"#id-41","text":"4(","element":"a"},{"text":"iii) is used in ","element":"span"},{"href":"#id-8","text":"Caner and Kock ","element":"a"},{"href":"#id-8","text":"(2018)","element":"a"},{"text":". Assumption ","element":"span"},{"href":"#id-42","text":"5 ","element":"a"},{"text":"shows the interaction of sparsity of the precision matrix with factors. They both contribute negatively to biases that our analysis will show below.","element":"span"}],[{"text":"Before the next theorem, we define ","element":"span"},{"style":{"height":13.1},"width":43.04,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-0.png","element":"img","alt":" λn","inline":true,"padRight":true},{"text":"formally. Let ","element":"span"},{"text":"C > ","element":"span"},{"text":"0 be a generic positive constant, then","element":"span"}],[{"style":{"width":"67%"},"width":1267,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-1.png","element":"img"}],[{"text":"where we specify tuning parameter in Lemma A.5 in Supplement, and the asymptotic negligibility is by Assumption 5. Note ","element":"span"},{"href":"#id-9","text":"that in tunin","element":"a"},{"href":"#id-9","text":"g para","element":"a"},{"text":"meter ","element":"span"},{"style":{"height":13.1},"width":43.04,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-2.png","element":"img","alt":" λn","inline":true},{"text":", the first term involving ","element":"span"},{"style":{"height":13.36},"width":52.96,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-3.png","element":"img","alt":" K2","inline":true,"padRight":true},{"text":"is due to nodewise regression","element":"span"}],[{"text":"via factor models. In ","element":"span"},{"href":"#id-9","text":"Callot et al. ","element":"a"},{"href":"#id-9","text":"(2021)","element":"a"},{"text":", without factor models, they have the second term only","element":"span"},{"style":{"height":28.8},"width":40,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-4.png","element":"img","alt":"�","inline":true}],[{"text":"now provide one of the main Theorems in the paper. Theorem provides consistent estimates for the rows of the precision matrix of errors.","element":"span"}],[{"style":{"width":"84%"},"width":1576,"height":253,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-5.png","element":"img"}],[{"text":"1. Note that ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":15.5},"width":113.8,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-6.png","element":"img","alt":"Ωj, Ωj","inline":true,"padRight":true},{"text":"are not columns of ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":14},"width":83.88,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-7.png","element":"img","alt":"Ω, Ω","inline":true,"padRight":true},{"text":"respectively. ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":15.5},"width":113.8,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-8.png","element":"img","alt":"Ωj, Ωj","inline":true,"padRight":true},{"text":"are column representation of row vectors","element":"span"}],[{"style":{"width":"18%"},"width":349,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-9.png","element":"img"}],[{"text":"2. As long as Assumption ","element":"span"},{"href":"#id-42","text":"5 ","element":"a"},{"text":"is maintained, the rate of approximation error in Theorem ","element":"span"},{"href":"#id-43","text":"1 ","element":"a"},{"text":"matches the","element":"span"}],[{"style":{"width":"94%"},"width":1771,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-10.png","element":"img"}]]},{"heading":"3 Precision Matrix Estimate for The Returns","paragraphs":[[{"text":"Assuming orthogonality between factors and the idiosyncratic errors, the (","element":"span"},{"style":{"height":11.2},"width":94,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-11.png","element":"img","alt":"p × p","inline":true},{"text":") covariance matrix of the asset returns is defined as:","element":"span"}],[{"style":{"width":"61%"},"width":1147,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-12.png","element":"img"}],[{"text":"We start with the precision matrix formula for the asset returns, based on factor model that we used. Using Sherman-Morrison-Woodbury formula, as in p.13 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013)","element":"a"},{"text":", ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-13.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-14.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":"is defined as:","element":"span"}],[{"style":{"width":"71%"},"width":1334,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-15.png","element":"img"}],[{"text":"and the precision matrix estimator for the returns is","element":"span"}],[{"id":"id-45","style":{"width":"72%"},"width":1365,"height":82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-16.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.5},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-17.png","element":"img","alt":"�Ωsym","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":19.31},"width":86.8,"height":48.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-18.png","element":"img","alt":"�Ω+�Ω′2","inline":true,"padRight":true},{"text":"is the symmetrized version of our feasible nodewise regression estimator for the precision matrix for errors. ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-19.png","element":"img","alt":"�","inline":true},{"text":"cov(","element":"span"},{"style":{"height":14.64},"width":39.36,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-20.png","element":"img","alt":"f t","inline":true},{"text":") = ","element":"span"},{"style":{"height":17.2},"width":460.4,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-21.png","element":"img","alt":" n−1XX′ − n−2X1n1′nX′","inline":true,"padRight":true},{"text":"is the estimator for the covariance matrix ","element":"span"},{"text":"of returns, and it is given in p.3327 of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011) ","element":"a"},{"text":"with ","element":"span"},{"style":{"height":12.7},"width":42.56,"height":31.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-22.png","element":"img","alt":" 1n","inline":true,"padRight":true},{"text":"representing a (","element":"span"},{"style":{"height":8},"width":63.16,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-23.png","element":"img","alt":"n ×","inline":true,"padRight":true},{"text":"1) vector of ones. Also, ","element":"span"},{"style":{"height":10.8},"width":35,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-24.png","element":"img","alt":"�B","inline":true,"padRight":true},{"text":"= (","element":"span"},{"style":{"height":17.36},"width":269.44,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-25.png","element":"img","alt":"Y X′)(XX′)−1","inline":true,"padRight":true},{"text":"is the least-squares estimator for the factor model in ","element":"span"},{"href":"#id-44","text":"(3)","element":"a"},{"text":". In addition, ","element":"span"},{"style":{"height":10.8},"width":35,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-26.png","element":"img","alt":" �B","inline":true,"padRight":true},{"text":"is a (","element":"span"},{"style":{"height":14},"width":104.16,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/9-27.png","element":"img","alt":"p × K","inline":true},{"text":") matrix, and ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-0.png","element":"img","alt":"�","inline":true},{"text":"cov(","element":"span"},{"style":{"height":14.83},"width":38.88,"height":37.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-1.png","element":"img","alt":"f t","inline":true},{"text":") is a ","element":"span"},{"style":{"height":10.8},"width":127.2,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-2.png","element":"img","alt":" K × K","inline":true,"padRight":true},{"text":"matrix. Note that we use a symmetric version of our precision matrix estimator for errors in the term in square brackets in equation ","element":"span"},{"href":"#id-45","text":"(19)","element":"a"},{"text":". There is a technical reason behind that. The proofs depend on the symmetry of the matrix in the square brackets in ","element":"span"},{"href":"#id-45","text":"(19)","element":"a"},{"text":", but the other parts in the proof do not need symmetry of the precision matrix estimator. Hence, we use both symmetrized, ","element":"span"},{"style":{"height":15.5},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-3.png","element":"img","alt":"�Ωsym","inline":true,"padRight":true},{"text":"and standard (non-symmetric version) of the precision matrix estimator, ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-4.png","element":"img","alt":"�Ω","inline":true},{"text":". We want to rewrite the precision","element":"span"}],[{"text":"matrix and it’s estimator so that it’s convenient to analyze them technically. In this respect, define","element":"span"}],[{"style":{"width":"35%"},"width":657,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-5.png","element":"img"}],[{"text":"and","element":"span"}],[{"style":{"width":"38%"},"width":720,"height":82,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-6.png","element":"img"}],[{"text":"As a consequence,","element":"span"}],[{"id":"id-46","style":{"width":"65%"},"width":1225,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-7.png","element":"img"}],[{"text":"We need to find max","element":"span"},{"style":{"height":16.7},"width":299.68,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-8.png","element":"img","alt":"1≤j≤p ∥�Γj − Γj∥1","inline":true,"padRight":true},{"text":"where ","element":"span"},{"style":{"height":17.04},"width":41.36,"height":42.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-9.png","element":"img","alt":" Γ′j","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":17.95},"width":41.36,"height":44.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-10.png","element":"img","alt":" �Γ′j","inline":true,"padRight":true},{"text":"are the 1 ","element":"span"},{"style":{"height":11.2},"width":62.32,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-11.png","element":"img","alt":" × p","inline":true,"padRight":true},{"text":"dimensional rows of the precision ","element":"span"},{"text":"matrix of the returns and its estimator, respectively. ","element":"span"},{"style":{"height":15.5},"width":40.84,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-12.png","element":"img","alt":" Γj","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":15.5},"width":40.36,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-13.png","element":"img","alt":"�Γj","inline":true,"padRight":true},{"text":"are simply transposes of these rows which","element":"span"}],[{"text":"are ","element":"span"},{"style":{"height":11.2},"width":59.8,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-14.png","element":"img","alt":" p ×","inline":true,"padRight":true},{"text":"1. In this respect, using ","element":"span"},{"href":"#id-46","text":"(20) ","element":"a"},{"text":"we have that","element":"span"}],[{"id":"id-47","style":{"width":"87%"},"width":1631,"height":77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-15.png","element":"img"}],[{"text":"Our aim is to simplify and get rates of convergence for the right side term in ","element":"span"},{"href":"#id-47","text":"(21)","element":"a"},{"text":". To get consistency and rate of convergence results for the precision matrix for returns, rather than the errors as in Theorem ","element":"span"},{"href":"#id-43","text":"1 ","element":"a"},{"text":"above, we need the following assumption on factor loadings.","element":"span"}],[{"id":"id-48","style":{"width":"47%"},"width":897,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-16.png","element":"img"}],[{"text":"(i). ","element":"span"},{"text":"max","element":"span"},{"style":{"height":10},"width":97.16,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-17.png","element":"img","alt":"1≤j≤p","inline":true,"padRight":true},{"text":"max","element":"span"},{"style":{"height":16.7},"width":384,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-18.png","element":"img","alt":"1≤k≤K |bjk| ≤ C < ∞.","inline":true}],[{"style":{"width":"99%"},"width":1865,"height":115,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-19.png","element":"img"}],[{"text":"Also, a strengthened assumption on sparsity compared to Assumption ","element":"span"},{"href":"#id-42","text":"5 ","element":"a"},{"text":"is provided.","element":"span"}],[{"id":"id-12","style":{"width":"99%"},"width":1868,"height":533,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/10-20.png","element":"img"}],[{"style":{"width":"0%"},"width":7,"height":2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-0.png","element":"img"}],[{"text":"Specifically, the rate ","element":"span"},{"style":{"height":13.1},"width":32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-1.png","element":"img","alt":" ln","inline":true,"padRight":true},{"text":"is the rate of estimation error for ","element":"span"},{"style":{"height":16},"width":187.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-2.png","element":"img","alt":" ∥�L − L∥l∞","inline":true,"padRight":true},{"text":"as in Lemma A.13 in Supplement A. Note that Assumption ","element":"span"},{"href":"#id-48","text":"6 ","element":"a"},{"text":"is used in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":". Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i) is used in ","element":"span"},{"href":"#id-11","text":"Gagliardini et al. ","element":"a"},{"href":"#id-11","text":"(2016)","element":"a"},{"text":". Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i) allows for the maximal eigenvalue of ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-3.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"to grow with ","element":"span"},{"text":"n","element":"span"},{"text":". In the special case of a diagonal ","element":"span"},{"style":{"height":13.11},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-4.png","element":"img","alt":"Σn","inline":true},{"text":", due to Assumption ","element":"span"},{"href":"#id-38","text":"1(","element":"a"},{"text":"i), the maximum eigenvalue of a diagonal ","element":"span"},{"style":{"height":13.11},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-5.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"matrix is finite. However, a diagonal matrix of variance of errors case is empirically less relevant and less realistic. ","element":"span"},{"text":"We expect the errors to be correlated across assets. ","element":"span"},{"text":"For an example of where the maximum eigenvalue of ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-6.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"may diverge, we show that this may be the case for block diagonal matrix structure for ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-7.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-49","text":"(24)","element":"a"},{"text":". Note that ","element":"span"},{"href":"#id-50","text":"Shanken ","element":"a"},{"href":"#id-50","text":"(1992) ","element":"a"},{"text":"criticizes standard Arbitrage Pricing Theory since eigenvalue of the residual covariances must be bounded even when the number of assets diverge. Our Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i) moves away from maximum bounded eigenvalue assumption. Our residual covariances approximate error covariances very well and this can be seen in ","element":"span"},{"href":"#id-51","text":"(A.40) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-52","text":"(A.41) ","element":"a"},{"text":"in Supplement A.","element":"span"}],[{"text":"Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"ii) is a sparsity assumption which tradeoffs between maximal eigenvalue and the sparsity of the precision matrix. This assumption is needed to analyze the precision matrix for the asset returns. To give an example, ignoring constants, we can have ¯","element":"span"},{"text":"s ","element":"span"},{"text":"= ln(","element":"span"},{"text":"n","element":"span"},{"text":")","element":"span"},{"text":", K ","element":"span"},{"text":"= ln(","element":"span"},{"text":"n","element":"span"},{"text":")","element":"span"},{"text":", p ","element":"span"},{"text":"= 2","element":"span"},{"text":"n","element":"span"},{"text":", and ","element":"span"},{"style":{"height":17.36},"width":246.09,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-8.png","element":"img","alt":" rn = n1/5, λn","inline":true,"padRight":true},{"text":"=","element":"span"}],[{"text":"O","element":"span"},{"text":"[max(ln(","element":"span"},{"style":{"height":19.2},"width":191.2,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-9.png","element":"img","alt":"n)7/2/n,�","inline":true},{"text":"ln(","element":"span"},{"text":"n","element":"span"},{"text":")","element":"span"},{"text":"/n","element":"span"},{"text":"]. Then, Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"ii) is satisfied","element":"span"}],[{"style":{"width":"99%"},"width":1867,"height":228,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-10.png","element":"img"}],[{"text":"main results, which is the consistent estimation of the precision matrix for asset returns. Since the precision","element":"span"}],[{"text":"matrix of asset returns is in the formula of the Sharpe Ratio, as will be shown in Section 4, this theorem is crucial for subsequent analysis.","element":"span"}],[{"id":"id-88","style":{"width":"75%"},"width":1414,"height":494,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-11.png","element":"img"}],[{"text":"1. This theorem merges two key concepts: factor models and nodewise regression in high dimensional","element":"span"}],[{"style":{"width":"94%"},"width":1772,"height":181,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-12.png","element":"img"}],[{"text":"2. Although we focus on factor models in empirical asset pricing, the vector ","element":"span"},{"style":{"height":14.64},"width":39.36,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-13.png","element":"img","alt":" f t","inline":true,"padRight":true},{"text":"can be seen as any set of","element":"span"}],[{"style":{"width":"41%"},"width":786,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/11-14.png","element":"img"}],[{"text":"3.1 ","element":"span"},{"text":"Two examples relating precision matrix restrictions to covariance matrix","element":"span"}],[{"text":"We now illustrate how specific structures of the covariance matrix are compatible with the sparsity assumption for the precision matrix. We provide two examples for errors, one block-diagonal covariance matrix for errors, and the other one is the Toeplitz form for the covariance matrix of errors. Then, we provide how they affect the precision matrix and Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i).","element":"span"}],[{"text":"3.1.1 ","element":"span"},{"text":"Block Diagonal Covariance Matrix for Errors","element":"span"}],[{"text":"Suppose that there are ","element":"span"},{"text":"m ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":14},"width":137.04,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-0.png","element":"img","alt":", · · · , M","inline":true,"padRight":true},{"text":"blocks in a ","element":"span"},{"style":{"height":11.2},"width":90.64,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-1.png","element":"img","alt":" p × p","inline":true,"padRight":true},{"text":"covariance matrix of the errors.","element":"span"}],[{"style":{"width":"92%"},"width":1741,"height":870,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-2.png","element":"img"}],[{"text":"The sparsity assumption – Assumption ","element":"span"},{"href":"#id-38","text":"1 ","element":"a"},{"text":"– for ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-3.png","element":"img","alt":" Ω","inline":true,"padRight":true},{"text":"can be translated into ","element":"span"},{"style":{"height":13.11},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-4.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"as max","element":"span"},{"style":{"height":10},"width":126.08,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-5.png","element":"img","alt":"1≤m≤M","inline":true,"padRight":true},{"text":"max","element":"span"},{"style":{"height":11.51},"width":192.04,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-6.png","element":"img","alt":"1≤j≤pm spm","inline":true,"padRight":true},{"text":"= ¯","element":"span"},{"text":"s","element":"span"},{"text":", where this is the maximum number of nonzero cells in a given row of a block, across all blocks. For Assumption ","element":"span"},{"href":"#id-12","text":"7 ","element":"a"},{"text":"we need the following inequality from Corollary 6.1.5 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013)","element":"a"},{"text":", by seeing that spectral radius of a matrix is larger than or equal to absolute value of any eigenvalue for any square matrix","element":"span"}],[{"text":"A","element":"span"},{"text":". Therefore,","element":"span"}],[{"id":"id-53","style":{"width":"66%"},"width":1239,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-7.png","element":"img"}],[{"text":"For the same inequality also see Theorem 5.6.9a of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013)","element":"a"},{"text":". Relating to Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i)","element":"span"}],[{"style":{"width":"58%"},"width":1100,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-8.png","element":"img"}],[{"text":"where Σ","element":"span"},{"style":{"height":9.6},"width":94.64,"height":24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-9.png","element":"img","alt":"n,j1,j2","inline":true,"padRight":true},{"text":"is the ","element":"span"},{"style":{"height":13.6},"width":84.16,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-10.png","element":"img","alt":" j1, j2","inline":true,"padRight":true},{"text":"element of covariance matrix of errors. By ","element":"span"},{"href":"#id-53","text":"(23)","element":"a"},{"text":", this last inequality becomes","element":"span"}],[{"style":{"width":"46%"},"width":864,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-11.png","element":"img"}],[{"text":"It is easy to see that using Assumption ","element":"span"},{"href":"#id-38","text":"1(","element":"a"},{"text":"i), and under sufficient conditions for Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i), with","element":"span"}],[{"id":"id-49","style":{"width":"99%"},"width":1871,"height":150,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/12-12.png","element":"img"}],[{"text":"we get Eigmax(","element":"span"},{"style":{"height":16},"width":354.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-0.png","element":"img","alt":"Σn) ≤ Crn, rn/p →","inline":true,"padRight":true},{"text":"0. This allows the size of the blocks to be increasing with ","element":"span"},{"text":"p","element":"span"},{"text":", but the ratio of the maximum block size to total number of parameters should be small.","element":"span"}],[{"text":"3.1.2 ","element":"span"},{"text":"Toeplitz Analysis","element":"span"}],[{"text":"In this case, the correlation among errors are ","element":"span"},{"style":{"height":16.7},"width":152.64,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-1.png","element":"img","alt":" E[uj,tui,t","inline":true},{"text":"] = ","element":"span"},{"style":{"height":17.36},"width":89.64,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-2.png","element":"img","alt":" ρ|i−j|","inline":true},{"text":", with ","element":"span"},{"style":{"height":16},"width":84.76,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-3.png","element":"img","alt":" |ρ| <","inline":true,"padRight":true},{"text":"1. Then.","element":"span"}],[{"style":{"width":"34%"},"width":653,"height":384,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-4.png","element":"img"}],[{"text":"We have the tri-diagonal inverse, with all other cells being zero except the main and two adjacent diagonals.","element":"span"}],[{"style":{"width":"90%"},"width":1693,"height":690,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-5.png","element":"img"}],[{"text":"Clearly Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i) is satisfied since the sum on the right side converges to a constant.","element":"span"}],[{"text":"3.2 ","element":"span"},{"text":"Algorithm For Asset Return Based Precision Matrix Estimation","element":"span"}],[{"style":{"width":"0%"},"width":10,"height":3,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-6.png","element":"img"}],[{"text":"Here we provide a practical algorithm to get the precision matrix estimator for asset returns, ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-7.png","element":"img","alt":"�Γ","inline":true},{"text":", and it will depend on the residual-based nodewise regression estimator ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-8.png","element":"img","alt":"�Ω","inline":true},{"text":", and its symmetric version ","element":"span"},{"style":{"height":15.5},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-9.png","element":"img","alt":"�Ωsym","inline":true},{"text":".","element":"span"}],[{"text":"1. Use equation ","element":"span"},{"href":"#id-36","text":"(7) ","element":"a"},{"text":"to set up the residual from a least squares based regression via known factors with","element":"span"}],[{"style":{"width":"94%"},"width":1771,"height":266,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-10.png","element":"img"}],[{"text":"2. Form the transpose matrix of residuals for all asset returns except ","element":"span"},{"text":"j","element":"span"},{"text":"th one, ","element":"span"},{"style":{"height":18.14},"width":74.44,"height":45.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-11.png","element":"img","alt":"�U′−j","inline":true},{"text":", which is a ","element":"span"},{"style":{"height":11.2},"width":132.28,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-12.png","element":"img","alt":" n × p −","inline":true,"padRight":true},{"text":"1","element":"span"}],[{"style":{"width":"57%"},"width":1082,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/13-13.png","element":"img"}],[{"style":{"width":"97%"},"width":1822,"height":294,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-0.png","element":"img"}],[{"text":"4. Use equation ","element":"span"},{"href":"#id-54","text":"(13) ","element":"a"},{"text":"to get ","element":"span"},{"style":{"height":19.98},"width":37.6,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-1.png","element":"img","alt":" �τ 2j","inline":true,"padRight":true},{"text":".","element":"span"}],[{"text":"5. Now form ","element":"span"},{"style":{"height":17.95},"width":47.12,"height":44.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-2.png","element":"img","alt":"�Ω′j","inline":true,"padRight":true},{"text":"which is a row in the precision matrix estimate for the errors with 1","element":"span"},{"style":{"height":19.98},"width":57.76,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-3.png","element":"img","alt":"/�τ2j","inline":true,"padRight":true},{"text":"as ","element":"span"},{"text":"j","element":"span"},{"text":"th element ","element":"span"},{"text":"of that ","element":"span"},{"text":"j","element":"span"},{"text":"th row, and put all other elements of the ","element":"span"},{"text":"j","element":"span"},{"text":"th row, as ","element":"span"},{"style":{"height":19.79},"width":133.12,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-4.png","element":"img","alt":" −�Γ′j/�τ2j","inline":true,"padRight":true},{"text":".","element":"span"}],[{"style":{"width":"0%"},"width":10,"height":3,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-5.png","element":"img"}],[{"text":"6. Run steps 1-5 for all ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":116.56,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-6.png","element":"img","alt":", · · · , p","inline":true},{"text":". Stack all rows ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-7.png","element":"img","alt":", · · · , p","inline":true,"padRight":true},{"text":"to form ","element":"span"},{"style":{"height":11.2},"width":81.04,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-8.png","element":"img","alt":" p×p","inline":true,"padRight":true},{"text":"matrix: ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-9.png","element":"img","alt":"�Ω","inline":true},{"text":". Form symmetric version by ","element":"span"},{"style":{"height":15.5},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-10.png","element":"img","alt":"�Ωsym","inline":true,"padRight":true},{"text":":=","element":"span"},{"style":{"height":19.51},"width":86.32,"height":48.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-11.png","element":"img","alt":"�Ω+�Ω′2","inline":true,"padRight":true},{"text":".","element":"span"}],[{"text":"7. Form","element":"span"}],[{"style":{"width":"94%"},"width":1773,"height":453,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-12.png","element":"img"}],[{"text":"8. Now form the precision matrix estimate for all asset returns by ","element":"span"},{"href":"#id-45","text":"(19) ","element":"a"},{"text":"and steps 6-7:","element":"span"}],[{"style":{"width":"94%"},"width":1772,"height":241,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-13.png","element":"img"}]]},{"heading":"4 Sharpe Ratio Analysis with Large Number of Assets","paragraphs":[[{"text":"In this section, we apply the results, mainly the estimation of precision matrix of returns, to the analysis of the Sharpe Ratio with large number of assets. Specifically, we allow ","element":"span"},{"style":{"height":11.6},"width":126.88,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-14.png","element":"img","alt":" p → ∞","inline":true},{"text":", when ","element":"span"},{"style":{"height":8.8},"width":130.72,"height":22,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/14-15.png","element":"img","alt":" n → ∞","inline":true},{"text":". There will be four themes in each subsection below. But all of these themes relate to the analysis of consistency of the Sharpe Ratio in portfolios with a large number of assets. All our theoretical analysis is without transaction costs, however in simulations and also in empirical exercise we consider the presence of transaction costs.","element":"span"}],[{"text":"The first subsection analyzes the Sharpe Ratio of Global Minimum Variance (GMV) portfolio, and Markowitz Mean-Variance (MMV) portfolio. In the GMV portfolio, we choose the weights to minimize the variance of the portfolio and restricted to sum one. Short-sales are allowed. The Sharpe Ratio is then constructed by dividing the mean portfolio returns by its standard deviation. In MMV portfolio, weights are chosen exactly as GMV but we also impose a target for the portfolio mean return.","element":"span"}],[{"text":"The second subsection considers choosing the weights of the portfolio in such a way to maximize the Sharpe Ratio, subject to weights of the portfolio adding up to one. Short sales are allowed. The main difference between GMV in Section ","element":"span"},{"href":"#id-55","text":"4.1.1, ","element":"a"},{"text":"and the Constrained Maximum Sharpe Ratio in Section ","element":"span"},{"href":"#id-56","text":"4.2, ","element":"a"},{"text":"is that weights are chosen to minimize the variance in GMV portfolio and then the Sharpe Ratio is computed and, in case of the Constrained Maximum Sharpe Ratio, weights are chosen to maximize the Sharpe Ratio directly. Both methods use the same constraint that the weights of the portfolio should add up to one. In case of the MMV portfolio in Section ","element":"span"},{"href":"#id-57","text":"4.1.2 ","element":"a"},{"text":"weights are chosen first to minimize the portfolio variance under the conditions described earlier and then, the Sharpe Ratio is computed. The constraint of weights adding up to one is helpful in visualizing assets in percentage terms.","element":"span"}],[{"text":"In the third subsection, we analyze the maximum out-of-sample Sharpe Ratio. Here, we do not have a constraint that all weights of the portfolio should add up to one as in Sections ","element":"span"},{"href":"#id-55","text":"4.1.1, ","element":"a"},{"href":"#id-57","text":"4.1.2, ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-56","text":"4.2. ","element":"a"},{"text":"The analysis is out-sample unlike the GMV, MMV, and Constrained Maximum Sharpe Ratio portfolios. Weights are chosen to maximize the portfolio returns subject to a constraint of a given variance. But the maximum out-of-sample Sharpe Ratio use estimated weights, with population out-sample mean return vector and the out-sample covariance matrix of returns in the formula. Since the maximum eigenvalue of out-sample covariance matrix of returns is growing, this affects the estimation error rate. Specifically, Sections ","element":"span"},{"href":"#id-55","text":"4.1.1, ","element":"a"},{"href":"#id-57","text":"4.1.2, ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-56","text":"4.2 ","element":"a"},{"text":"allow ","element":"span"},{"text":"p > n ","element":"span"},{"text":"and we still get consistency, when ","element":"span"},{"style":{"height":11.6},"width":275.68,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/15-0.png","element":"img","alt":" n → ∞, p → ∞","inline":true},{"text":". With the maximum-out-of-sample Sharpe Ratio we get consistency only when ","element":"span"},{"text":"p < n ","element":"span"},{"text":"and ","element":"span"},{"style":{"height":11.6},"width":265.6,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/15-1.png","element":"img","alt":" n → ∞, p → ∞","inline":true},{"text":".","element":"span"}],[{"text":"In the fourth subsection, we consider the effect of estimated portfolio weights on obtaining the optimal Sharpe Ratio in large samples. Specifically, we estimate the weights and substitute this into the Sharpe Ratio formula, with keeping ","element":"span"},{"style":{"height":15.5},"width":94.72,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/15-2.png","element":"img","alt":" µ, Σy","inline":true,"padRight":true},{"text":"intact, and then try to show that this estimate is consistent. We show that it is possible only in the case of ","element":"span"},{"text":"p < n","element":"span"},{"text":", and this includes diverging number of assets and time span.","element":"span"}],[{"text":"Before we state the theorems, we need the following sparsity assumption. Assumption ","element":"span"},{"href":"#id-58","text":"8(","element":"a"},{"text":"i) below replaces Assumption ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"ii). In Assumption ","element":"span"},{"href":"#id-58","text":"8(","element":"a"},{"text":"ii), the first term shows square of the maximum Sharpe Ratio is lower bounded, (scaled by ","element":"span"},{"text":"p","element":"span"},{"text":"), to be positive. Scaling by ","element":"span"},{"text":"p ","element":"span"},{"text":"is needed since the numerator is summed over ","element":"span"},{"text":"p ","element":"span"},{"text":"terms. In a similar way, the second term in Assumption ","element":"span"},{"href":"#id-58","text":"8(","element":"a"},{"text":"ii) imposes that the variance of the GMV portfolio (scaled)","element":"span"}],[{"id":"id-58","style":{"width":"88%"},"width":1666,"height":445,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/15-3.png","element":"img"}],[{"text":"4.1 ","element":"span"},{"text":"Commonly Used Portfolios with a Large Number of Assets","element":"span"}],[{"id":"id-55","text":"Here, we provide consistent estimates of the Sharpe Ratio of the GMV and MMV portfolios when ","element":"span"},{"text":"p > n","element":"span"},{"text":".","element":"span"}],[{"text":"4.1.1 ","element":"span"},{"text":"Global Minimum-Variance (GMV) Portfolio","element":"span"}],[{"text":"In this part, we analyze the Sharpe Ratio that we can infer from the GMV portfolio. This is the portfolio in which weights are chosen to minimize the variance of the portfolio subject to the weights summing to one.","element":"span"}],[{"text":"Specifically,","element":"span"}],[{"style":{"width":"72%"},"width":1354,"height":69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-0.png","element":"img"}],[{"text":"The solution to the above problem is well known and is given by","element":"span"}],[{"style":{"width":"16%"},"width":308,"height":115,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-1.png","element":"img"}],[{"text":"Next, substitute these weights into the Sharpe Ratio formula, normalized by the number of assets","element":"span"}],[{"id":"id-59","style":{"width":"77%"},"width":1446,"height":132,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-2.png","element":"img"}],[{"text":"We estimate ","element":"span"},{"href":"#id-59","text":"(26) ","element":"a"},{"text":"by nodewise regression, noting that ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-3.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-4.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":",","element":"span"}],[{"style":{"width":"67%"},"width":1272,"height":156,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-5.png","element":"img"}],[{"text":"The following theorem is also valid when ","element":"span"},{"text":"p > n ","element":"span"},{"text":"and establishes both consistency and rate of convergence in the case of the Sharpe Ratio in the global minimum-variance portfolio.","element":"span"}],[{"id":"id-167","style":{"width":"73%"},"width":1368,"height":320,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-6.png","element":"img"}],[{"text":"1. We see that a large ","element":"span"},{"text":"p ","element":"span"},{"text":"only affects the error by a logarithmic factor as in the definition of ","element":"span"},{"style":{"height":13.1},"width":32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-7.png","element":"img","alt":" ln","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-12","text":"(22)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"71%"},"width":1331,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-8.png","element":"img"}],[{"text":"2. In the case of non-sparse precision matrix, we can only get consistency when ","element":"span"},{"text":"p << n","element":"span"},{"text":". To show this, in","element":"span"}],[{"style":{"width":"97%"},"width":1822,"height":429,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/16-9.png","element":"img"}],[{"id":"id-57","text":"4.1.2 ","element":"span"},{"text":"Markowitz Mean-Variance (MMV) Portfolio","element":"span"}],[{"href":"#id-60","text":"Markowitz ","element":"a"},{"href":"#id-60","text":"(1952) ","element":"a"},{"text":"portfolio selection is defined as finding the smallest variance given a desired expected","element":"span"}],[{"text":"return ","element":"span"},{"style":{"height":10},"width":36.64,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-0.png","element":"img","alt":" ρ1","inline":true},{"text":". The decision problem is","element":"span"}],[{"style":{"width":"62%"},"width":1164,"height":69,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-1.png","element":"img"}],[{"text":"The formula for optimal weight is","element":"span"}],[{"id":"id-67","style":{"width":"76%"},"width":1430,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-2.png","element":"img"}],[{"text":"where we use ","element":"span"},{"text":"A, F, D ","element":"span"},{"text":"formulas ","element":"span"},{"text":"A ","element":"span"},{"text":":= ","element":"span"},{"style":{"height":18.43},"width":199.48,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-3.png","element":"img","alt":" 1′pΓ1p/p, F","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.43},"width":188.52,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-4.png","element":"img","alt":" 1′pΓµ/p, D","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":16},"width":137.2,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-5.png","element":"img","alt":" µ′Γµ/p","inline":true},{"text":", with ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-6.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-7.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":". We define ","element":"span"},{"text":"the estimators of these terms as ","element":"span"},{"style":{"height":11.6},"width":30,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-8.png","element":"img","alt":"�A","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.43},"width":199,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-9.png","element":"img","alt":" 1′p�Γ1p/p, �F","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.43},"width":188.52,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-10.png","element":"img","alt":" 1′p�Γ�µ/p, �D","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":16},"width":137.2,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-11.png","element":"img","alt":" �µ′�Γ�µ/p","inline":true},{"text":". The optimal variance of the","element":"span"}],[{"text":"portfolio in this scenario is normalized by the number of assets","element":"span"}],[{"style":{"width":"62%"},"width":1173,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-12.png","element":"img"}],[{"text":"The estimate of that variance is","element":"span"}],[{"style":{"width":"25%"},"width":481,"height":105,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-13.png","element":"img"}],[{"text":"By our constraint, we obtain","element":"span"}],[{"style":{"width":"55%"},"width":1037,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-14.png","element":"img"}],[{"text":"Using the variance ","element":"span"},{"text":"V ","element":"span"},{"text":"above","element":"span"}],[{"id":"id-68","style":{"width":"66%"},"width":1250,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-15.png","element":"img"}],[{"text":"The estimate of the Sharpe Ratio under the MMV portfolio is","element":"span"}],[{"style":{"width":"33%"},"width":633,"height":144,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-16.png","element":"img"}],[{"text":"We provide the maximum Sharpe Ratio (squared) consistency in this framework when the number of assets is larger than the sample size. This is a novel result in the literature.","element":"span"}],[{"id":"id-179","style":{"width":"99%"},"width":1868,"height":388,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-17.png","element":"img"}],[{"text":"1. Condition ","element":"span"},{"style":{"height":15.76},"width":293.56,"height":39.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-18.png","element":"img","alt":" AD−F 2 ≥ C1 >","inline":true,"padRight":true},{"text":"0 shows that the variance is bounded away from infinity, and ","element":"span"},{"style":{"height":17.39},"width":231.64,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-19.png","element":"img","alt":" Aρ21−2Fρ1−","inline":true}],[{"style":{"width":"72%"},"width":1361,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/17-20.png","element":"img"}],[{"text":"2. We provide the rate of convergence of the estimators, which increases with ","element":"span"},{"text":"p ","element":"span"},{"text":"in a logarithmic way as","element":"span"}],[{"style":{"width":"93%"},"width":1745,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-0.png","element":"img"}],[{"text":"3. To get consistency when there is non-sparse precision matrix, the same analysis in Remark 2 of Theorem","element":"span"}],[{"style":{"width":"34%"},"width":640,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-1.png","element":"img"}],[{"text":"4. Number of factors slows the rate of convergence of estimation error to zero here. This is due to the fact","element":"span"}],[{"style":{"width":"94%"},"width":1770,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-2.png","element":"img"}],[{"id":"id-56","text":"4.2 ","element":"span"},{"text":"Maximum Sharpe Ratio: Portfolio Weights Normalized to One","element":"span"}],[{"text":"In this section, we define the maximum Sharpe Ratio when the portfolio weights are normalized to one. This, in turn will depend on a critical term that will determine the formula below. The maximum Sharpe Ratio is defined as follows, with ","element":"span"},{"text":"w ","element":"span"},{"text":"as the ","element":"span"},{"style":{"height":11.2},"width":59.8,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-3.png","element":"img","alt":" p ×","inline":true,"padRight":true},{"text":"1 vector of portfolio weights:","element":"span"}],[{"id":"id-61","style":{"width":"36%"},"width":691,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-4.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.1},"width":40.04,"height":37.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-5.png","element":"img","alt":" 1p","inline":true,"padRight":true},{"text":"is a vector of ones. This maximum Sharpe Ratio is constrained to have portfolio weights that sum to one. ","element":"span"},{"href":"#id-2","text":"Maller et al. ","element":"a"},{"href":"#id-2","text":"(2016) ","element":"a"},{"text":"shows that depending on a scalar, it has two solutions. When ","element":"span"},{"style":{"height":20.94},"width":187.48,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-6.png","element":"img","alt":" 1′pΣ−1y µ >","inline":true,"padRight":true},{"text":"0, with ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-7.png","element":"img","alt":"Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-8.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":", we have the square of the maximum Sharpe Ratio:","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":320,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-9.png","element":"img"}],[{"text":"On the other hand, when ","element":"span"},{"style":{"height":20.94},"width":187.48,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-10.png","element":"img","alt":" 1′pΣ−1y µ ≤","inline":true,"padRight":true},{"text":"0, we have","element":"span"}],[{"style":{"width":"70%"},"width":1320,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-11.png","element":"img"}],[{"text":"This is equation (6.1) of ","element":"span"},{"href":"#id-2","text":"Maller et al. ","element":"a"},{"href":"#id-2","text":"(2016)","element":"a"},{"text":". Equation ","element":"span"},{"href":"#id-61","text":"(32) ","element":"a"},{"text":"is used in the literature, and this is the formula when the weights do not necessarily sum to one given a return constraint as in ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019)","element":"a"},{"text":". In case of ","element":"span"},{"style":{"height":20.94},"width":193.72,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-12.png","element":"img","alt":" 1′pΣ−1y µ ≤","inline":true,"padRight":true},{"text":"0, in equations (2.7)-(2.10) of ","element":"span"},{"href":"#id-1","text":"Maller and Turkington ","element":"a"},{"href":"#id-1","text":"(2002)","element":"a"},{"text":", there is an approximation to ","element":"span"},{"text":"optimal portfolio weights. To be specific, with a positive ","element":"span"},{"style":{"height":12.4},"width":63.16,"height":31,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-13.png","element":"img","alt":" δ >","inline":true,"padRight":true},{"text":"0, optimal portfolio weights, which is (","element":"span"},{"style":{"height":11.2},"width":60.76,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-14.png","element":"img","alt":"p ×","inline":true,"padRight":true},{"text":"1) vector:","element":"span"}],[{"style":{"width":"30%"},"width":578,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-15.png","element":"img"}],[{"text":"where","element":"span"}],[{"style":{"width":"44%"},"width":825,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/18-16.png","element":"img"}],[{"text":"is a (","element":"span"},{"style":{"height":10},"width":59.32,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-0.png","element":"img","alt":"p −","inline":true,"padRight":true},{"text":"1) ","element":"span"},{"style":{"height":8},"width":31,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-1.png","element":"img","alt":" ×","inline":true,"padRight":true},{"text":"1 matrix with ","element":"span"},{"style":{"height":16.3},"width":91.84,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-2.png","element":"img","alt":" Ap−1","inline":true,"padRight":true},{"text":":= (","element":"span"},{"style":{"height":16.7},"width":234.32,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-3.png","element":"img","alt":"Ip−1, −1p−1)′","inline":true,"padRight":true},{"text":": ","element":"span"},{"style":{"height":11.2},"width":126.52,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-4.png","element":"img","alt":" p × p −","inline":true,"padRight":true},{"text":"1 matrix, with 1","element":"span"},{"style":{"height":10},"width":57.28,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-5.png","element":"img","alt":"p−1","inline":true,"padRight":true},{"text":"a (","element":"span"},{"style":{"height":10},"width":59.32,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-6.png","element":"img","alt":"p −","inline":true,"padRight":true},{"text":"1) column vector of","element":"span"}],[{"text":"ones, and","element":"span"}],[{"style":{"width":"36%"},"width":675,"height":114,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-7.png","element":"img"}],[{"text":"is of dimension ","element":"span"},{"style":{"height":11.2},"width":59.8,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-8.png","element":"img","alt":" p ×","inline":true,"padRight":true},{"text":"1.","element":"span"}],[{"text":"When ","element":"span"},{"style":{"height":12},"width":132.16,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-9.png","element":"img","alt":" δ → ∞","inline":true},{"text":", the weights can provide the maximum Sharpe Ratio: ","element":"span"},{"style":{"height":13.11},"width":114.32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-10.png","element":"img","alt":" MSRc","inline":true},{"text":", as discussed in p.504 of ","element":"span"},{"href":"#id-1","text":"Maller and Turkington ","element":"a"},{"href":"#id-1","text":"(2002)","element":"a"},{"text":".","element":"span"}],[{"text":"These equations can be estimated by their sample counterparts, but in the case of ","element":"span"},{"text":"p > n","element":"span"},{"text":", ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-11.png","element":"img","alt":"�Σn","inline":true,"padRight":true},{"text":"is not invertible, so we need to use new tools from high-dimensional statistics. We use the nodewise regression precision matrix estimate of ","element":"span"},{"href":"#id-0","text":"Meinshausen and B¨uhlmann ","element":"a"},{"href":"#id-0","text":"(2006)","element":"a"},{"text":". ","element":"span"},{"text":"This estimate is denoted by ","element":"span"},{"style":{"height":10.8},"width":110.76,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-12.png","element":"img","alt":"�Ω. �Ω","inline":true,"padRight":true},{"text":"is incorporated into the precision matrix of returns ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-13.png","element":"img","alt":"Γ","inline":true},{"text":".","element":"span"}],[{"text":"We will also introduce the maximum Sharpe Ratio, which addresses the uncertainty regarding whether","element":"span"}],[{"text":"we should analyze ","element":"span"},{"text":"MSR ","element":"span"},{"text":"or ","element":"span"},{"style":{"height":13.11},"width":114.32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-14.png","element":"img","alt":" MSRc","inline":true},{"text":". This is","element":"span"}],[{"style":{"width":"50%"},"width":942,"height":60,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-15.png","element":"img"}],[{"text":"Note also that with ","element":"span"},{"style":{"height":20.94},"width":145.12,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-16.png","element":"img","alt":" 1′pΣ−1y µ","inline":true,"padRight":true},{"text":"= 0, ","element":"span"},{"style":{"height":13.1},"width":272.72,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-17.png","element":"img","alt":" MSR = MSRc","inline":true},{"text":". The estimators for ","element":"span"},{"style":{"height":14.16},"width":368.8,"height":35.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-18.png","element":"img","alt":" MSR, MSRc, MSR∗","inline":true,"padRight":true},{"text":"will be intro- ","element":"span"},{"text":"duced in the next subsection.","element":"span"}],[{"text":"4.2.1 ","element":"span"},{"text":"Consistency and Rate of Convergence of Constrained Maximum Sharpe Ratio Estimators","element":"span"}],[{"text":"First, when ","element":"span"},{"style":{"height":20.94},"width":187.48,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-19.png","element":"img","alt":" 1′pΣ−1y µ >","inline":true,"padRight":true},{"text":"0, we have the square of the maximum Sharpe Ratio as in ","element":"span"},{"href":"#id-61","text":"(32)","element":"a"},{"text":". Namely, the estimate ","element":"span"},{"text":"of the square of the maximum Sharpe Ratio is:","element":"span"}],[{"id":"id-63","style":{"width":"99%"},"width":1868,"height":389,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-20.png","element":"img"}],[{"text":"1. We allow ","element":"span"},{"text":"p > n ","element":"span"},{"text":"and ","element":"span"},{"text":"p ","element":"span"},{"text":"can grow exponentially in ","element":"span"},{"text":"n","element":"span"},{"text":". We also allow for time-series data and establish","element":"span"}],[{"style":{"width":"94%"},"width":1771,"height":173,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-21.png","element":"img"}],[{"text":"2. When there is no sparsity of the precision matrix, i.e. ","element":"span"},{"text":"¯","element":"span"},{"text":"s ","element":"span"},{"text":"= ","element":"span"},{"text":"p","element":"span"},{"text":", we can still get consistency but for","element":"span"}],[{"style":{"width":"84%"},"width":1575,"height":231,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/19-22.png","element":"img"}],[{"text":"If ","element":"span"},{"style":{"height":20.94},"width":187.48,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-0.png","element":"img","alt":" 1′pΣ−1y µ ≤","inline":true,"padRight":true},{"text":"0, the Sharpe Ratio is minimized, as shown on p.503 of ","element":"span"},{"href":"#id-1","text":"Maller and Turkington ","element":"a"},{"href":"#id-1","text":"(2002)","element":"a"},{"text":". The ","element":"span"},{"text":"new maximum Sharpe Ratio in the case when ","element":"span"},{"style":{"height":20.95},"width":193.72,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-1.png","element":"img","alt":" 1′pΣ−1y µ ≤","inline":true,"padRight":true},{"text":"0 is in Theorem 2.1 of ","element":"span"},{"href":"#id-1","text":"Maller and Turkington ","element":"a"},{"href":"#id-1","text":"(2002)","element":"a"},{"text":". The square of the maximum Sharpe Ratio when ","element":"span"},{"style":{"height":20.94},"width":187.48,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-2.png","element":"img","alt":" 1′pΣ−1y µ ≤","inline":true,"padRight":true},{"text":"0 is given in (33).","element":"span"}],[{"text":"An estimator in this case is","element":"span"}],[{"style":{"width":"66%"},"width":1248,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-3.png","element":"img"}],[{"text":"The optimal portfolio allocation for such a case is given in (2.10) of ","element":"span"},{"href":"#id-1","text":"Maller and Turkington ","element":"a"},{"href":"#id-1","text":"(2002)","element":"a"},{"text":", and shown in ","element":"span"},{"style":{"height":12.3},"width":74.08,"height":30.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-4.png","element":"img","alt":" wc,2","inline":true,"padRight":true},{"text":"here in Section 4.2. The limit for such estimators when the number of assets is fixed (","element":"span"},{"text":"p ","element":"span"},{"text":"fixed) is given in Theorems 3.1b-c of ","element":"span"},{"href":"#id-2","text":"Maller et al. ","element":"a"},{"href":"#id-2","text":"(2016)","element":"a"},{"text":".","element":"span"}],[{"id":"id-62","style":{"width":"100%"},"width":1874,"height":320,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-5.png","element":"img"}],[{"text":"1. In Theorem ","element":"span"},{"href":"#id-62","text":"6, ","element":"a"},{"text":"we allow ","element":"span"},{"text":"p > n","element":"span"},{"text":", and time-series data are allowed, unlike the iid or normal return cases","element":"span"}],[{"style":{"width":"42%"},"width":789,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-6.png","element":"img"}],[{"text":"2. Case of non-sparse precision matrix proceeds in the same way as Remark 2 of Theorem ","element":"span"},{"href":"#id-63","text":"5. ","element":"a"},{"text":"To have","element":"span"}],[{"style":{"width":"51%"},"width":962,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-7.png","element":"img"}],[{"text":"We provide an estimate that takes into account uncertainties about the term ","element":"span"},{"style":{"height":20.94},"width":145.12,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-8.png","element":"img","alt":" 1′pΣ−1y µ","inline":true},{"text":". Note that the ","element":"span"},{"text":"term can be consistently estimated, as shown in Lemma ","element":"span"},{"href":"#id-64","text":"B.3 ","element":"a"},{"text":"in Supplement B. A practical estimate for a maximum Sharpe Ratio that will be consistent is:","element":"span"}],[{"style":{"width":"42%"},"width":793,"height":113,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-9.png","element":"img"}],[{"text":"where we excluded the case of ","element":"span"},{"style":{"height":17.23},"width":97.12,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-10.png","element":"img","alt":" 1′p�Γ�µ","inline":true,"padRight":true},{"text":"= 0 in the estimator. That specific scenario is very restrictive in terms ","element":"span"},{"text":"of returns and variance. Note that under a mild assumption, when ","element":"span"},{"style":{"height":17.04},"width":145.24,"height":42.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-11.png","element":"img","alt":" 1′pΓµ >","inline":true,"padRight":true},{"text":"0, we have ","element":"span"},{"style":{"height":17.04},"width":145.24,"height":42.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-12.png","element":"img","alt":" 1′p�Γ�µ >","inline":true,"padRight":true},{"text":"0, and ","element":"span"},{"text":"when ","element":"span"},{"style":{"height":17.23},"width":139.48,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-13.png","element":"img","alt":" 1′pΓµ <","inline":true,"padRight":true},{"text":"0, we have ","element":"span"},{"style":{"height":17.23},"width":139,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-14.png","element":"img","alt":" 1′p�Γ�µ <","inline":true,"padRight":true},{"text":"0 with probability approaching one in the proof of Theorem ","element":"span"},{"href":"#id-65","text":"7. ","element":"a"},{"text":"Note that ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-15.png","element":"img","alt":"Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-16.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":".","element":"span"}],[{"id":"id-65","text":"Theorem 7. ","element":"span"},{"text":"Under Assumptions ","element":"span"},{"href":"#id-38","text":"1-","element":"a"},{"href":"#id-41","text":"4,","element":"a"},{"href":"#id-48","text":"6,","element":"a"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i), ","element":"span"},{"href":"#id-58","text":"8, ","element":"a"},{"text":"with ","element":"span"},{"style":{"height":15.76},"width":312.28,"height":39.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-17.png","element":"img","alt":" AD − F 2 ≥ C1 >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":", where ","element":"span"},{"style":{"height":13.11},"width":44.32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-18.png","element":"img","alt":" C1","inline":true,"padRight":true},{"text":"is a positive constant, and assuming ","element":"span"},{"style":{"height":18.43},"width":411.16,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-19.png","element":"img","alt":" |1′pΓµ|/p ≥ C > 2ǫ >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":", with a sufficiently small positive ","element":"span"},{"style":{"height":9.6},"width":65.56,"height":24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-20.png","element":"img","alt":" ǫ >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":", and ","element":"span"},{"text":"C ","element":"span"},{"text":"being a positive","element":"span"}],[{"style":{"width":"67%"},"width":1261,"height":240,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-21.png","element":"img"}],[{"text":"1. In the case of ","element":"span"},{"text":"p > n","element":"span"},{"text":", we only consider consistency since standard central limit theorems (apart from","element":"span"}],[{"style":{"width":"94%"},"width":1771,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/20-22.png","element":"img"}],[{"style":{"width":"94%"},"width":1770,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-0.png","element":"img"}],[{"text":"2. The case of non-sparse precision matrix with ¯","element":"span"},{"text":"s ","element":"span"},{"text":"= ","element":"span"},{"text":"p ","element":"span"},{"text":"proceeds in the same way as in Remark 2 after","element":"span"}],[{"style":{"width":"97%"},"width":1821,"height":925,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-1.png","element":"img"}],[{"text":"4.3 ","element":"span"},{"text":"Maximum Out-of-Sample Sharpe Ratio","element":"span"}],[{"text":"This section analyzes the maximum out of Sharpe Ratio that is considered in ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019)","element":"a"},{"text":". To obtain that formula, we need the optimal calculation of the weights of the portfolio. The optimization of the portfolio","element":"span"}],[{"text":"weights is formulated as","element":"span"}],[{"style":{"width":"72%"},"width":1361,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-2.png","element":"img"}],[{"text":"where we maximize the return subject to a specified positive and finite risk constraint, ","element":"span"},{"style":{"height":14.16},"width":88.12,"height":35.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-3.png","element":"img","alt":" σ2 >","inline":true,"padRight":true},{"text":"0. Equation (A.2) of ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019) ","element":"a"},{"text":"defines the estimated maximum out-of-sample ratio when ","element":"span"},{"text":"p < n","element":"span"},{"text":", with the inverse of the sample covariance matrix, ","element":"span"},{"style":{"height":25.55},"width":74.08,"height":63.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-4.png","element":"img","alt":"�Σ−1y","inline":true,"padRight":true},{"text":"= [ ","element":"span"},{"style":{"height":19.5},"width":264.16,"height":48.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-5.png","element":"img","alt":"1n�nt=1 yty′t]−1","inline":true,"padRight":true},{"text":"used as an estimator for the precision matrix ","element":"span"},{"text":"estimate:","element":"span"}],[{"style":{"width":"29%"},"width":557,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-6.png","element":"img"}],[{"text":"The theoretical version is written as, by definition of ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-7.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-8.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":",","element":"span"}],[{"style":{"width":"15%"},"width":281,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-9.png","element":"img"}],[{"text":"Then, equation (1.1) of ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019) ","element":"a"},{"text":"shows that when ","element":"span"},{"style":{"height":16},"width":199.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/21-10.png","element":"img","alt":" p/n → r1 ∈","inline":true,"padRight":true},{"text":"(0","element":"span"},{"text":", ","element":"span"},{"text":"1), the above plug-in maximum out-of-sample ratio cannot consistently estimate the theoretical version. The optimal weights of a portfolio are given in (2.3) of ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019) ","element":"a"},{"text":"in an out-of-sample context given a risk level. This comes from maximizing the expected portfolio return subject to its variance being constrained by the square of the risk, where this","element":"span"}],[{"style":{"width":"57%"},"width":1081,"height":177,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-0.png","element":"img"}],[{"text":"The estimates that we will use","element":"span"}],[{"style":{"width":"38%"},"width":715,"height":147,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-1.png","element":"img"}],[{"text":"Our maximum out-of-sample Sharpe Ratio estimate using the nodewise estimate ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-2.png","element":"img","alt":"�Γ","inline":true,"padRight":true},{"text":"is:","element":"span"}],[{"style":{"width":"40%"},"width":765,"height":137,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-3.png","element":"img"}],[{"text":"Below we provide a sparsity assumption for the case of maximum out of sample Sharpe Ratio.","element":"span"}],[{"id":"id-66","style":{"width":"67%"},"width":1272,"height":447,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-4.png","element":"img"}],[{"text":"1. Note that p.4353 of ","element":"span"},{"href":"#id-15","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-15","text":"(2017) ","element":"a"},{"text":"shows that the maximum out-of-sample Sharpe Ratio is","element":"span"}],[{"style":{"width":"94%"},"width":1771,"height":244,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-5.png","element":"img"}],[{"text":"2. We cannot have ","element":"span"},{"text":"p > n ","element":"span"},{"text":"in this theorem, due to Assumption ","element":"span"},{"href":"#id-66","text":"9, ","element":"a"},{"text":"this shows the difficulty of maximum out","element":"span"}],[{"style":{"width":"94%"},"width":1770,"height":111,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-6.png","element":"img"}],[{"text":"3. ","element":"span"},{"style":{"height":14},"width":144.28,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-7.png","element":"img","alt":" p¯sln = o","inline":true},{"text":"(1) can be also obtained in non-sparse precision matrix, although the conditions will be more","element":"span"}],[{"style":{"width":"71%"},"width":1331,"height":244,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-8.png","element":"img"}],[{"text":"4. The case of large non-negative weights can be handled with our analysis. This is the case of growing","element":"span"}],[{"style":{"width":"94%"},"width":1771,"height":181,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/22-9.png","element":"img"}],[{"text":"4.4 ","element":"span"},{"text":"Portfolio Estimation Based Sharpe Ratio Analysis","element":"span"}],[{"text":"In this section for the scenarios we considered in Sections 4.1-4.2, we form the estimate of the portfolio weights and substitute that into the Sharpe Ratio. To understand the effects of only portfolio estimation for consistent estimation of Sharpe Ratio, we keep ","element":"span"},{"style":{"height":15.5},"width":94.72,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-0.png","element":"img","alt":" µ, Σy","inline":true,"padRight":true},{"text":"as constants in Sharpe Ratio estimates. We start with","element":"span"}],[{"text":"GMV portfolio. The estimated portfolio weights are","element":"span"}],[{"style":{"width":"14%"},"width":271,"height":114,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-1.png","element":"img"}],[{"text":"The Sharpe Ratio estimate of this portfolio is:","element":"span"}],[{"style":{"width":"42%"},"width":804,"height":157,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-2.png","element":"img"}],[{"text":"The optimized-target population Sharpe Ratio is given in ","element":"span"},{"href":"#id-59","text":"(26)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"69%"},"width":1305,"height":227,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-3.png","element":"img"}],[{"text":"Now we consider the Sharpe Ratio based on Markowitz portfolio. The estimated portfolio weights are","element":"span"}],[{"style":{"width":"45%"},"width":852,"height":99,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-4.png","element":"img"}],[{"text":"These are estimates by plugging in terms in equation ","element":"span"},{"href":"#id-67","text":"(28)","element":"a"},{"text":". Denote the Sharpe Ratio based on portfolio","element":"span"}],[{"text":"weight estimates","element":"span"}],[{"style":{"width":"24%"},"width":455,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-5.png","element":"img"}],[{"text":"The optimal Sharpe Ratio is in ","element":"span"},{"href":"#id-68","text":"(31) ","element":"a"},{"text":"in this case.","element":"span"}],[{"style":{"width":"99%"},"width":1872,"height":300,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-6.png","element":"img"}],[{"text":"In case of constrained maximum Sharpe Ratio in section 4.2, when ","element":"span"},{"style":{"height":20.94},"width":190.84,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-7.png","element":"img","alt":" 1′pΣ−1y µ >","inline":true,"padRight":true},{"text":"0, we can establish the","element":"span"}],[{"text":"portfolio weight estimates","element":"span"}],[{"style":{"width":"13%"},"width":244,"height":108,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-8.png","element":"img"}],[{"text":"Constrained maximum Sharpe Ratio estimate when ","element":"span"},{"style":{"height":20.95},"width":187.48,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-9.png","element":"img","alt":" 1′pΣ−1y µ >","inline":true,"padRight":true},{"text":"0 is:","element":"span"}],[{"style":{"width":"39%"},"width":732,"height":135,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/23-10.png","element":"img"}],[{"text":"The optimal Sharpe Ratio in this case is in ","element":"span"},{"href":"#id-61","text":"(32)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"68%"},"width":1291,"height":240,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-0.png","element":"img"}],[{"text":"The constrained maximum Sharpe Ratio weights when ","element":"span"},{"style":{"height":20.94},"width":187.48,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-1.png","element":"img","alt":" 1′pΣ−1y µ ≤","inline":true,"padRight":true},{"text":"0 are more complicated as seen in ","element":"span"},{"style":{"height":9.9},"width":48.08,"height":24.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-2.png","element":"img","alt":" wc","inline":true,"padRight":true},{"text":"in Section 4.2. The estimate is:","element":"span"}],[{"style":{"width":"29%"},"width":559,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-3.png","element":"img"}],[{"text":"with","element":"span"}],[{"style":{"width":"44%"},"width":838,"height":272,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-4.png","element":"img"}],[{"text":"Note that maximum Sharpe Ratio in this second constrained case is:","element":"span"}],[{"style":{"width":"24%"},"width":456,"height":131,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-5.png","element":"img"}],[{"text":"Using ˆ","element":"span"},{"style":{"height":12.31},"width":74.08,"height":30.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-6.png","element":"img","alt":"wc,2","inline":true,"padRight":true},{"text":"poses several challenges. Taking ","element":"span"},{"style":{"height":12},"width":129.76,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-7.png","element":"img","alt":" δ → ∞","inline":true,"padRight":true},{"text":"to reach the optimal Sharpe Ratio is key but the rate may play a role and also the weights depend on ˆ","element":"span"},{"style":{"height":9.51},"width":86.6,"height":23.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-8.png","element":"img","alt":"umax","inline":true,"padRight":true},{"text":"term which depends on ˆ","element":"span"},{"style":{"height":9.51},"width":87.6,"height":23.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-9.png","element":"img","alt":"zmax","inline":true,"padRight":true},{"text":"that depends on precision matrix estimate ","element":"span"},{"text":"ˆ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-10.png","element":"img","alt":"Γ","inline":true},{"text":", mean estimate ˆ","element":"span"},{"style":{"height":10},"width":24,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-11.png","element":"img","alt":"µ","inline":true},{"text":", and estimate ","element":"span"},{"style":{"height":13.11},"width":114.32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/24-12.png","element":"img","alt":"�MSRc","inline":true,"padRight":true},{"text":"from section 4.2. So, given Theorems 2 and 6, we think that consistency is plausible. However, given the lengthy material in this paper, this is beyond the scope of our theoretical analysis. Hence, similar corollaries for Theorems 6-7 cannot be handled in this paper.","element":"span"}],[{"text":"An important fact that applies to all Corollaries here is that we can only have ","element":"span"},{"text":"p < n ","element":"span"},{"text":"case, as discussed ","element":"span"},{"id":"id-30","text":"in Remark 3 of Theorem 8.","element":"span"}]]},{"heading":"5 Simulations","paragraphs":[[{"text":"5.1 ","element":"span"},{"text":"Models and Implementation Details","element":"span"}],[{"text":"In this section, we compare the nodewise regression with several models in a simulation exercise. The two aims of the exercise are to determine whether our method achieves consistency and how our method performs compared to others in the estimation of the constrained maximum Sharpe Ratio, the out-of-sample maximum Sharpe Ratio, and the Sharpe Ratio in global minimum-variance and Markowitz mean-variance portfolios.","element":"span"}],[{"text":"The other methods that are used widely in the literature and benefit from high-dimensional techniques are the principal orthogonal complement thresholding (POET) from ","element":"span"},{"href":"#id-69","text":"Fan et al. ","element":"a"},{"href":"#id-69","text":"(2013)","element":"a"},{"text":", the nonlinear shrinkage (NL-LW) and the single factor nonlinear shrinkage (SF-NL-LW) from ","element":"span"},{"href":"#id-15","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-15","text":"(2017)","element":"a"},{"text":", and the maximum Sharpe Ratio estimated and sparse regression (MAXSER) from ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019)","element":"a"},{"text":". All models except for the MAXSER are plug-in estimators, where the first step is to estimate the precision/covariance matrix, and the second step is to plug-in the estimate in the desired equation.","element":"span"}],[{"text":"The POET uses principal components to estimate the covariance matrix and allows some eigenvalues of ","element":"span"},{"style":{"height":13.11},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-0.png","element":"img","alt":"Σn","inline":true,"padRight":true},{"text":"to be spiked and grow at a rate ","element":"span"},{"text":"O","element":"span"},{"text":"(","element":"span"},{"text":"p","element":"span"},{"text":"), which allows common and idiosyncratic components to be identified via principal components analysis and can consistently estimate the space spanned by the eigenvectors of ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-1.png","element":"img","alt":"Σn","inline":true},{"text":". However, ","element":"span"},{"href":"#id-69","text":"Fan et al. ","element":"a"},{"href":"#id-69","text":"(2013) ","element":"a"},{"text":"point out that the absolute convergence rate of the model is not satisfactory for estimating ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-2.png","element":"img","alt":" Σn","inline":true},{"text":", and consistency can only be achieved in terms of the relative error matrix.","element":"span"}],[{"text":"Nonlinear shrinkage is a method that individually determines the amount of shrinkage of each eigenvalue in the covariance matrix for a particular loss function. The main aim is to increase the value of the lowest eigenvalues and decrease the largest eigenvalues to stabilize the high-dimensional covariance matrix. This nonlinear method is a very novel and excellent idea. ","element":"span"},{"href":"#id-15","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-15","text":"(2017) ","element":"a"},{"text":"propose a function that captures the objective of an investor using portfolio selection. As a result, they have an optimal estimator of the covariance matrix for portfolio selection for many assets. The SF-NL-LW method extracts a single factor structure from the data before estimating the covariance matrix, which is simply an equal-weighted portfolio with all assets.","element":"span"}],[{"text":"Finally, the MAXSER starts with estimating the adjusted squared maximum Sharpe Ratio used in a penalized regression to obtain the portfolio weights. Of all the discussed models, the MAXSER is the only one that does not estimate the precision matrix in a plug-in estim","element":"span"},{"href":"#id-15","text":"ator of the maxim","element":"a"},{"href":"#id-15","text":"um Sh","element":"a"},{"text":"arpe Ratio.","element":"span"}],[{"text":"Regarding implementation, the POET and both models from ","element":"span"},{"href":"#id-15","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-15","text":"(2017) ","element":"a"},{"text":"are available in the R packages POET ","element":"span"},{"href":"#id-70","text":"Fan et al. ","element":"a"},{"href":"#id-70","text":"(2016) ","element":"a"},{"text":"and nlshrink ","element":"span"},{"href":"#id-71","text":"Ramprasad ","element":"a"},{"href":"#id-71","text":"(2016)","element":"a"},{"text":". The SF-NL-LW needs some minor adjustments following the procedures described in ","element":"span"},{"href":"#id-15","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-15","text":"(2017)","element":"a"},{"text":". For the MAXSER, we follow the steps for the non-factor case in ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019)","element":"a"},{"text":", and we use the package lars ","element":"span"},{"href":"#id-72","text":"(Hastie and Efron ","element":"a"},{"href":"#id-72","text":"(2013)","element":"a"},{"text":") for the penalized regression estimation. We estimate the nodewise regression following the steps in Section 3.2 using the glmnet package ","element":"span"},{"href":"#id-73","text":"Friedman et al. ","element":"a"},{"href":"#id-73","text":"(2010) ","element":"a"},{"text":"for penalized regressions. We used two alternatives to select the regularization parameter ","element":"span"},{"style":{"height":10.8},"width":23,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-3.png","element":"img","alt":" λ","inline":true},{"text":", a 10-fold cross validation (CV), and the generalized information criterion (GIC) from ","element":"span"},{"href":"#id-74","text":"Zhang et al. ","element":"a"},{"href":"#id-74","text":"(2010)","element":"a"},{"text":".","element":"span"}],[{"text":"The GIC procedure starts by fitting ","element":"span"},{"style":{"height":13.44},"width":38.92,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-4.png","element":"img","alt":" �γj","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-75","text":"(12) ","element":"a"},{"text":"for a range of ","element":"span"},{"style":{"height":15.5},"width":36.04,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-5.png","element":"img","alt":" λj","inline":true,"padRight":true},{"text":"that goes from the intercept-only model to the largest feasible model. This is automatically done by the glmnet package. Then, for the GIC procedure,","element":"span"}],[{"text":"we calculate the information criterion for a given ","element":"span"},{"style":{"height":15.5},"width":36.04,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-6.png","element":"img","alt":" λj","inline":true,"padRight":true},{"text":"among the ranges of all possible tuning parameters","element":"span"}],[{"style":{"width":"72%"},"width":1364,"height":85,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-7.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":16.71},"width":135.88,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-8.png","element":"img","alt":" SSR(λj","inline":true},{"text":") is the sum squared error for a given ","element":"span"},{"style":{"height":16.71},"width":137.32,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-9.png","element":"img","alt":" λj, q(λj","inline":true},{"text":") is the number of variables, given ","element":"span"},{"style":{"height":15.51},"width":50.84,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-10.png","element":"img","alt":" λj,","inline":true,"padRight":true},{"text":"in the model that is nonzero, and ","element":"span"},{"text":"p ","element":"span"},{"text":"is the number of assets. The last step is to select the model with the smallest GIC. Once this is done for all assets ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"text":", . . . , p","element":"span"},{"text":", we can proceed to obtain ","element":"span"},{"style":{"height":13.1},"width":92.64,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-11.png","element":"img","alt":"�ΓGIC","inline":true},{"text":".","element":"span"}],[{"text":"For the CV procedure, we split the sample into ","element":"span"},{"text":"k ","element":"span"},{"text":"subsamples and fit the model for a range of ","element":"span"},{"style":{"height":15.5},"width":36.04,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-12.png","element":"img","alt":" λj","inline":true,"padRight":true},{"text":"as in the GIC procedure. However, we will fit models in the subsamples. We always estimate the models in ","element":"span"},{"style":{"height":10.8},"width":60.76,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/25-13.png","element":"img","alt":" k −","inline":true,"padRight":true},{"text":"1 subsamples, leaving one subsample as a test sample, where we compute the mean squared error (MSE). After repeating the procedure using all ","element":"span"},{"text":"k ","element":"span"},{"text":"subsamples as a test, we finally compute the average MSE across all subsamples and select the ","element":"span"},{"style":{"height":15.5},"width":36.04,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-0.png","element":"img","alt":" λj","inline":true,"padRight":true},{"text":"for each asset ","element":"span"},{"text":"j ","element":"span"},{"text":"that yields the smallest average MSE. We can then use the estimated ","element":"span"},{"style":{"height":13.63},"width":38.92,"height":34.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-1.png","element":"img","alt":" �γj","inline":true,"padRight":true},{"text":"to obtain ","element":"span"},{"style":{"height":13.11},"width":77.8,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-2.png","element":"img","alt":"�ΓCV","inline":true,"padRight":true},{"text":".","element":"span"}],[{"text":"5.2 ","element":"span"},{"text":"Data Generation Process and Results","element":"span"}],[{"text":"The DGP is based on a simplified version of the factor DGP in ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019)","element":"a"},{"text":", for ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-3.png","element":"img","alt":", · · · , p","inline":true},{"text":":","element":"span"}],[{"style":{"width":"61%"},"width":1159,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-4.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":13.44},"width":37.96,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-5.png","element":"img","alt":" yj","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":14.64},"width":44.36,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-6.png","element":"img","alt":" f k","inline":true,"padRight":true},{"text":"are the monthly asset returns of asset ","element":"span"},{"text":"j","element":"span"},{"text":", factor returns of factor ","element":"span"},{"text":"k ","element":"span"},{"text":"respectively, ","element":"span"},{"style":{"height":15.9},"width":62.12,"height":39.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-7.png","element":"img","alt":" βj,k","inline":true,"padRight":true},{"text":"are the individual stock sensitivities to the factors, and ","element":"span"},{"style":{"height":13.9},"width":124.84,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-8.png","element":"img","alt":" αj + ej","inline":true,"padRight":true},{"text":"represent the idiosyncratic component of each stock. We start with two specifications that correspond to two tables. Table 1 corresponds to 1 factor: excess return of the market portfolio, hence ","element":"span"},{"text":"K ","element":"span"},{"text":"= 1, and Table 2 corresponds to 3 factors from the Fama & French three factors, ","element":"span"},{"text":"K ","element":"span"},{"text":"= 3. ","element":"span"},{"style":{"height":7.6},"width":16,"height":19,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-9.png","element":"img","alt":"3","inline":true,"padRight":true},{"text":"Let ","element":"span"},{"style":{"height":13.44},"width":45.32,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-10.png","element":"img","alt":" µf","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":15.5},"width":50.12,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-11.png","element":"img","alt":" Σf","inline":true,"padRight":true},{"text":"be the factors’ sample mean and covariance matrix. The ","element":"span"},{"style":{"height":14.4},"width":23,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-12.png","element":"img","alt":"β","inline":true},{"text":", and ","element":"span"},{"style":{"height":6.8},"width":26,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-13.png","element":"img","alt":" α","inline":true,"padRight":true},{"text":"and covariance matrix of residuals: ","element":"span"},{"style":{"height":13.11},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-14.png","element":"img","alt":"�Σn","inline":true,"padRight":true},{"text":"are estimated using a simple least-squares regression using returns from the S&P500 stocks that were part of the index in the entire period from 2008 to 2017. In each simulation, we randomly select ","element":"span"},{"text":"p ","element":"span"},{"text":"stocks from the pool with replacement because our simulations require more than the total number of available stocks. We then used the selected stocks to generate individual returns with covariance matrix of errors: ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-15.png","element":"img","alt":"�Σn","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":16},"width":290.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-16.png","element":"img","alt":"�Σn ⊙ T oeplitz(ρ","inline":true},{"text":"), where ","element":"span"},{"style":{"height":16},"width":185.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-17.png","element":"img","alt":" T oeplitz(ρ","inline":true},{"text":") is the ","element":"span"},{"style":{"height":11.2},"width":91.12,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-18.png","element":"img","alt":" p × p","inline":true,"padRight":true},{"text":"matrix of","element":"span"}],[{"text":"the form, for (i,j)th element","element":"span"}],[{"style":{"width":"21%"},"width":402,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-19.png","element":"img"}],[{"text":"with ","element":"span"},{"style":{"height":10},"width":21,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-20.png","element":"img","alt":" ρ","inline":true,"padRight":true},{"text":"= 0","element":"span"},{"text":".","element":"span"},{"text":"25","element":"span"},{"text":", ","element":"span"},{"text":"0","element":"span"},{"text":".","element":"span"},{"text":"5","element":"span"},{"text":", ","element":"span"},{"text":"0","element":"span"},{"text":".","element":"span"},{"text":"75. ","element":"span"},{"style":{"height":12.8},"width":121.4,"height":32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-21.png","element":"img","alt":" A ⊙ B","inline":true,"padRight":true},{"text":"represents element by element multiplication (Hadamard product) of two square matrices ","element":"span"},{"text":"A","element":"span"},{"text":", ","element":"span"},{"text":"B ","element":"span"},{"text":"of the same dimensions.","element":"span"}],[{"text":"Tables 1-2 show the results. ","element":"span"},{"text":"The values in each cell show the average absolute estimation error for estimating the square of the Sharpe Ratio. Each eight-column block in the table shows the results for a different sample size. In each of these blocks, the first four columns are for ","element":"span"},{"text":"p ","element":"span"},{"text":"= ","element":"span"},{"text":"n/","element":"span"},{"text":"2, and the last four columns are for ","element":"span"},{"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"text":"n/","element":"span"},{"text":"2. MSR, MSR-OOS, GMV-SR, and MKW-SR are the constrained maximum Sharpe Ratio, the out-of-sample maximum Sharpe Ratio, the Sharpe Ratio from the global minimum-variance portfolio, and the Sharpe Ratio from the Markowitz portfolio with target returns set to 1%, respectively. Therefore, there are four categories to evaluate the different estimates. The MAXSER risk constraint was set to 0.04 following ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019)","element":"a"},{"text":". We ran 100 iterations in each simulation setup. All bold-face entries in tables show category champions.","element":"span"}],[{"text":"Both Tables show that our method achieves consistency, as shown in Theorems. Analyzing ","element":"span"},{"text":"K ","element":"span"},{"text":"= 3, Table 2, with ","element":"span"},{"style":{"height":10},"width":21,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/26-22.png","element":"img","alt":" ρ","inline":true,"padRight":true},{"text":"= 0","element":"span"},{"text":".","element":"span"},{"text":"50 OOS-MSR (the Out Of Sample-Maximum Sharpe Ratio), and Generalized Information Criterion tuning parameter selection, the estimation error at ","element":"span"},{"text":"p ","element":"span"},{"text":"= ","element":"span"},{"text":"n/","element":"span"},{"text":"2, with ","element":"span"},{"text":"n ","element":"span"},{"text":"= 100 is 1.244, and this error declines to 0.585 at ","element":"span"},{"text":"p ","element":"span"},{"text":"= ","element":"span"},{"text":"n/","element":"span"},{"text":"2","element":"span"},{"text":", n ","element":"span"},{"text":"= 200, and then declines to 0.321 at ","element":"span"},{"text":"p ","element":"span"},{"text":"= ","element":"span"},{"text":"n/","element":"span"},{"text":"2","element":"span"},{"text":", n ","element":"span"},{"text":"= 400. So with jointly increasing ","element":"span"},{"text":"n, p ","element":"span"},{"text":"we show that the error declines, as predicted by our theorems. The main reason is that errors grow with","element":"span"},{"style":{"height":19.2},"width":40,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/27-0.png","element":"img","alt":"�","inline":true},{"text":"ln(","element":"span"},{"text":"p","element":"span"},{"text":"), but decline with ","element":"span"},{"style":{"height":14.16},"width":72.16,"height":35.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/27-1.png","element":"img","alt":" n1/2","inline":true,"padRight":true},{"text":"rate. So the number of assets in a large portfolio only affects the error logarithmically. To give another example from Table 2, with ","element":"span"},{"style":{"height":10},"width":21,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/27-2.png","element":"img","alt":" ρ","inline":true,"padRight":true},{"text":"= 0","element":"span"},{"text":".","element":"span"},{"text":"50, GMV-SR (Global Minimum Variance-Sharpe Ratio) and Cross Validation tuning parameter selection with our method, the estimation error is 0.352 with ","element":"span"},{"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"text":"n/","element":"span"},{"text":"2","element":"span"},{"text":", n ","element":"span"},{"text":"= 100, then this error declines to 0.213 with ","element":"span"},{"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"text":"n/","element":"span"},{"text":"2","element":"span"},{"text":", n ","element":"span"},{"text":"= 200, and further declines to 0.143 with ","element":"span"},{"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"text":"n/","element":"span"},{"text":"2","element":"span"},{"text":", n ","element":"span"},{"text":"= 400.","element":"span"}],[{"text":"Next, we consider which method achieves the smallest estimation error. ","element":"span"},{"text":"Table 1 favors SF-NL-LW (Single Factor Non-Linear Shrinkage of Ledoit-Wolf) since it has a single factor built into this subset of their technique. We get better results in Table 2 (","element":"span"},{"text":"K ","element":"span"},{"text":"= 3) for our methods. We have 4 categories: MSR, OOS-MSR, GMV-SR, MKW-SR corresponding to our Theorems 3-9. There are nine possibilities in each category (given we are either at ","element":"span"},{"text":"p ","element":"span"},{"text":"= ","element":"span"},{"text":"n/","element":"span"},{"text":"2 or ","element":"span"},{"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"text":"n/","element":"span"},{"text":"2), representing three choices of sample sizes paired with 3 choices of different Toeplitz structures.","element":"span"}],[{"text":"We analyze each category. We start with Table 1. With ","element":"span"},{"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"text":"n/","element":"span"},{"text":"2 in OOS-MSR our NW-GIC method has the smallest errors 8 out of 9 categories. When ","element":"span"},{"text":"p ","element":"span"},{"text":"= ","element":"span"},{"text":"n/","element":"span"},{"text":"2, MAXSER method dominates all others since it is specifically factor model designed to handle OOS-MSR with ","element":"span"},{"text":"p < n","element":"span"},{"text":". In GMV-SR, with ","element":"span"},{"text":"p ","element":"span"},{"text":"= ","element":"span"},{"text":"n/","element":"span"},{"text":"2, in 3 out of 9 cases, our NW-GIC dominates. In the other categories in Table 1, non-linear shrinkage method of Ledoit-Wolf (2017) does the best, but our methods come a very close second.","element":"span"}],[{"text":"In Table 2, with ","element":"span"},{"text":"K ","element":"span"},{"text":"= 3, our methods perform better than in Table 1. In the category of GMV-SR, with ","element":"span"},{"text":"p ","element":"span"},{"text":"= 3","element":"span"},{"text":"n/","element":"span"},{"text":"2, out of 9 possible configurations, our methods have the smallest error in 7 cases. Our methods dominate in the same category, with ","element":"span"},{"text":"p ","element":"span"},{"text":"= 0","element":"span"},{"text":".","element":"span"},{"text":"5","element":"span"},{"text":"n","element":"span"},{"text":", 5 out of 9 possibilities. In the case of the category of MKW-SR (Markowitz-Sharpe Ratio), our theorems predict that our methods may suffer from a number of factors. We see that non-linear shrinkage methods are the best, and our methods are the second best in this category. In the constrained maximum Sharpe Ratio, (MSR) non-linear shrinkage methods perform the best.","element":"span"}],[{"style":{"width":"110%"},"width":2716,"height":1061,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/28-0.png","element":"img"}],[{"style":{"width":"110%"},"width":2716,"height":1061,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/29-0.png","element":"img"}]]},{"heading":"6 Empirical Application","paragraphs":[[{"text":"For the empirical application, we use two subsamples. The first subsample uses data from January 1995 to December 2019 with an out-of-sample period from January 2005 to December 2019. We selected all stocks in the S&P 500 index for at least one month in the out-of-sample period and have data for the entire 1995-2019 period resulting in 382 stocks. The second subsample starts in January 1990 and ends in December 2019 with an out-of-sample period from January 2000 to December 2019. Using the same criterion as the first subsample, the number of stocks was 321, which is around 15% fewer than the first subsample. The objective is to have an out-of-sample competition between models, and we only estimated GMV and Markowitz portfolios for the plug-in estimators. The first out-of-sample period includes only the recession of 2008. The second out-of-sample period includes the recessions of 2000 and 2008, and the out-of-sample periods reflect recent history.","element":"span"}],[{"text":"The Markowitz return constraint ","element":"span"},{"style":{"height":10},"width":36.64,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-0.png","element":"img","alt":" ρ1","inline":true,"padRight":true},{"text":"is 0.8% per month, and the MAXSER risk constraint is 4%. In the low-dimensional experiment, we randomly select 50 stocks from the pool to estimate the models with the same stocks for all windows. We also experimented with 25 stocks but did not report them. That table is available from the authors on demand. In the high-dimensional case, we use all available stock","element":"span"},{"href":"#id-9","text":"s.","element":"a"}],[{"text":"We use a rolling window setup for the out-of-sample estimation of the Sharpe Ratio following ","element":"span"},{"href":"#id-9","text":"Callot et al. ","element":"a"},{"href":"#id-9","text":"(2021)","element":"a"},{"text":". Specifically, samples of size ","element":"span"},{"text":"n ","element":"span"},{"text":"are divided into in-sample (1 : ","element":"span"},{"style":{"height":9.1},"width":39,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-1.png","element":"img","alt":" nI","inline":true},{"text":") and out-of-sample (","element":"span"},{"style":{"height":9.1},"width":39,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-2.png","element":"img","alt":"nI","inline":true,"padRight":true},{"text":"+ 1 : ","element":"span"},{"text":"n","element":"span"},{"text":"). We start by estimating the portfolio ","element":"span"},{"style":{"height":11.44},"width":67.24,"height":28.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-3.png","element":"img","alt":" �wnI","inline":true,"padRight":true},{"text":"in the in-sample period and the out-of-sample portfolio returns ","element":"span"},{"style":{"height":13.25},"width":159.92,"height":33.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-4.png","element":"img","alt":"�w′nIynI+1","inline":true},{"text":". Then, we roll the window by one element (2 : ","element":"span"},{"style":{"height":9.1},"width":39,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-5.png","element":"img","alt":" nI","inline":true,"padRight":true},{"text":"+ 1) and form a new in-sample portfolio ","element":"span"},{"style":{"height":13.04},"width":102.8,"height":32.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-6.png","element":"img","alt":" �wnI+1","inline":true,"padRight":true},{"text":"and out-of-sample portfolio returns ","element":"span"},{"style":{"height":14.85},"width":193.52,"height":37.12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-7.png","element":"img","alt":" �w′nI+1ynI+2","inline":true},{"text":". This procedure is repeated until the end of the sample.","element":"span"}],[{"text":"The out-of-sample average return and variance without transaction costs are","element":"span"}],[{"style":{"width":"65%"},"width":1220,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-8.png","element":"img"}],[{"text":"We estimate the Sharpe Ratios with and without transaction costs. The transaction cost, ","element":"span"},{"text":"c","element":"span"},{"text":", is defined as 50 basis points following ","element":"span"},{"href":"#id-76","text":"DeMiguel et al. ","element":"a"},{"href":"#id-76","text":"(2007)","element":"a"},{"text":". Let ","element":"span"},{"style":{"height":13.31},"width":289.6,"height":33.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-9.png","element":"img","alt":" yP,t+1 = �w′tyt+1","inline":true,"padRight":true},{"text":"be the return of the portfolio in","element":"span"}],[{"text":"period ","element":"span"},{"text":"t ","element":"span"},{"text":"+ 1; in the presence of transaction costs, the returns will be defined as","element":"span"}],[{"style":{"width":"45%"},"width":856,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-10.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":20.93},"width":62.92,"height":52.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-11.png","element":"img","alt":" �w+t,j","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":11.5},"width":62.92,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-12.png","element":"img","alt":" �wt,j","inline":true},{"text":"(1 + ","element":"span"},{"style":{"height":16.7},"width":132.8,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-13.png","element":"img","alt":" yt+1,j)/","inline":true},{"text":"(1 + ","element":"span"},{"style":{"height":11.5},"width":105.6,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-14.png","element":"img","alt":" yt+1,P","inline":true},{"text":") and ","element":"span"},{"style":{"height":11.5},"width":54.28,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-15.png","element":"img","alt":" yt,j","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":11.5},"width":65.28,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-16.png","element":"img","alt":" yt,P","inline":true,"padRight":true},{"text":"are the excess returns of asset ","element":"span"},{"text":"j ","element":"span"},{"text":"and the portfolio ","element":"span"},{"text":"P ","element":"span"},{"text":"added to the risk-free rate. The adjustment made in ","element":"span"},{"style":{"height":20.74},"width":62.92,"height":51.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-17.png","element":"img","alt":" �w+t,j","inline":true,"padRight":true},{"text":"is because the portfolio at the end of the period ","element":"span"},{"text":"has changed compared to the portfolio at the beginning of the period.","element":"span"}],[{"text":"The Sharpe Ratio is calculated from the average return and the variance of the portfolio in the out-of-","element":"span"}],[{"text":"sample period","element":"span"}],[{"style":{"width":"11%"},"width":219,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/30-18.png","element":"img"}],[{"text":"The portfolio returns are replaced by the returns with transaction costs when we calculate the Sharpe Ratio with transaction costs.","element":"span"}],[{"text":"We use the same test as ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019) ","element":"a"},{"text":"to compare the models. Specifically,","element":"span"}],[{"style":{"width":"71%"},"width":1345,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/31-0.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":13.1},"width":118.44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/31-1.png","element":"img","alt":" SRNW","inline":true,"padRight":true},{"text":"is the Sharpe Ratio of our feasible nodewise model, which is tested against all remaining models. This is the ","element":"span"},{"href":"#id-77","text":"Jobson and Korkie ","element":"a"},{"href":"#id-77","text":"(1981) ","element":"a"},{"text":"test with ","element":"span"},{"href":"#id-78","text":"Memmel ","element":"a"},{"href":"#id-78","text":"(2003) ","element":"a"},{"text":"correction. We also considered the method of ","element":"span"},{"href":"#id-79","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-79","text":"(2008) ","element":"a"},{"text":"for testing the significance of the winner and using the equally weighted portfolio as a benchmark; the results were very similar and hence are not reported.","element":"span"}],[{"text":"We also include an equally weighted portfolio (EW). GMV-NW-GIC and GMV-NW-CV denote the nodewise method with GIC and cross validation tuning parameter choices, respectively, in the global minimum-variance portfolio (GMV).","element":"span"}],[{"text":"In each of our feasible nodewise models with GIC, CV, we either use a single-factor model (market as the only factor) or three-factor model. They are denoted GMV-NW-GIC-SF, GMV-NW-GIC-3F for the global minimum variance portfolio analyzed with feasible nodewise method and GIC criterion for tuning parameter choice and single and three-factor models, respectively. In the same way, we define GMV-NW-CV-SF, GMV-NW-CV-3F. We take GMV-NW-GIC-SF as the benchmark to test against all other methods since it generally does well in different preliminary forecasts.","element":"span"}],[{"text":"GMV-POET, GMV-NL-LW, and GMV-SF-NL-LW denote the POET, nonlinear shrinkage, and single-factor nonlinear shrinkage methods, respectively, which are described in the simulation section and also used in the global minimum-variance portfolio. The MAXSER is also used and explained in the simulation section. MW denotes the Markowitz mean-variance portfolio, and MW-NW-GIC-SF denotes the feasible nodewise method with GIC tuning parameter selection in the Markowitz portfolio with a single factor. All the other methods with MW headers are analogous and thus self-explanatory.","element":"span"}],[{"text":"The results are presented in Tables ","element":"span"},{"href":"#id-80","text":"3 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-81","text":"4. ","element":"a"},{"text":"Table ","element":"span"},{"href":"#id-80","text":"3 ","element":"a"},{"text":"shows the results for the 2005-2019 out-of-sample period. Feasible nodewise methods do well in terms of the Sharpe Ratio in Table ","element":"span"},{"href":"#id-80","text":"3. ","element":"a"},{"text":"For example, with transaction costs in the low-dimensional portfolio category, in terms of Sharpe Ratio (SR) (averaged over the out-of-sample time period), GMV-NW-GIC-SF is the best model. It has an SR of 0.210. In the case of high dimensional case with transaction costs in the same table, GMV-POET and our GMV-NW-GIC-SF virtually tie (difference in favor of POET in fourth decimal) at 0.214 for the Sharpe Ratio.","element":"span"}],[{"text":"If we were to analyze only the Markowitz portfolio in Table ","element":"span"},{"href":"#id-80","text":"3, ","element":"a"},{"text":"with transaction costs in high dimensions, MW-NW-GIC-SF has the highest SR of 0.211. Therefore, even in other subcategories of Markowitz portfolio, the feasible nodewise method dominates. Although statistical significance is not established, it is unclear that these significance tests have high power in our high-dimensional cases.","element":"span"}],[{"text":"Table ","element":"span"},{"href":"#id-81","text":"4 ","element":"a"},{"text":"shows the results for the out-of-sample January 2000-2019 subsample. ","element":"span"},{"text":"We see that feasible nodewise methods dominate all scenarios except for the low-dimensional case with transaction costs. In high dimensionality with transaction costs, GMV-NW-GIC-SF (Markowitz-nodewise-GIC) has an SR of 0.225, and the closest is GMV-POET with 0.204. Also, we experimented with two other out-sample periods of 2005-2017, 2000-2017, and the results are slightly better for our methods, and these can be shared on demand.","element":"span"}],[{"id":"id-80","text":"Table 3: Empirical Results – Out-of-Sample Period from Jan. 2005 to Dec. 2019","element":"figcaption","subtype":"caption"}],[{"style":{"width":"100%"},"width":1874,"height":858,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/32-0.png","element":"img"}],[{"text":"The table shows the Sharpe Ratio (SR), average returns (Avg), standard deviation (SD) and p-value of the ","element":"span"},{"href":"#id-77","text":"Jobson and Korkie ","element":"a"},{"href":"#id-77","text":"(1981) ","element":"a"},{"text":"test with ","element":"span"},{"href":"#id-78","text":"Memmel ","element":"a"},{"href":"#id-78","text":"(2003) ","element":"a"},{"text":"correction. We also applied the ","element":"span"},{"href":"#id-79","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-79","text":"(2008) ","element":"a"},{"text":"test with circular bootstrap, and the results were very similar; therefore we only report those of the first test in this table. The statistics were calculated from 180 rolling windows covering the period from Jan. 2005 to Dec. 2019, and the size of the estimation window was 120 observations.","element":"span"}],[{"text":"In Table ","element":"span"},{"href":"#id-82","text":"5, ","element":"a"},{"text":"we analyze turnover, leverage and maximum leverage (equations ","element":"span"},{"href":"#id-83","text":"(40)","element":"a"},{"text":", ","element":"span"},{"href":"#id-84","text":"(41) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-85","text":"(42)","element":"a"},{"text":", respectively) of the portfolios in Tables ","element":"span"},{"href":"#id-80","text":"3-","element":"a"},{"href":"#id-81","text":"4.","element":"a"}],[{"text":"The definitions are as follows for turnover:","element":"span"}],[{"id":"id-83","style":{"width":"63%"},"width":1192,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/32-1.png","element":"img"}],[{"text":"and leverage","element":"span"}],[{"id":"id-84","style":{"width":"64%"},"width":1207,"height":138,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/32-2.png","element":"img"}],[{"text":"and maximum leverage","element":"span"}],[{"id":"id-85","style":{"width":"68%"},"width":1274,"height":67,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/32-3.png","element":"img"}],[{"text":"It is clear that in Table ","element":"span"},{"href":"#id-82","text":"5 ","element":"a"},{"text":"in terms of turnover, leverage, maximum leverage, GMV-POET and GMV-NW-GIC-SF do well, with the best and close to best respectively if we discount EW portfolios.","element":"span"}],[{"text":"6.1 ","element":"span"},{"text":"Time Series of Sharpe Ratios and Turnover","element":"span"}],[{"text":"Figures ","element":"span"},{"href":"#id-86","text":"1 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-87","text":"2 ","element":"a"},{"text":"shows Global Minimum Variance results of the NW-GIF-SF, the POET and the SF-NL-LW models with transaction costs. The results were obtained through a 24 months rolling window with the","element":"span"}],[{"id":"id-81","text":"Table 4: Empirical Results – Out-of-Sample Period from Jan. 2000 to Dec. 2019","element":"figcaption","subtype":"caption"}],[{"style":{"width":"100%"},"width":1874,"height":854,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/33-0.png","element":"img"}],[{"text":"The table shows the Sharpe Ratio (SR), average returns (Avg), standard deviation (SD) and p-value of the ","element":"span"},{"href":"#id-77","text":"Jobson and Korkie ","element":"a"},{"href":"#id-77","text":"(1981) ","element":"a"},{"text":"test with ","element":"span"},{"href":"#id-78","text":"Memmel ","element":"a"},{"href":"#id-78","text":"(2003) ","element":"a"},{"text":"correction. We also applied the ","element":"span"},{"href":"#id-79","text":"Ledoit and Wolf ","element":"a"},{"href":"#id-79","text":"(2008) ","element":"a"},{"text":"test with circular bootstrap, and the results were very similar; therefore we only report those of the first test in this table. The statistics were calculated from 240 rolling windows covering the period from Jan. 2005 to Dec. 2019, and the size of the estimation window was 120 observations.","element":"span"}],[{"text":"out-of-sample returns from the 2000-2019 experiment, which yields time-series that start in 2002 and end in 2019 for the Sharpe Ratio and the turnover. The main conclusion from the figures is that Nodewise works better in terms of the Sharpe Ratio in deep recessions like the 2008 crisis, but Nonlinear Shrinkage and POET are superior when we have long periods of normality in the markets. Nodewise also delivers better Sharpe Ratios during the recovery of the crisis. On the turnover side, Nodewise and POET consistently have lower turnover than Nonlinear Shrinkage with POET being the overall lowest. However, during the 2008 crisis, especially in the high dimension setup, POET had a higher turnover than Nodewise.","element":"span"}],[{"id":"id-82","text":"Table 5: Turnover and Leverage","element":"figcaption","subtype":"caption"}],[{"style":{"width":"65%"},"width":1218,"height":1907,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/34-0.png","element":"img"}],[{"style":{"width":"78%"},"width":1466,"height":970,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/35-0.png","element":"img"}],[{"id":"id-86","text":"Figure 1: 24 months rolling Sharpe Ratio and turnover - Low Dimension with transaction costs","element":"figcaption","subtype":"caption"}]]},{"heading":"7 Conclusion","paragraphs":[[{"text":"We provide a hybrid factor model combined with nodewise regression method that can control for risk and obtain the maximum expected return of a large portfolio. Our result is novel and holds even when ","element":"span"},{"text":"p > n","element":"span"},{"text":". We allow for an increasing number of factors, with possible unbounded largest eigenvalue of the covariance matrix of errors. Sparsity is assumed on the precision matrix of errors rather than the covariance matrix of errors. We also show that the maximum out-of-sample Sharpe Ratio can be estimated consistently. Furthermore, we also develop a formula for the maximum Sharpe Ratio when the sum of the weights of the portfolio is one. A consistent estimate for the constrained case is also shown. Then, we extended our results to the consistent estimation of the Sharpe Ratios in two widely used portfolios in the literature. It will be essential to extend our results to more restrictions on portfolios.","element":"span"}]]},{"heading":"References","paragraphs":[[{"id":"id-208","text":"Abadir, K. and J. Magnus (2005). ","element":"span"},{"text":"Matrix Algebra","element":"span"},{"text":". Cambridge University Press.","element":"span"}],[{"id":"id-16","text":"Ao, M., Y. Li, and X. Zheng (2019). Approaching mean-variance efficiency for large portfolios. ","element":"span"},{"text":"Review of Financial Studies 32","element":"span"},{"text":", 2499–2540.","element":"span"}],[{"id":"id-25","text":"Barras, L., P. Gagliardini, and O. Scaillet (2021+). Skill, scale, and value creation in the mutual fund ","element":"span"},{"text":"industry. ","element":"span"},{"text":"Journal of Finance","element":"span"},{"text":". forthcoming.","element":"span"}],[{"style":{"width":"78%"},"width":1466,"height":970,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-0.png","element":"img"}],[{"id":"id-87","text":"Figure 2: 24 months rolling Sharpe Ratio and turnover - High Dimension with transaction costs","element":"figcaption","subtype":"caption"}],[{"id":"id-5","text":"Brito, D., M. Medeiros, and R. Ribeiro (2018). Forecasting large realized covariance matrices: The benefits","element":"span"}],[{"style":{"width":"61%"},"width":1149,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-1.png","element":"img"}],[{"id":"id-26","text":"Brodie, J., I. Daubechies, C. D. Mol, D. Giannone, and I. Loris (2009). ","element":"span"},{"text":"Sparse and stable Markowitz","element":"span"}],[{"style":{"width":"73%"},"width":1373,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-2.png","element":"img"}],[{"id":"id-9","text":"Callot, L., M. Caner, O. Onder, and E. Ulasan (2021). A nodewise regression approach to estimating large","element":"span"}],[{"style":{"width":"64%"},"width":1201,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-3.png","element":"img"}],[{"id":"id-8","text":"Caner, M. and A. Kock (2018). Asymptotically honest confidence regions for high dimensional parameters","element":"span"}],[{"style":{"width":"72%"},"width":1353,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-4.png","element":"img"}],[{"id":"id-27","text":"Chamberlain, G. and M. Rothschild (1983). Arbitrage, factor structure, and mean-variance analysis on large","element":"span"}],[{"style":{"width":"41%"},"width":770,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-5.png","element":"img"}],[{"id":"id-7","text":"Chang, J., Y. Qiu, Q. Yao, and T. Zou (2018). Confidence regions for entries of a large precision matrix.","element":"span"}],[{"style":{"width":"33%"},"width":636,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-6.png","element":"img"}],[{"id":"id-28","text":"DeMiguel, V., L. Garlappi, F. Nogales, and R. Uppal (2009). A generalized approach to portfolio optimiza-","element":"span"}],[{"style":{"width":"90%"},"width":1691,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-7.png","element":"img"}],[{"id":"id-76","text":"DeMiguel, V., L. Garlappi, and R. Uppal (2007). Optimal versus naive diversification: How inefficient is the","element":"span"}],[{"style":{"width":"68%"},"width":1291,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/36-8.png","element":"img"}],[{"id":"id-24","text":"Ding, Y., Y. Li, and X. Zheng (2021). ","element":"span"},{"text":"High-dimensional minimum variance portfolio estimation under","element":"span"}],[{"style":{"width":"60%"},"width":1124,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-0.png","element":"img"}],[{"id":"id-209","text":"Fan, J., Y. Fan, and J. Lv (2008). High-dimensional covariance matrix estimation using a factor model.","element":"span"}],[{"style":{"width":"36%"},"width":676,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-1.png","element":"img"}],[{"id":"id-4","text":"Fan, J., A. Furger, and D. Xiu (2016). Incorporating global industrial classification standard into portfolio","element":"span"}],[{"style":{"width":"98%"},"width":1837,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-2.png","element":"img"}],[{"id":"id-21","text":"Fan, J., Y. Li, and K. Yu (2012). Vast volatility matrix estimation using high frequency data for portfolio","element":"span"}],[{"style":{"width":"66%"},"width":1253,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-3.png","element":"img"}],[{"id":"id-17","text":"Fan, J., Y. Liao, and M. Mincheva (2011). High-dimensional covariance matrix estimation in approximate","element":"span"}],[{"style":{"width":"51%"},"width":955,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-4.png","element":"img"}],[{"id":"id-69","text":"Fan, J., Y. Liao, and M. Mincheva (2013). Large covariance estimation by thresholding principal orthogonal","element":"span"}],[{"style":{"width":"97%"},"width":1830,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-5.png","element":"img"}],[{"id":"id-70","text":"Fan, J., Y. Liao, and M. Mincheva (2016). ","element":"span"},{"text":"POET: Principal Orthogonal Complement Thresholding (POET)","element":"span"}],[{"style":{"width":"28%"},"width":539,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-6.png","element":"img"}],[{"id":"id-29","text":"Fan, J., Y. Liao, and X. Shi (2015). Risks of large portfolios. ","element":"span"},{"text":"Journal of Econometrics 186","element":"span"},{"text":", 367–387.","element":"span"}],[{"id":"id-3","text":"Fan, J., H. Liu, and W. Wang (2018). Large covariance estimation through elliptical factor models. ","element":"span"},{"text":"Annals","element":"span"}],[{"style":{"width":"25%"},"width":472,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-7.png","element":"img"}],[{"id":"id-10","text":"Fan, J., R. Masini, and M. Medeiros (2021). Bridging factor and sparse models. arxiv:2102.11341, arXiv.","element":"span"}],[{"id":"id-73","text":"Friedman, J., T. Hastie, and R. Tibshirani (2010). Regularization paths for generalized linear models via","element":"span"}],[{"style":{"width":"55%"},"width":1043,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-8.png","element":"img"}],[{"id":"id-11","text":"Gagliardini, P., E. Ossola, and O. Scaillet (2016). Time-varying risk premium in large cross-sectional equity","element":"span"}],[{"style":{"width":"35%"},"width":670,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-9.png","element":"img"}],[{"id":"id-13","text":"Gagliardini, P., E. Ossola, and O. Scaillet (2019). A diagnostic criterion for approximate factor structure.","element":"span"}],[{"style":{"width":"36%"},"width":676,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-10.png","element":"img"}],[{"id":"id-14","text":"Gagliardini, P., E. Ossola, and O. Scaillet (2020). Estimation of large dimensional conditional factor models","element":"span"}],[{"style":{"width":"47%"},"width":897,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-11.png","element":"img"}],[{"id":"id-19","text":"Garlappi, L., R. Uppal, and T. Wang (2007). Portfolio selection with parameter and model uncertainty: A","element":"span"}],[{"style":{"width":"56%"},"width":1065,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/37-12.png","element":"img"}],[{"id":"id-72","text":"Hastie, T. and B. Efron (2013). ","element":"span"},{"text":"lars: Least Angle Regression, Lasso and Forward Stagewise","element":"span"},{"text":". R package","element":"span"}],[{"style":{"width":"10%"},"width":195,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-0.png","element":"img"}],[{"id":"id-31","text":"Horn, R. and C. Johnson (2013). ","element":"span"},{"text":"Matrix Analysis","element":"span"},{"text":". Cambridge University Press.","element":"span"}],[{"id":"id-20","text":"Jagannathan, R. and T. Ma (2003). Risk reduction in large portfolios: Why imposing the wrong constraints","element":"span"}],[{"style":{"width":"42%"},"width":801,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-1.png","element":"img"}],[{"id":"id-77","text":"Jobson, J. D. and B. M. Korkie (1981). Performance hypothesis testing with the sharpe and treynor measures.","element":"span"}],[{"style":{"width":"37%"},"width":694,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-2.png","element":"img"}],[{"id":"id-22","text":"Kan, R. and G. Zhou (2007). Optimal portfolio choice with parameter uncertainty. ","element":"span"},{"text":"Journal of Financial","element":"span"}],[{"style":{"width":"27%"},"width":514,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-3.png","element":"img"}],[{"id":"id-18","text":"Lai, T., H. Xing, and Z. Chen (2011). Mean-variance portfolio optimization when means and covariances","element":"span"}],[{"style":{"width":"54%"},"width":1026,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-4.png","element":"img"}],[{"text":"Ledoit, O, M. and M. Wolf (2003). Improved estimation of the covariance matrix of stock returns with an","element":"span"}],[{"style":{"width":"71%"},"width":1337,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-5.png","element":"img"}],[{"text":"Ledoit, O, M. and M. Wolf (2004). A well conditioned estimator for large dimensional covariance matrices.","element":"span"}],[{"style":{"width":"42%"},"width":794,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-6.png","element":"img"}],[{"id":"id-15","text":"Ledoit, O, M. and M. Wolf (2017). Nonlinear shrinkage of the covariance matrix for portfolio selection:","element":"span"}],[{"style":{"width":"67%"},"width":1264,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-7.png","element":"img"}],[{"id":"id-79","text":"Ledoit, O. and M. Wolf (2008). Robust performance hypothesis testing with the Sharpe ratio. ","element":"span"},{"text":"Journal of","element":"span"}],[{"style":{"width":"29%"},"width":549,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-8.png","element":"img"}],[{"id":"id-2","text":"Maller, R., S. Roberts, and R. Tourky (2016). The large sample distribution of the maximum sharpe ratio","element":"span"}],[{"style":{"width":"64%"},"width":1207,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-9.png","element":"img"}],[{"id":"id-1","text":"Maller, R. and D. Turkington (2002). New light on portfolio allocation problem. ","element":"span"},{"text":"Mathematical Methods of","element":"span"}],[{"style":{"width":"30%"},"width":579,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-10.png","element":"img"}],[{"id":"id-60","text":"Markowitz, H. (1952). Portfolio selection. ","element":"span"},{"text":"Journal of Finance 7","element":"span"},{"text":", 77–91.","element":"span"}],[{"id":"id-0","text":"Meinshausen, N. and P. B¨uhlmann (2006). High-dimensional graphs and variable selection with the lasso.","element":"span"}],[{"style":{"width":"33%"},"width":631,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-11.png","element":"img"}],[{"id":"id-78","text":"Memmel, C. (2003). Performance hypothesis testing with the sharpe ratio. ","element":"span"},{"text":"Finance Letters 1","element":"span"},{"text":"(1).","element":"span"}],[{"id":"id-91","text":"Merlevede, F., M. Peligrad, and E. Rio (2011). A Bernstein type inequality and moderate deviations for","element":"span"}],[{"style":{"width":"76%"},"width":1427,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/38-12.png","element":"img"}],[{"id":"id-71","text":"Ramprasad, P. (2016). ","element":"span"},{"text":"nlshrink: Non-Linear Shrinkage Estimation of Population Eigenvalues and Covari-","element":"span"}],[{"style":{"width":"36%"},"width":683,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/39-0.png","element":"img"}],[{"id":"id-6","text":"Senneret, M., Y. Malevergne, P. Abry, G. Perrin, and L. Jaffr`es (2016). Covariance versus precision matrix","element":"span"}],[{"style":{"width":"97%"},"width":1830,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/39-1.png","element":"img"}],[{"id":"id-50","text":"Shanken, J. (1992). The current state of the arbitrage pricing theory. ","element":"span"},{"text":"Journal of Finance 47","element":"span"},{"text":", 1569–1574.","element":"span"}],[{"id":"id-23","text":"Tu, J. and G. Zhou (2011). Markowitz meets talmud: A combination of sophisticated and naive diversification","element":"span"}],[{"style":{"width":"52%"},"width":981,"height":38,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/39-2.png","element":"img"}],[{"id":"id-90","text":"van de Geer, S. (2016). ","element":"span"},{"text":"Estimation and testing under sparsity","element":"span"},{"text":". Springer-Verlag.","element":"span"}],[{"id":"id-74","text":"Zhang, Y., R. Li, and C.-L. Tsai (2010). Regularization parameter selections via generalized information","element":"span"}],[{"style":{"width":"71%"},"width":1347,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/39-3.png","element":"img"}],[{"style":{"width":"47%"},"width":889,"height":75,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-0.png","element":"img"}],[{"text":"Sharpe Ratio Analysis in High Dimensions: Residual-Based Nodewise Regression in Factor Models","element":"span"}],[{"style":{"width":"94%"},"width":1771,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-1.png","element":"img"}]]},{"heading":"Supplement A","paragraphs":[[{"text":"Supplement A is divided into several parts. The first part has preliminary proofs, norm inequalities, defi-nitions, and a maximal inequality that is extended in a very minor form from the existing literature. The second part has the proofs of lemmata that lead to proof of Theorem ","element":"span"},{"href":"#id-43","text":"1. ","element":"a"},{"text":"The first two parts relate only to the proof of Theorem ","element":"span"},{"href":"#id-43","text":"1. ","element":"a"},{"text":"The third part is only related to the proof of Theorem ","element":"span"},{"href":"#id-88","text":"2. ","element":"a"},{"text":"Part 4 is related to all the remaining proofs of the theorems in this paper.","element":"span"}],[{"text":"Part 1","element":"span"}],[{"text":"We start with a lemma that provides norm inequalities. Let ","element":"span"},{"style":{"height":14.8},"width":415.68,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-2.png","element":"img","alt":" A1 : p × K, B1 : K × K","inline":true,"padRight":true},{"text":"matrices and ","element":"span"},{"style":{"height":10.8},"width":130.36,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-3.png","element":"img","alt":" x : K ×","inline":true,"padRight":true},{"text":"1 vector.","element":"span"}],[{"style":{"width":"67%"},"width":1272,"height":208,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-4.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-89","text":"A.1","element":"a"},{"text":". (i). Set ","element":"span"},{"style":{"height":13.1},"width":175.84,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-5.png","element":"img","alt":" B1x = x1","inline":true},{"text":", and let ","element":"span"},{"style":{"height":13.82},"width":38.96,"height":34.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-6.png","element":"img","alt":" a′j","inline":true,"padRight":true},{"text":"be the 1 ","element":"span"},{"style":{"height":10.8},"width":75.84,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-7.png","element":"img","alt":" × K","inline":true,"padRight":true},{"text":"row vector of ","element":"span"},{"style":{"height":13.9},"width":50.56,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-8.png","element":"img","alt":" A1","inline":true}],[{"style":{"width":"76%"},"width":1431,"height":228,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-9.png","element":"img"}],[{"text":"where we use H¨older’s inequality for the first inequality, and the relation between ","element":"span"},{"style":{"height":14},"width":91.52,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-10.png","element":"img","alt":" l1, l∞","inline":true,"padRight":true},{"text":"norms for the second inequality, and to get the ","element":"span"},{"href":"#id-90","text":"last inequalit","element":"a"},{"text":"y ","element":"span"},{"href":"#id-90","text":"we r","element":"a"},{"text":"epeat the first two inequalities.","element":"span"}],[{"style":{"width":"96%"},"width":1805,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-11.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":16},"width":75.44,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-12.png","element":"img","alt":" ∥.∥l1","inline":true,"padRight":true},{"text":"is the maximum absolute column sum norm of ","element":"span"},{"style":{"height":15.44},"width":104.8,"height":38.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-13.png","element":"img","alt":" B1A′1","inline":true,"padRight":true},{"text":"matrix (i.e. ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-14.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"induced matrix norm). Let ","element":"span"},{"style":{"height":13.1},"width":30.64,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-15.png","element":"img","alt":"bl","inline":true},{"text":"’ be 1 ","element":"span"},{"style":{"height":10.8},"width":75.84,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-16.png","element":"img","alt":" × K","inline":true,"padRight":true},{"text":"row vector of ","element":"span"},{"style":{"height":13.1},"width":52.48,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-17.png","element":"img","alt":" B1","inline":true},{"text":", and ","element":"span"},{"style":{"height":11.9},"width":38.44,"height":29.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-18.png","element":"img","alt":" aj","inline":true,"padRight":true},{"text":"is the ","element":"span"},{"text":"j","element":"span"},{"text":"th column of ","element":"span"},{"style":{"height":15.63},"width":50.56,"height":39.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-19.png","element":"img","alt":" A′1","inline":true,"padRight":true},{"text":"matrix.","element":"span"}],[{"style":{"width":"79%"},"width":1486,"height":256,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/40-20.png","element":"img"}],[{"text":"where we use H¨older’s inequality for the first inequality, and ","element":"span"},{"style":{"height":14},"width":91.52,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-0.png","element":"img","alt":" l1, l∞","inline":true,"padRight":true},{"text":"norm relation for the other inequalities.","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":189,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-1.png","element":"img"}],[{"text":"Next we provide a lemma that is directly from Lemma A.2 of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":".","element":"span"}],[{"text":"Lemma A.2. ","element":"span"},{"href":"#id-17","text":"(Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011","element":"a"},{"text":")). Suppose that two random variables ","element":"span"},{"style":{"height":14},"width":113.92,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-2.png","element":"img","alt":" Z1, Z2","inline":true,"padRight":true},{"text":"satisfy the following exponential type tail condition. There exist ","element":"span"},{"style":{"height":12.64},"width":157.08,"height":31.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-3.png","element":"img","alt":" rz1, rz2 ∈","inline":true,"padRight":true},{"text":"(0","element":"span"},{"text":", ","element":"span"},{"text":"1) ","element":"span"},{"text":"and ","element":"span"},{"style":{"height":14.64},"width":159.64,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-4.png","element":"img","alt":" bz1, bz2 >","inline":true,"padRight":true},{"text":"0 ","element":"span"},{"text":"constant such that for all ","element":"span"},{"text":"s > ","element":"span"},{"text":"0","element":"span"}],[{"style":{"width":"69%"},"width":1308,"height":225,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-5.png","element":"img"}],[{"text":"We provide now the following maximal inequality due to Theorem 1 of ","element":"span"},{"href":"#id-91","text":"Merlevede et al. ","element":"a"},{"href":"#id-91","text":"(2011)","element":"a"},{"text":", and used in the proof of Lemma A.3(i) and proof of Lemma B.1(ii) in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":". To that effect, we provide a general assumption on data, and then show the theorem and its proof.","element":"span"}],[{"text":"Assumption L1. ","element":"span"},{"text":"(i). ","element":"span"},{"style":{"height":14},"width":121.44,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-6.png","element":"img","alt":" Xt, Y t","inline":true,"padRight":true},{"text":"are vectors of dimension ","element":"span"},{"style":{"height":13.1},"width":38.64,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-7.png","element":"img","alt":" dx","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":15.5},"width":36.64,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-8.png","element":"img","alt":" dy","inline":true},{"text":", respectively, for ","element":"span"},{"text":"t ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":119.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-9.png","element":"img","alt":", · · · , n","inline":true},{"text":". They are both stationary and ergodic. Also ","element":"span"},{"style":{"height":16},"width":163.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-10.png","element":"img","alt":" {Xt, Y t}","inline":true,"padRight":true},{"text":"are strong mixing with strong mixing coefficients are satisfying","element":"span"}],[{"style":{"width":"18%"},"width":349,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-11.png","element":"img"}],[{"text":"with ","element":"span"},{"text":"t","element":"span"},{"text":", a positive integer, and ","element":"span"},{"style":{"height":13.5},"width":99.16,"height":33.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-12.png","element":"img","alt":" rxy >","inline":true,"padRight":true},{"text":"0 ","element":"span"},{"text":"a positive constant. (ii). We also let ","element":"span"},{"style":{"height":14},"width":121.92,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-13.png","element":"img","alt":" Xt, Y t","inline":true,"padRight":true},{"text":"satisfy the exponential tail condition for ","element":"span"},{"style":{"height":13.6},"width":32.32,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-14.png","element":"img","alt":" j1","inline":true,"padRight":true},{"text":"= 1","element":"span"},{"style":{"height":14},"width":185.92,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-15.png","element":"img","alt":", · · · , dx, j2","inline":true,"padRight":true},{"text":"= 1","element":"span"},{"style":{"height":15.5},"width":131.68,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-16.png","element":"img","alt":", · · · , dy","inline":true}],[{"style":{"width":"96%"},"width":1808,"height":986,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-17.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-43","text":"A.1","element":"a"},{"text":". This is a simple application of Lemma A.2 above with Assumption L1 for Theorem 1 of ","element":"span"},{"href":"#id-91","text":"Merlevede et al. ","element":"a"},{"href":"#id-91","text":"(2011)","element":"a"},{"text":", and Bonferroni union bound.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/41-18.png","element":"img"}],[{"text":"Part 2","element":"span"}],[{"text":"We start with an important maximal inequality applied to factor models in nodewise regression setting. Some of the results are already in Lemma A.3, Lemma B.1 of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":". We show them so that readers can see all results without referral to other literature. We also provide two new results Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"ii), (v) due to nodewise regression interaction with factor models.","element":"span"}],[{"text":"Lemma A.3. ","element":"span"},{"text":"Under Assumptions ","element":"span"},{"href":"#id-38","text":"1-","element":"a"},{"href":"#id-39","text":"3, ","element":"a"},{"text":"for ","element":"span"},{"style":{"height":13.11},"width":205.72,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-0.png","element":"img","alt":" C > Cm >","inline":true,"padRight":true},{"text":"0","element":"span"},{"text":", with ","element":"span"},{"text":"m ","element":"span"},{"text":"= 1","element":"span"},{"text":", ","element":"span"},{"text":"2","element":"span"},{"text":", ","element":"span"},{"text":"3","element":"span"},{"text":", ","element":"span"},{"text":"4","element":"span"},{"text":", ","element":"span"},{"text":"5 ","element":"span"},{"text":"with ","element":"span"},{"style":{"height":13.11},"width":56.33,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-1.png","element":"img","alt":" Cm","inline":true,"padRight":true},{"text":"that is used in Theorem ","element":"span"},{"href":"#id-43","text":"A.1. ","element":"a"},{"text":"(i).","element":"span"}],[{"style":{"width":"64%"},"width":1211,"height":165,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-2.png","element":"img"}],[{"text":"(ii). Denote ","element":"span"},{"style":{"height":15.51},"width":74.44,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-3.png","element":"img","alt":" U −j","inline":true,"padRight":true},{"text":"as the ","element":"span"},{"text":"(","element":"span"},{"style":{"height":10},"width":60.76,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-4.png","element":"img","alt":"p −","inline":true,"padRight":true},{"text":"1) ","element":"span"},{"style":{"height":8},"width":64.32,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-5.png","element":"img","alt":" × n","inline":true,"padRight":true},{"text":"matrix in ","element":"span"},{"href":"#id-35","text":"(4)","element":"a"},{"text":", and let the ","element":"span"},{"text":"l ","element":"span"},{"text":"th row and ","element":"span"},{"text":"t ","element":"span"},{"text":"th column element ","element":"span"},{"style":{"height":15.51},"width":106.56,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-6.png","element":"img","alt":" U−j,l,t","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":13.44},"width":38.44,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-7.png","element":"img","alt":" ηj","inline":true,"padRight":true},{"text":"as ","element":"span"},{"style":{"height":8},"width":63.64,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-8.png","element":"img","alt":" n ×","inline":true,"padRight":true},{"text":"1 ","element":"span"},{"text":"vector, and the ","element":"span"},{"text":"t ","element":"span"},{"text":"th element as ","element":"span"},{"style":{"height":11.51},"width":54.24,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-9.png","element":"img","alt":" ηj,t","inline":true}],[{"style":{"width":"82%"},"width":1545,"height":959,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-10.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-64","text":"A.3","element":"a"},{"text":". (i). This is Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"i) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":".","element":"span"}],[{"text":"(ii). The proof follows from Theorem ","element":"span"},{"href":"#id-43","text":"A.1 ","element":"a"},{"text":"and Assumption ","element":"span"},{"href":"#id-39","text":"3 ","element":"a"},{"text":"provides the tail probability through the same algebra as in p.3346 of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"href":"#id-17","text":".","element":"a"}],[{"style":{"width":"43%"},"width":822,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-11.png","element":"img"}],[{"text":"(v). The proof will involve several steps and this is due to interaction of factor models (","element":"span"},{"style":{"height":15.5},"width":58.56,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-12.png","element":"img","alt":"fk,t","inline":true},{"text":") and nodewise error (","element":"span"},{"style":{"height":11.5},"width":54.24,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-13.png","element":"img","alt":"ηj,t","inline":true},{"text":"). Start with the definition of","element":"span"}],[{"style":{"width":"70%"},"width":1326,"height":204,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/42-14.png","element":"img"}],[{"style":{"width":"5%"},"width":102,"height":44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-0.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":15.5},"width":48.52,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-1.png","element":"img","alt":" Cj","inline":true,"padRight":true},{"text":":=","element":"span"}],[{"style":{"width":"86%"},"width":1624,"height":548,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-2.png","element":"img"}],[{"text":"where we use ","element":"span"},{"href":"#id-92","text":"(A.7) ","element":"a"},{"text":"for the first equality and H¨older’s inequality for the first inequality. Consider","element":"span"}],[{"style":{"width":"64%"},"width":1201,"height":62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-3.png","element":"img"}],[{"text":"where we use ","element":"span"},{"style":{"height":15.5},"width":48.04,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-4.png","element":"img","alt":" Cj","inline":true,"padRight":true},{"text":"definition. Noting that ","element":"span"},{"style":{"height":15.5},"width":147.88,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-5.png","element":"img","alt":" Σn,−j,−j","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"height":14},"width":198.52,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-6.png","element":"img","alt":" p − 1 × p −","inline":true,"padRight":true},{"text":"1 submatrix of ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-7.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"consisting all rows and columns of ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-8.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"except the ","element":"span"},{"text":"j","element":"span"},{"text":"th one. See that","element":"span"}],[{"style":{"width":"54%"},"width":1018,"height":107,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-9.png","element":"img"}],[{"text":"Then,","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":161,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-10.png","element":"img"}],[{"text":"for the second inequality in ","element":"span"},{"href":"#id-93","text":"(A.10)","element":"a"},{"text":", given our Assumption max","element":"span"},{"style":{"height":19.79},"width":390.88,"height":49.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-11.png","element":"img","alt":"1≤j≤p E[u2j,t] ≤ C < ∞","inline":true},{"text":". Hence,","element":"span"}],[{"style":{"width":"71%"},"width":1343,"height":67,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-12.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-93","text":"(A.10)","element":"a"},{"text":". Clearly, by ","element":"span"},{"href":"#id-94","text":"(A.9)","element":"a"},{"text":"-","element":"span"},{"href":"#id-95","text":"(A.11)","element":"a"}],[{"style":{"width":"20%"},"width":386,"height":68,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-13.png","element":"img"}],[{"text":"Then since ","element":"span"},{"style":{"height":15.51},"width":46.12,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-14.png","element":"img","alt":" Ωj","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":26.86},"width":39.36,"height":67.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-15.png","element":"img","alt":"Cjτ 2j","inline":true,"padRight":true},{"text":"and by ","element":"span"},{"href":"#id-96","text":"(A.61)","element":"a"}],[{"style":{"width":"59%"},"width":1122,"height":59,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-16.png","element":"img"}],[{"text":"Next, use Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"iii) and ","element":"span"},{"href":"#id-97","text":"(A.12) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-94","text":"(A.8) ","element":"a"},{"text":"to show","element":"span"}],[{"style":{"width":"53%"},"width":999,"height":168,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-17.png","element":"img"}],[{"text":"This also implies that, since ","element":"span"},{"text":"X ","element":"span"},{"text":":= (","element":"span"},{"style":{"height":14.64},"width":187.04,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-18.png","element":"img","alt":"f 1, · · · , f n","inline":true},{"text":") : ","element":"span"},{"style":{"height":10.8},"width":109.44,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-19.png","element":"img","alt":" K × n","inline":true,"padRight":true},{"text":"matrix, and ","element":"span"},{"style":{"height":13.44},"width":38.44,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-20.png","element":"img","alt":" ηj","inline":true,"padRight":true},{"text":":= (","element":"span"},{"style":{"height":16.7},"width":249.2,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-21.png","element":"img","alt":"ηj,1, · · · , ηj,n)′","inline":true,"padRight":true},{"text":": ","element":"span"},{"style":{"height":8},"width":63.64,"height":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-22.png","element":"img","alt":" n ×","inline":true,"padRight":true},{"text":"1","element":"span"}],[{"style":{"width":"66%"},"width":1254,"height":200,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/43-23.png","element":"img"}],[{"text":"Now we start defining two events, and we condition the next lemma, which is ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-0.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"bound on nodewise regression estimates, on these two events. Then we relax this restriction, and show that an unconditional result for ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-1.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"norm of the nodewise regression estimates after finding that these two events converge in","element":"span"}],[{"text":"probability to one. Define","element":"span"}],[{"style":{"width":"67%"},"width":1270,"height":90,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-2.png","element":"img"}],[{"text":"and define the population adaptive restricted eigenvalue condition, as in ","element":"span"},{"href":"#id-8","text":"Caner and Kock ","element":"a"},{"href":"#id-8","text":"(2018)","element":"a"},{"text":", for ","element":"span"},{"text":"j ","element":"span"},{"text":"=","element":"span"}],[{"text":"1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-3.png","element":"img","alt":", · · · , p","inline":true},{"text":", and let ","element":"span"},{"style":{"height":17.04},"width":54.24,"height":42.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-4.png","element":"img","alt":" δSj","inline":true,"padRight":true},{"text":"represent the vector with ","element":"span"},{"style":{"height":15.5},"width":37.48,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-5.png","element":"img","alt":" Sj","inline":true,"padRight":true},{"text":"indices in ","element":"span"},{"style":{"height":16.3},"width":35.56,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-6.png","element":"img","alt":" δj","inline":true},{"text":", and all the other elements than ","element":"span"},{"style":{"height":15.5},"width":37.48,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-7.png","element":"img","alt":" Sj","inline":true,"padRight":true},{"text":"indices in ","element":"span"},{"style":{"height":16.3},"width":35.56,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-8.png","element":"img","alt":"δj","inline":true,"padRight":true},{"text":"set to zero","element":"span"}],[{"style":{"width":"83%"},"width":1572,"height":164,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-9.png","element":"img"}],[{"text":"and the empirical version of the adaptive restricted eigenvalue condition is as follows, with ","element":"span"},{"style":{"height":15.5},"width":274.08,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-10.png","element":"img","alt":"�U −j : p − 1 × n","inline":true,"padRight":true},{"text":"matrix","element":"span"}],[{"style":{"width":"86%"},"width":1614,"height":130,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-11.png","element":"img"}],[{"text":"and the event is for each ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-12.png","element":"img","alt":", · · · , p","inline":true}],[{"style":{"width":"25%"},"width":486,"height":53,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-13.png","element":"img"}],[{"text":"We have the following ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-14.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"bound result.","element":"span"}],[{"style":{"width":"99%"},"width":1867,"height":499,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-15.png","element":"img"}],[{"text":"Use ","element":"span"},{"href":"#id-75","text":"(10) ","element":"a"},{"text":"to have ","element":"span"},{"style":{"height":17.95},"width":208.36,"height":44.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-16.png","element":"img","alt":" �uj − �U′−j�γj","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":18.24},"width":329.8,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-17.png","element":"img","alt":" ηxj − �U′−j(�γj − γj","inline":true},{"text":") and this last equation can be substituted into first left ","element":"span"},{"text":"side term and first right side term in ","element":"span"},{"href":"#id-98","text":"(A.17) ","element":"a"},{"text":"to have","element":"span"}],[{"style":{"width":"78%"},"width":1478,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-18.png","element":"img"}],[{"text":"Simplify the first term on the left and the first term on the right side of ","element":"span"},{"href":"#id-99","text":"(A.18)","element":"a"},{"text":",","element":"span"}],[{"style":{"width":"82%"},"width":1545,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-19.png","element":"img"}],[{"text":"Since we use ","element":"span"},{"style":{"height":13.9},"width":47.68,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-20.png","element":"img","alt":" A1","inline":true,"padRight":true},{"text":"and then H¨older’s inequality","element":"span"}],[{"style":{"width":"79%"},"width":1481,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/44-21.png","element":"img"}],[{"text":"Use ","element":"span"},{"style":{"height":21.09},"width":452.8,"height":52.72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-0.png","element":"img","alt":" ∥�γj∥1 = ∥�γSj∥1 + ∥�γScj ∥1","inline":true},{"text":", on the second term on the left side of ","element":"span"},{"href":"#id-100","text":"(A.20) ","element":"a"},{"text":"(","element":"span"},{"style":{"height":15.5},"width":37.48,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-1.png","element":"img","alt":"Sj","inline":true,"padRight":true},{"text":"represents the indices of ","element":"span"},{"text":"nonzero cells in row ","element":"span"},{"text":"j ","element":"span"},{"text":"of the precision matrix, and ","element":"span"},{"style":{"height":17.23},"width":40.88,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-2.png","element":"img","alt":" Scj","inline":true,"padRight":true},{"text":"represents the indices of zero cells in row ","element":"span"},{"text":"j ","element":"span"},{"text":"of the ","element":"span"},{"text":"precision matrix).","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":628,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-3.png","element":"img"}],[{"text":"Use the norm inequality ","element":"span"},{"style":{"height":19.17},"width":582.87,"height":47.92,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-4.png","element":"img","alt":" ∥�γSj − γSj∥1 ≤ √sj∥�γSj − γSj∥2","inline":true}],[{"style":{"width":"75%"},"width":1416,"height":107,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-5.png","element":"img"}],[{"text":"Now ignoring the first term above and dividing the rest by ","element":"span"},{"style":{"height":13.11},"width":92.44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-6.png","element":"img","alt":" λn >","inline":true,"padRight":true},{"text":"0, provides the restricted set condition","element":"span"}],[{"text":"(cone condition) in adaptive restricted eigenvalue condition","element":"span"}],[{"style":{"width":"63%"},"width":1181,"height":55,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-7.png","element":"img"}],[{"text":"Set ","element":"span"},{"style":{"height":18.03},"width":216.04,"height":45.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-8.png","element":"img","alt":" δj = �γj −γj","inline":true,"padRight":true},{"text":"in the empirical adaptive restricted set condition in ","element":"span"},{"href":"#id-101","text":"(A.16)","element":"a"},{"text":", then use the empirical adaptive restricted eigenvalue condition in ","element":"span"},{"href":"#id-102","text":"(A.24)","element":"a"}],[{"style":{"width":"56%"},"width":1054,"height":137,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-9.png","element":"img"}],[{"text":"Then use 3","element":"span"},{"style":{"height":17.36},"width":150.08,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-10.png","element":"img","alt":"ab ≤ a2/","inline":true},{"text":"2 + 9","element":"span"},{"style":{"height":17.36},"width":54.56,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-11.png","element":"img","alt":"b2/","inline":true},{"text":"2 with ","element":"span"},{"style":{"height":27.82},"width":166.56,"height":69.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-12.png","element":"img","alt":" b =λn√sj�φ(sj)","inline":true,"padRight":true},{"text":", ","element":"span"},{"style":{"height":18.43},"width":377.6,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-13.png","element":"img","alt":" a = ∥ �U′−j(�γj − γj)∥n","inline":true},{"text":".","element":"span"}],[{"style":{"width":"59%"},"width":1107,"height":138,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-14.png","element":"img"}],[{"text":"Use ","element":"span"},{"style":{"height":16.31},"width":60.52,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-15.png","element":"img","alt":" A2j","inline":true,"padRight":true},{"text":"in the first term on the right side and simplify","element":"span"}],[{"style":{"width":"40%"},"width":756,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-16.png","element":"img"}],[{"text":"This implies","element":"span"}],[{"style":{"width":"62%"},"width":1178,"height":93,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-17.png","element":"img"}],[{"text":"Now to get ","element":"span"},{"style":{"height":13.11},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-18.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"bound, ignore the first term in ","element":"span"},{"href":"#id-102","text":"(A.24) ","element":"a"},{"text":"and add both sides ","element":"span"},{"style":{"height":18.98},"width":275.2,"height":47.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-19.png","element":"img","alt":" λn∥�γSj − γSj∥1","inline":true}],[{"style":{"width":"94%"},"width":1774,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/45-20.png","element":"img"}],[{"text":"Use the norm inequality ","element":"span"},{"style":{"height":18.97},"width":582.88,"height":47.44,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-0.png","element":"img","alt":" ∥�γSj − γSj∥1 ≤ √sj∥�γSj − γSj∥2","inline":true,"padRight":true},{"text":"for the first term on the right side of ","element":"span"},{"href":"#id-103","text":"(A.27)","element":"a"}],[{"style":{"width":"67%"},"width":1265,"height":198,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-1.png","element":"img"}],[{"text":"and can use the empirical adaptive restricted eigenvalue condition in ","element":"span"},{"href":"#id-101","text":"(A.16)","element":"a"}],[{"style":{"width":"34%"},"width":652,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-2.png","element":"img"}],[{"text":"Next, use ","element":"span"},{"href":"#id-104","text":"(A.26) ","element":"a"},{"text":"and ","element":"span"},{"style":{"height":16.3},"width":60.52,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-3.png","element":"img","alt":" A2j","inline":true,"padRight":true},{"text":"to have","element":"span"}],[{"style":{"width":"29%"},"width":544,"height":95,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-4.png","element":"img"}],[{"text":"Last inequality above is true by noticing ","element":"span"},{"style":{"height":15.11},"width":110.24,"height":37.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-5.png","element":"img","alt":" sj ≤ ¯s","inline":true,"padRight":true},{"text":"by ¯","element":"span"},{"text":"s ","element":"span"},{"text":"definition, and then by definition of population adaptive restricted eigenvalue condition ","element":"span"},{"style":{"height":18.06},"width":239.84,"height":45.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-6.png","element":"img","alt":" φ2(sj) ≥ φ2(¯s","inline":true},{"text":").","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-7.png","element":"img"}],[{"text":"Now we evaluate two events, in the next two lemmata.","element":"span"}],[{"style":{"width":"65%"},"width":1233,"height":402,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-8.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-105","text":"A.5","element":"a"},{"text":". Start with ","element":"span"},{"style":{"height":13.9},"width":47.68,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-9.png","element":"img","alt":" A1","inline":true,"padRight":true},{"text":"definition in ","element":"span"},{"href":"#id-106","text":"(A.14)","element":"a"},{"text":". Use ","element":"span"},{"href":"#id-37","text":"(9)","element":"a"},{"text":"-","element":"span"},{"href":"#id-75","text":"(11) ","element":"a"},{"text":"and ","element":"span"},{"style":{"height":13.1},"width":76.92,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-10.png","element":"img","alt":" M X","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":17.36},"width":352.44,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-11.png","element":"img","alt":" In − X′(XX′)−1X","inline":true,"padRight":true},{"text":"is idempotent such that","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":377,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-12.png","element":"img"}],[{"text":"Note that ","element":"span"},{"text":"U ","element":"span"},{"text":"is a ","element":"span"},{"style":{"height":11.2},"width":93.6,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-13.png","element":"img","alt":" p × n","inline":true,"padRight":true},{"text":"matrix and ","element":"span"},{"style":{"height":15.5},"width":74.44,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-14.png","element":"img","alt":" U −j","inline":true,"padRight":true},{"text":"is the ","element":"span"},{"style":{"height":14},"width":162.72,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-15.png","element":"img","alt":" p − 1 × n","inline":true,"padRight":true},{"text":"submatrix, which is ","element":"span"},{"text":"U ","element":"span"},{"text":"without the ","element":"span"},{"text":"j","element":"span"},{"text":"th row. As a","element":"span"}],[{"text":"consequence,","element":"span"}],[{"style":{"width":"64%"},"width":1213,"height":116,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-16.png","element":"img"}],[{"text":"with probability at least 1 ","element":"span"},{"style":{"height":17.36},"width":163.36,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-17.png","element":"img","alt":" − O(1/p2","inline":true},{"text":") by Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"ii). Next, for the second right side term in ","element":"span"},{"href":"#id-107","text":"(A.29) ","element":"a"},{"text":"we","element":"span"}],[{"text":"have that","element":"span"}],[{"style":{"width":"97%"},"width":1816,"height":191,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/46-18.png","element":"img"}],[{"text":"by Lemma ","element":"span"},{"href":"#id-89","text":"A.1(","element":"a"},{"text":"i). We evaluate each term in ","element":"span"},{"href":"#id-108","text":"(A.31)","element":"a"},{"text":". Note that ","element":"span"},{"text":"X ","element":"span"},{"text":"= (","element":"span"},{"style":{"height":14.64},"width":187.04,"height":36.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-0.png","element":"img","alt":"f 1, · · · , f n","inline":true},{"text":") : ","element":"span"},{"style":{"height":14},"width":296.16,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-1.png","element":"img","alt":" K × n, U : p × n","inline":true}],[{"style":{"width":"60%"},"width":1132,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-2.png","element":"img"}],[{"text":"Then, by Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"iii),","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":359,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-3.png","element":"img"}],[{"text":"with probability at least 1 ","element":"span"},{"style":{"height":17.36},"width":166.72,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-4.png","element":"img","alt":" − O(1/n2","inline":true},{"text":"). Next, since ","element":"span"},{"style":{"height":14.24},"width":139,"height":35.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-5.png","element":"img","alt":" ηj : n ×","inline":true,"padRight":true},{"text":"1, and ","element":"span"},{"style":{"height":13.44},"width":60,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-6.png","element":"img","alt":" ηj,t","inline":true,"padRight":true},{"text":"is the ","element":"span"},{"text":"t","element":"span"},{"text":"th element","element":"span"}],[{"style":{"width":"71%"},"width":1333,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-7.png","element":"img"}],[{"text":"Then, by Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"v),","element":"span"}],[{"style":{"width":"78%"},"width":1467,"height":168,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-8.png","element":"img"}],[{"text":"Combine ","element":"span"},{"href":"#id-109","text":"(A.30)","element":"a"},{"text":"-","element":"span"},{"href":"#id-110","text":"(A.35) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-107","text":"(A.29) ","element":"a"},{"text":"in order to form","element":"span"}],[{"style":{"width":"72%"},"width":1362,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-9.png","element":"img"}],[{"text":"with probability at least 1 ","element":"span"},{"style":{"height":17.36},"width":355.84,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-10.png","element":"img","alt":" − O(1/p2) − O(1/n2","inline":true},{"text":"). Now use ","element":"span"},{"href":"#id-106","text":"(A.14) ","element":"a"},{"text":"to get ","element":"span"},{"style":{"height":13.1},"width":43.04,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-11.png","element":"img","alt":" λn","inline":true},{"text":".","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":200,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-12.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-111","text":"A.6","element":"a"},{"text":". For each ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-13.png","element":"img","alt":", · · · , p","inline":true},{"text":", add and subtract ","element":"span"},{"style":{"height":18.43},"width":305.8,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-14.png","element":"img","alt":" δ′j(U −jU ′−j/n)δj","inline":true}],[{"style":{"width":"99%"},"width":1867,"height":686,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/47-15.png","element":"img"}],[{"text":"Note that second right side term with absolute value in ","element":"span"},{"href":"#id-112","text":"(A.37) ","element":"a"},{"text":"can be bounded by using H¨older’s inequality","element":"span"}],[{"style":{"width":"81%"},"width":1531,"height":177,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-0.png","element":"img"}],[{"text":"By the same analysis applied to the first right side term with absolute value in ","element":"span"},{"href":"#id-112","text":"(A.37) ","element":"a"},{"text":"and simplifying","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":874,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-1.png","element":"img"}],[{"text":"with probability at least 1 ","element":"span"},{"style":{"height":17.36},"width":359.2,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-2.png","element":"img","alt":" − O(1/p2) − O(1/n2","inline":true},{"text":"). Next, in ","element":"span"},{"href":"#id-113","text":"(A.38) ","element":"a"},{"text":"see that ","element":"span"},{"style":{"height":15.5},"width":147.88,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-3.png","element":"img","alt":" Σn,−j,−j","inline":true,"padRight":true},{"text":"is a submatrix of ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-4.png","element":"img","alt":" Σn","inline":true},{"text":", and ","element":"span"},{"style":{"height":15.51},"width":74.44,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-5.png","element":"img","alt":" U −j","inline":true,"padRight":true},{"text":"is a submatrix of ","element":"span"},{"text":"U ","element":"span"},{"text":"as described above and","element":"span"}],[{"style":{"width":"76%"},"width":1441,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-6.png","element":"img"}],[{"text":"with probability at least 1 ","element":"span"},{"style":{"height":17.36},"width":163.84,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-7.png","element":"img","alt":" − O(1/p2","inline":true},{"text":") by Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"i). We need to provide some simplification for ","element":"span"},{"style":{"height":18.06},"width":95.2,"height":45.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-8.png","element":"img","alt":" ∥δj∥21","inline":true,"padRight":true},{"text":"term in ","element":"span"},{"href":"#id-113","text":"(A.38)","element":"a"},{"text":". Next, since the cone condition in adaptive restricted eigenvalue condition is satisfied in","element":"span"}],[{"style":{"width":"60%"},"width":1124,"height":109,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-9.png","element":"img"}],[{"text":"Then, add ","element":"span"},{"style":{"height":17.44},"width":114.88,"height":43.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-10.png","element":"img","alt":" ∥δSj∥1","inline":true,"padRight":true},{"text":"to the left side and right side and use the norm inequality that puts an upper bound on the ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-11.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"norm in terms of the ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-12.png","element":"img","alt":" l2","inline":true,"padRight":true},{"text":"norm. Hence,","element":"span"}],[{"style":{"width":"48%"},"width":906,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-13.png","element":"img"}],[{"text":"So, we have that","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":465,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/48-14.png","element":"img"}],[{"text":"Next, using the empirical and population adaptive restricted eigenvalue definitions and minimizing over ","element":"span"},{"style":{"height":16.3},"width":35.56,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-0.png","element":"img","alt":" δj","inline":true,"padRight":true},{"text":"we have that","element":"span"}],[{"style":{"width":"77%"},"width":1446,"height":284,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-1.png","element":"img"}],[{"text":"Note that, if we have with probability approaching one (wpa1 from now on)","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":269,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-2.png","element":"img"}],[{"text":"Thus, we need to show that following probability goes to zero","element":"span"}],[{"style":{"width":"98%"},"width":1838,"height":271,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-3.png","element":"img"}],[{"text":"Set ","element":"span"},{"style":{"height":4.8},"width":36.32,"height":12,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-4.png","element":"img","alt":" ǫn","inline":true,"padRight":true},{"text":":= 16","element":"span"},{"style":{"height":28.8},"width":113.44,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-5.png","element":"img","alt":"sj�K2","inline":true,"padRight":true},{"text":"ln(","element":"span"},{"style":{"height":19.2},"width":168.16,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-6.png","element":"img","alt":"p)/n +�","inline":true},{"text":"ln(","element":"span"},{"style":{"height":28.8},"width":98.68,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-7.png","element":"img","alt":"p)/n�","inline":true},{"text":". Clearly, by ","element":"span"},{"href":"#id-51","text":"(A.40) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-52","text":"(A.41) ","element":"a"},{"text":"we have that","element":"span"}],[{"style":{"width":"86%"},"width":1628,"height":296,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-8.png","element":"img"}],[{"text":"Since ","element":"span"},{"style":{"height":10.7},"width":88.96,"height":26.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-9.png","element":"img","alt":" ǫn →","inline":true,"padRight":true},{"text":"0 by Assumption ","element":"span"},{"href":"#id-42","text":"5, ","element":"a"},{"text":"by ","element":"span"},{"href":"#id-114","text":"(A.46)","element":"a"},{"href":"#id-115","text":"(A.47) ","element":"a"},{"style":{"height":18.06},"width":422.56,"height":45.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-10.png","element":"img","alt":" P(�φ2(sj) < φ2(sj)/2) →","inline":true,"padRight":true},{"text":"0.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-11.png","element":"img"}],[{"text":"One crucial point is that we need to get a low bound for ","element":"span"},{"style":{"height":19.29},"width":144.04,"height":48.24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-12.png","element":"img","alt":" ∩pj=1A2j","inline":true},{"text":". In that respect, from ","element":"span"},{"href":"#id-116","text":"(A.45) ","element":"a"},{"style":{"height":29.2},"width":35,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-13.png","element":"img","alt":"","inline":true}],[{"style":{"width":"75%"},"width":1417,"height":205,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-14.png","element":"img"}],[{"text":"Clearly by the definitions of ","element":"span"},{"style":{"height":13.1},"width":53.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-15.png","element":"img","alt":" Σn","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":15.5},"width":147.88,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-16.png","element":"img","alt":" Σn,−j,−j","inline":true,"padRight":true},{"text":"and population adaptive restricted eigenvalue condition, we have that","element":"span"}],[{"style":{"width":"81%"},"width":1529,"height":414,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/49-17.png","element":"img"}],[{"style":{"width":"89%"},"width":1672,"height":438,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-0.png","element":"img"}],[{"text":"Next, by ","element":"span"},{"href":"#id-117","text":"(A.33) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-52","text":"(A.41)","element":"a"},{"text":", via Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"iii), we have that","element":"span"}],[{"style":{"width":"95%"},"width":1797,"height":181,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-1.png","element":"img"}],[{"text":"We provide the main consistency result for residual based nodewise regression result.","element":"span"}],[{"id":"id-118","style":{"width":"67%"},"width":1258,"height":148,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-2.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-118","text":"A.7","element":"a"},{"text":". Use Lemmata ","element":"span"},{"href":"#id-105","text":"A.5-","element":"a"},{"href":"#id-111","text":"A.6 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-119","text":"(A.49) ","element":"a"},{"text":"to have","element":"span"}],[{"style":{"width":"43%"},"width":823,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-3.png","element":"img"}],[{"text":"Then, combine above with Lemma ","element":"span"},{"href":"#id-120","text":"A.4 ","element":"a"},{"text":"to have the desired result via Assumption ","element":"span"},{"href":"#id-42","text":"5 ","element":"a"},{"text":"and Lemma ","element":"span"},{"href":"#id-105","text":"A.5 ","element":"a"},{"text":"to have ","element":"span"},{"style":{"height":13.1},"width":135.64,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-4.png","element":"img","alt":"λn¯s = o","inline":true},{"text":"(1).","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-5.png","element":"img"}],[{"text":"Next, we provide proof of consistency for the estimates of the reciprocal of the main diagonal elements of the precision matrix.","element":"span"}],[{"id":"id-130","style":{"width":"79%"},"width":1497,"height":357,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-6.png","element":"img"}],[{"text":"and ","element":"span"},{"style":{"height":19.98},"width":37.6,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-7.png","element":"img","alt":" τ 2j","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":28.8},"width":127,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-8.png","element":"img","alt":" E�η2j,t�","inline":true},{"text":", with ","element":"span"},{"style":{"height":11.5},"width":54.24,"height":28.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-9.png","element":"img","alt":" ηj,t","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":13.63},"width":234.76,"height":34.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-10.png","element":"img","alt":" uj,t − u′−j,tγj","inline":true},{"text":", and ","element":"span"},{"style":{"height":13.44},"width":38.44,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-11.png","element":"img","alt":" ηj","inline":true,"padRight":true},{"text":":= (","element":"span"},{"style":{"height":16.7},"width":343,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-12.png","element":"img","alt":"ηj,1, · · · , ηj,n)′ : n ×","inline":true,"padRight":true},{"text":"1 vector ","element":"span"},{"style":{"height":13.44},"width":56.68,"height":33.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-13.png","element":"img","alt":" ηxj","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":17.04},"width":119.08,"height":42.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-14.png","element":"img","alt":" M Xηj","inline":true},{"text":". Using ","element":"span"},{"href":"#id-75","text":"(10) ","element":"a"},{"text":"for ","element":"span"},{"style":{"height":11.9},"width":40.36,"height":29.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-15.png","element":"img","alt":" �uj","inline":true,"padRight":true},{"text":"in ","element":"span"},{"style":{"height":19.98},"width":37.6,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-16.png","element":"img","alt":" �τ 2j","inline":true,"padRight":true},{"text":"definition we have","element":"span"}],[{"style":{"width":"45%"},"width":849,"height":300,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/50-17.png","element":"img"}],[{"text":"By the triangle inequality we get","element":"span"}],[{"style":{"width":"83%"},"width":1560,"height":329,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-0.png","element":"img"}],[{"text":"Consider each term in ","element":"span"},{"href":"#id-121","text":"(A.50) ","element":"a"},{"text":"carefully. Start with definition; ","element":"span"},{"style":{"height":13.11},"width":76.92,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-1.png","element":"img","alt":" M X","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":17.36},"width":355.32,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-2.png","element":"img","alt":" In − X′(XX′)−1X","inline":true},{"text":", and ","element":"span"},{"style":{"height":13.11},"width":76.92,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-3.png","element":"img","alt":" M X","inline":true,"padRight":true},{"text":"being idempotent.","element":"span"}],[{"style":{"width":"77%"},"width":1443,"height":282,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-4.png","element":"img"}],[{"text":"First, exactly as in Lemma ","element":"span"},{"href":"#id-64","text":"A.3(","element":"a"},{"text":"i) with Assumption ","element":"span"},{"href":"#id-40","text":"2(","element":"a"},{"text":"ii)(iv), 3","element":"span"},{"style":{"height":18.74},"width":214.36,"height":46.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-5.png","element":"img","alt":"r−12 + r−10 >","inline":true,"padRight":true},{"text":"1 we have by Theorem ","element":"span"},{"href":"#id-43","text":"A.1 ","element":"a"},{"text":"that","element":"span"}],[{"style":{"width":"66%"},"width":1254,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-6.png","element":"img"}],[{"text":"Then note that ","element":"span"},{"style":{"height":17.23},"width":192.76,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-7.png","element":"img","alt":" Xηj : K ×","inline":true,"padRight":true},{"text":"1 vector, and ","element":"span"},{"style":{"height":10.8},"width":95.6,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-8.png","element":"img","alt":" XX′","inline":true,"padRight":true},{"text":": ","element":"span"},{"style":{"height":10.8},"width":121.44,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-9.png","element":"img","alt":" K × K","inline":true,"padRight":true},{"text":"matrix. Therefore,","element":"span"}],[{"style":{"width":"94%"},"width":1769,"height":638,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-10.png","element":"img"}],[{"text":"where we use H¨older’s inequality for the first inequality, and ","element":"span"},{"href":"#id-122","text":"(A.1) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-122","text":"(A.2) ","element":"a"},{"text":"for the second inequality, and the norm inequality between ","element":"span"},{"style":{"height":13.11},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-11.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":13.11},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-12.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"norms for the third inequality (i.e. ","element":"span"},{"style":{"height":16},"width":129.4,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-13.png","element":"img","alt":" ∥x∥1 ≤","inline":true,"padRight":true},{"text":"dim(","element":"span"},{"style":{"height":16},"width":153.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-14.png","element":"img","alt":"x)∥x∥∞,","inline":true,"padRight":true},{"text":"dim(","element":"span"},{"text":"x","element":"span"},{"text":") :","element":"span"}],[{"text":"dimension of the vector x). Next by ","element":"span"},{"href":"#id-117","text":"(A.33)","element":"a"},{"text":", ","element":"span"},{"href":"#id-123","text":"(A.34)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-110","text":"(A.35)","element":"a"},{"text":", we have by ","element":"span"},{"href":"#id-124","text":"(A.53) ","element":"a"},{"text":"that","element":"span"}],[{"style":{"width":"73%"},"width":1377,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-15.png","element":"img"}],[{"text":"Combine ","element":"span"},{"href":"#id-125","text":"(A.52) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-124","text":"(A.54) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-126","text":"(A.51) ","element":"a"},{"text":"to have the first term on the right side of ","element":"span"},{"href":"#id-121","text":"(A.50) ","element":"a"},{"text":"by Assumption ","element":"span"},{"href":"#id-42","text":"5 ","element":"a"},{"text":"to","element":"span"}],[{"text":"get the last equality in ","element":"span"},{"href":"#id-124","text":"(A.55)","element":"a"}],[{"style":{"width":"76%"},"width":1428,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/51-16.png","element":"img"}],[{"style":{"width":"96%"},"width":1807,"height":158,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-0.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-121","text":"(A.50) ","element":"a"},{"text":"consider the second term on the right side by ","element":"span"},{"href":"#id-127","text":"(A.56)","element":"a"},{"text":", Lemma ","element":"span"},{"href":"#id-118","text":"A.7","element":"a"}],[{"style":{"width":"81%"},"width":1528,"height":235,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-1.png","element":"img"}],[{"text":"Consider the third term on the right side of ","element":"span"},{"href":"#id-121","text":"(A.50)","element":"a"},{"text":", where we use H¨older’s inequality to get","element":"span"}],[{"style":{"width":"78%"},"width":1469,"height":232,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-2.png","element":"img"}],[{"text":"for the rates we use ","element":"span"},{"href":"#id-95","text":"(A.11)","element":"a"},{"text":", ","element":"span"},{"href":"#id-127","text":"(A.56)","element":"a"},{"text":". Last we consider the fourth term on the right side of ","element":"span"},{"href":"#id-121","text":"(A.50)","element":"a"},{"text":". To get a better rate, we start with the Karush-Kuhn-Tucker (KKT) conditions in ","element":"span"},{"href":"#id-75","text":"(12)","element":"a"},{"text":". The following ","element":"span"},{"style":{"height":10},"width":57.4,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-3.png","element":"img","alt":" p −","inline":true,"padRight":true},{"text":"1 equations","element":"span"}],[{"text":"form the KKT","element":"span"}],[{"style":{"width":"35%"},"width":668,"height":89,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-4.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":12.3},"width":39.4,"height":30.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-5.png","element":"img","alt":" �κj","inline":true,"padRight":true},{"text":"is the sub-differrential and explained in more detail in p.160 of Caner and Kock (2018) which replaces the gradient in non-differential penalties. Also for all ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1 ","element":"span"},{"style":{"height":16.7},"width":314.84,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-6.png","element":"img","alt":" · · · , p ∥�κj∥∞ ≤ 1.","inline":true,"padRight":true},{"text":"Use ","element":"span"},{"href":"#id-75","text":"(10) ","element":"a"},{"text":"for ","element":"span"},{"style":{"height":11.9},"width":39.88,"height":29.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-7.png","element":"img","alt":" �uj","inline":true,"padRight":true},{"text":"and rewrite KKT as","element":"span"}],[{"style":{"width":"99%"},"width":1867,"height":538,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-8.png","element":"img"}],[{"text":"Then the fourth term on the right side of ","element":"span"},{"href":"#id-121","text":"(A.50)","element":"a"}],[{"style":{"width":"86%"},"width":1617,"height":235,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-9.png","element":"img"}],[{"text":"where we use H¨older’s inequality, ","element":"span"},{"href":"#id-95","text":"(A.11) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-128","text":"(A.59)","element":"a"},{"text":". Clearly, ","element":"span"},{"href":"#id-128","text":"(A.58) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-129","text":"(A.60) ","element":"a"},{"text":"are the slowest among the four terms on the right side of ","element":"span"},{"href":"#id-121","text":"(A.50)","element":"a"},{"text":", and we use Assumption ","element":"span"},{"href":"#id-42","text":"5 ","element":"a"},{"text":"to get the desired result.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/52-10.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-43","text":"1","element":"a"},{"text":". First, we derive some of the key results. By definition of ","element":"span"},{"style":{"height":19.98},"width":37.6,"height":49.96,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-0.png","element":"img","alt":" τ 2j","inline":true,"padRight":true},{"text":", for ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":117.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-1.png","element":"img","alt":", · · · , p","inline":true},{"text":", and ","element":"span"},{"text":"since ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-2.png","element":"img","alt":" Ω","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.55},"width":74.08,"height":46.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-3.png","element":"img","alt":" Σ−1n","inline":true,"padRight":true},{"text":", with Assumption ","element":"span"},{"href":"#id-38","text":"1","element":"a"}],[{"style":{"width":"99%"},"width":1868,"height":301,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-4.png","element":"img"}],[{"text":"is bounded away from zero wpa1 by Lemma ","element":"span"},{"href":"#id-130","text":"A.8. ","element":"a"},{"text":"Then","element":"span"}],[{"style":{"width":"76%"},"width":1428,"height":183,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-5.png","element":"img"}],[{"text":"by Lemma ","element":"span"},{"href":"#id-130","text":"A.8, ","element":"a"},{"href":"#id-96","text":"(A.61)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-131","text":"(A.62)","element":"a"},{"text":". Now we complete the proof by using the formula for ","element":"span"},{"style":{"height":15.5},"width":113.8,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-6.png","element":"img","alt":"�Ωj, Ωj","inline":true},{"text":".","element":"span"}],[{"style":{"width":"56%"},"width":1064,"height":608,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-7.png","element":"img"}],[{"text":"where we use ","element":"span"},{"href":"#id-132","text":"(A.63)","element":"a"},{"text":", Lemma ","element":"span"},{"href":"#id-118","text":"A.7, ","element":"a"},{"href":"#id-95","text":"(A.11) ","element":"a"},{"text":"for the rates, and the last equality is by Assumption ","element":"span"},{"href":"#id-42","text":"5.","element":"a"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-8.png","element":"img"}],[{"text":"Part 3","element":"span"}],[{"text":"After the proof of Theorem ","element":"span"},{"href":"#id-43","text":"1 ","element":"a"},{"text":"we provide lemmata that lead to proof of Theorem ","element":"span"},{"href":"#id-88","text":"2. ","element":"a"},{"text":"We start with a lemma that is related to norm inequalities. First define generic matrices, ","element":"span"},{"style":{"height":14.8},"width":697.6,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-9.png","element":"img","alt":" A1 : p × K, A2 : K × p, D1 : K × K, D2","inline":true,"padRight":true},{"text":": ","element":"span"},{"style":{"height":11.2},"width":90.64,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-10.png","element":"img","alt":"p × p","inline":true},{"text":", also define a row vector ","element":"span"},{"style":{"height":7.2},"width":40.4,"height":18,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-11.png","element":"img","alt":" x′","inline":true,"padRight":true},{"text":": 1 ","element":"span"},{"style":{"height":11.2},"width":61.84,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-12.png","element":"img","alt":" × p","inline":true},{"text":", and also define ","element":"span"},{"style":{"height":11.2},"width":91.12,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-13.png","element":"img","alt":" p × p","inline":true,"padRight":true},{"text":"matrices ","element":"span"},{"style":{"height":14.8},"width":124.96,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-14.png","element":"img","alt":" A3, D3","inline":true},{"text":".","element":"span"}],[{"id":"id-138","style":{"width":"71%"},"width":1331,"height":447,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/53-15.png","element":"img"}],[{"style":{"width":"22%"},"width":419,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-0.png","element":"img"}],[{"text":"(i).","element":"span"}],[{"style":{"width":"61%"},"width":1159,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-1.png","element":"img"}],[{"text":"where we use submultiplicativity of matrix norms for the first inequality, and submultiplicativity of matrix norms and the following for the second inequality,","element":"span"}],[{"style":{"width":"46%"},"width":871,"height":66,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-2.png","element":"img"}],[{"text":"where ","element":"span"},{"style":{"height":18.03},"width":77,"height":45.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-3.png","element":"img","alt":" A′2,k","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":16.3},"width":90.28,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-4.png","element":"img","alt":" A2,kj","inline":true,"padRight":true},{"text":"are the ","element":"span"},{"text":"k","element":"span"},{"text":"th row of ","element":"span"},{"style":{"height":13.9},"width":50.56,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-5.png","element":"img","alt":" A2","inline":true},{"text":", and ","element":"span"},{"text":"k, j ","element":"span"},{"text":"element of ","element":"span"},{"style":{"height":13.9},"width":50.56,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-6.png","element":"img","alt":" A2","inline":true,"padRight":true},{"text":"respectively. Then, for the last inequality, ","element":"span"},{"text":"we use a matrix norm inequality that provides an upper bound for ","element":"span"},{"style":{"height":13.1},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-7.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"matrix norm in terms of spectral norm in p.365 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013)","element":"a"},{"text":".","element":"span"}],[{"text":"(ii).","element":"span"}],[{"style":{"width":"58%"},"width":1101,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-8.png","element":"img"}],[{"text":"where we use section 4.3 of ","element":"span"},{"href":"#id-90","text":"van de Geer ","element":"a"},{"href":"#id-90","text":"(2016) ","element":"a"},{"text":"for the first inequality, and the second inequality can be seen by defining ","element":"span"},{"style":{"height":17.23},"width":76.84,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-9.png","element":"img","alt":" D′2,j","inline":true,"padRight":true},{"text":"as the ","element":"span"},{"text":"j","element":"span"},{"text":"th row of ","element":"span"},{"style":{"height":13.1},"width":54.4,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-10.png","element":"img","alt":" D2","inline":true},{"text":", and ","element":"span"},{"style":{"height":16.3},"width":77,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-11.png","element":"img","alt":" A1,k","inline":true,"padRight":true},{"text":"as the ","element":"span"},{"text":"k","element":"span"},{"text":"th column of ","element":"span"},{"style":{"height":13.9},"width":50.56,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-12.png","element":"img","alt":" A1","inline":true,"padRight":true},{"text":"and using H¨older’s inequality","element":"span"}],[{"style":{"width":"66%"},"width":1244,"height":233,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-13.png","element":"img"}],[{"text":"(iii).","element":"span"}],[{"style":{"width":"69%"},"width":1298,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-14.png","element":"img"}],[{"text":"where we use p.345 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013) ","element":"a"},{"text":"for the first inequality, and ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-15.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"matrix norm submultiplicativity for the second inequality, and the last equality is by seeing that transpose of ","element":"span"},{"style":{"height":13.11},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-16.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"matrix norm is ","element":"span"},{"style":{"height":13.11},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-17.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"matrix norm.","element":"span"}],[{"text":"(iv).","element":"span"}],[{"style":{"width":"62%"},"width":1170,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-18.png","element":"img"}],[{"text":"where we use p.44 ","element":"span"},{"href":"#id-90","text":"van de Geer ","element":"a"},{"href":"#id-90","text":"(2016) ","element":"a"},{"text":"dual norm inequality for the first inequality, then for the second inequality we use submultiplicativity property of matrix norms,and for the last inequality we use ","element":"span"},{"style":{"height":16},"width":116.72,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-19.png","element":"img","alt":" ∥A1∥l1","inline":true,"padRight":true},{"text":":= max","element":"span"},{"style":{"height":20.07},"width":536,"height":50.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-20.png","element":"img","alt":"1≤k≤K�pj=1 |A1,jk| ≤ p∥A1∥∞","inline":true},{"text":", where ","element":"span"},{"style":{"height":16.3},"width":86.6,"height":40.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-21.png","element":"img","alt":" A1,jk","inline":true,"padRight":true},{"text":"is the ","element":"span"},{"text":"j, k ","element":"span"},{"text":"th cell in ","element":"span"},{"style":{"height":13.9},"width":50.56,"height":34.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-22.png","element":"img","alt":" A1","inline":true},{"text":".","element":"span"}],[{"id":"id-147","style":{"width":"99%"},"width":1869,"height":245,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/54-23.png","element":"img"}],[{"style":{"width":"91%"},"width":1710,"height":1072,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-0.png","element":"img"}],[{"text":"where we use ","element":"span"},{"style":{"height":13.1},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-1.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"norm definition for the first equality, and for the first inequality we use p.345 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013)","element":"a"},{"text":", which is ","element":"span"},{"style":{"height":16},"width":356.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-2.png","element":"img","alt":" ∥Ax∥1 ≤ ∥A∥l1∥x∥1","inline":true,"padRight":true},{"text":"for a generic matrix ","element":"span"},{"text":"A","element":"span"},{"text":", and generic vector ","element":"span"},{"text":"x","element":"span"},{"text":", for the third inequality we ","element":"span"},{"href":"#id-31","text":"use th","element":"a"},{"text":"e upper bound of ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-3.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"induced matri","element":"span"},{"href":"#id-17","text":"x norm in ","element":"a"},{"text":"t","element":"span"},{"href":"#id-17","text":"erms ","element":"a"},{"text":"of spectral norm, as in p.365 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson","element":"a"}],[{"style":{"width":"99%"},"width":1865,"height":182,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-4.png","element":"img"}],[{"text":"by Assumption ","element":"span"},{"href":"#id-48","text":"6 ","element":"a"},{"text":"that ","element":"span"},{"style":{"height":16.71},"width":163.96,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-5.png","element":"img","alt":" |bjk| ≤ C","inline":true,"padRight":true},{"text":"for a positive constant ","element":"span"},{"text":"C ","element":"span"},{"text":"and uniformly over ","element":"span"},{"text":"j ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":14},"width":162.6,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-6.png","element":"img","alt":", · · · , p, k","inline":true,"padRight":true},{"text":"= 1","element":"span"},{"style":{"height":14},"width":131.04,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-7.png","element":"img","alt":", · · · , K","inline":true},{"text":".","element":"span"}],[{"text":"Next, using the results above with Assumption ","element":"span"},{"href":"#id-12","text":"7, ","element":"a"},{"text":"we have","element":"span"}],[{"style":{"width":"15%"},"width":291,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-8.png","element":"img"}],[{"text":"(iii). This is proved in ","element":"span"},{"href":"#id-133","text":"(A.64)","element":"a"},{"text":".","element":"span"}],[{"text":"(iv). The proof of (iv) is the same as in (ii) above except, with ","element":"span"},{"style":{"height":13.1},"width":37.64,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-9.png","element":"img","alt":" bk","inline":true,"padRight":true},{"text":"as the ","element":"span"},{"text":"k","element":"span"},{"text":"th column of matrix ","element":"span"},{"text":"B","element":"span"},{"text":".","element":"span"}],[{"style":{"width":"27%"},"width":512,"height":62,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-10.png","element":"img"}],[{"text":"by Assumption ","element":"span"},{"href":"#id-48","text":"6.","element":"a"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-11.png","element":"img"}],[{"text":"Before the next lemma, we extend two following results which is Lemma B.4 in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011) ","element":"a"},{"text":"to the case of increasing maximal eigenvalue of errors.","element":"span"}],[{"id":"id-134","text":"Lemma A.11. ","element":"span"},{"text":"Under Assumptions ","element":"span"},{"href":"#id-41","text":"4,","element":"a"},{"href":"#id-48","text":"6, ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i), with ","element":"span"},{"text":"c > ","element":"span"},{"text":"0","element":"span"},{"text":", C > ","element":"span"},{"text":"0","element":"span"},{"text":", and positive finite constants (i). ","element":"span"},{"text":"Eigmin(","element":"span"},{"style":{"height":16},"width":174.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-12.png","element":"img","alt":"B′ΩB) ≥","inline":true,"padRight":true},{"text":"cp ","element":"span"},{"style":{"height":13.1},"width":68.96,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-13.png","element":"img","alt":"Crn","inline":true,"padRight":true},{"text":".","element":"span"}],[{"style":{"width":"65%"},"width":1226,"height":146,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/55-14.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-134","text":"A.11","element":"a"},{"text":". We follow the proof of Lemma B.4 in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":". (i). Since ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-0.png","element":"img","alt":" Ω","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.54},"width":74.08,"height":46.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-1.png","element":"img","alt":" Σ−1n","inline":true,"padRight":true},{"text":",","element":"span"}],[{"style":{"width":"73%"},"width":1383,"height":141,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-2.png","element":"img"}],[{"text":"by Assumption ","element":"span"},{"href":"#id-48","text":"6, ","element":"a"},{"href":"#id-12","text":"7(","element":"a"},{"text":"i).","element":"span"}],[{"text":"(ii). Using Assumption ","element":"span"},{"href":"#id-41","text":"4","element":"a"}],[{"style":{"width":"76%"},"width":1437,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-3.png","element":"img"}],[{"text":"We have the desired result by ","element":"span"},{"href":"#id-135","text":"(A.65)","element":"a"},{"text":", and since for an invertible matrix A, Eigmax","element":"span"},{"style":{"height":28.8},"width":124.8,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-4.png","element":"img","alt":"�A−1�","inline":true},{"text":"= 1","element":"span"},{"text":"/","element":"span"},{"text":"Eigmin(","element":"span"},{"text":"A","element":"span"},{"text":").","element":"span"}],[{"style":{"width":"21%"},"width":403,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-5.png","element":"img"}],[{"text":"As described above in the main text, we form the symmetrized version of our feasible nodewise regression estimator for this part of the paper: ","element":"span"},{"style":{"height":15.51},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-6.png","element":"img","alt":"�Ωsym","inline":true,"padRight":true},{"text":":=","element":"span"},{"style":{"height":19.51},"width":86.32,"height":48.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-7.png","element":"img","alt":"�Ω+�Ω′2","inline":true,"padRight":true},{"text":".","element":"span"}],[{"id":"id-136","style":{"width":"80%"},"width":1503,"height":378,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-8.png","element":"img"}],[{"style":{"height":10.8},"width":121.4,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-9.png","element":"img","alt":"with �G","inline":true,"padRight":true},{"text":":=","element":"span"},{"style":{"height":28.8},"width":24,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-10.png","element":"img","alt":"�","inline":true},{"text":"[ ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-11.png","element":"img","alt":"�","inline":true},{"text":"cov(","element":"span"},{"style":{"height":17.36},"width":108.64,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-12.png","element":"img","alt":"f t)]−1","inline":true,"padRight":true},{"text":"+ ","element":"span"},{"style":{"height":32.34},"width":258.24,"height":80.84,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-13.png","element":"img","alt":"�B′ �Ωsym �B�−1.","inline":true}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-136","text":"A.12","element":"a"},{"text":". (i). We start with simple adding and subtracting( ","element":"span"},{"style":{"height":10.8},"width":35,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-14.png","element":"img","alt":"�B","inline":true,"padRight":true},{"text":"= ( ","element":"span"},{"style":{"height":10.8},"width":122.36,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-15.png","element":"img","alt":"�B − B","inline":true},{"text":") + ","element":"span"},{"text":"B","element":"span"},{"text":"), ","element":"span"},{"style":{"height":15.5},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-16.png","element":"img","alt":"�Ωsym","inline":true,"padRight":true},{"text":"= (","element":"span"},{"style":{"height":15.5},"width":177.48,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-17.png","element":"img","alt":"�Ωsym − Ω","inline":true},{"text":") + ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-18.png","element":"img","alt":" Ω","inline":true},{"text":") and the triangle inequality. Hence,","element":"span"}],[{"style":{"width":"89%"},"width":1680,"height":201,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-19.png","element":"img"}],[{"text":"Analyze each term in ","element":"span"},{"href":"#id-137","text":"(A.66)","element":"a"},{"text":", and by Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"ii)(iv)","element":"span"}],[{"style":{"width":"92%"},"width":1726,"height":436,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-20.png","element":"img"}],[{"text":"where we use (B.14) of Fan et al (2011) which is: ","element":"span"},{"style":{"height":16.7},"width":295.4,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-21.png","element":"img","alt":" ∥ �B − B∥∞ = Op","inline":true}],[{"style":{"width":"71%"},"width":1342,"height":78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-22.png","element":"img"}],[{"text":"since ","element":"span"},{"style":{"height":13.1},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-23.png","element":"img","alt":" l1","inline":true,"padRight":true},{"text":"norm of transpose of ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-24.png","element":"img","alt":"�Ω","inline":true,"padRight":true},{"text":"involves rows of ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-25.png","element":"img","alt":"�Ω","inline":true,"padRight":true},{"text":"(hence columns of ","element":"span"},{"style":{"height":11.71},"width":47.12,"height":29.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/56-26.png","element":"img","alt":"�Ω′","inline":true},{"text":").","element":"span"}],[{"text":"For the second term in ","element":"span"},{"href":"#id-137","text":"(A.66)","element":"a"}],[{"style":{"width":"85%"},"width":1609,"height":475,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-0.png","element":"img"}],[{"text":"where we use, ","element":"span"},{"style":{"height":15.5},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-1.png","element":"img","alt":"�Ωsym","inline":true},{"text":", Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"ii)(iv) for the first-second inequalities, (B.14) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":", Assumption ","element":"span"},{"href":"#id-48","text":"6, ","element":"a"},{"text":"and Theorem ","element":"span"},{"href":"#id-43","text":"1 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-139","text":"(A.68) ","element":"a"},{"text":"for the rates. Now consider the third term in ","element":"span"},{"href":"#id-137","text":"(A.66)","element":"a"}],[{"style":{"width":"90%"},"width":1700,"height":153,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-2.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"ii) for the first inequality, (B.14) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":", and ","element":"span"},{"style":{"height":16},"width":110.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-3.png","element":"img","alt":" ∥Ω∥l∞","inline":true,"padRight":true},{"text":":= max","element":"span"},{"style":{"height":18.43},"width":211.36,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-4.png","element":"img","alt":"1≤j≤p ∥Ω′j∥1","inline":true,"padRight":true},{"text":"= ","element":"span"},{"text":"max","element":"span"},{"style":{"height":28.8},"width":420.96,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-5.png","element":"img","alt":"1≤j≤p ∥Ωj∥1 = O�¯s1/2�","inline":true},{"text":"as in ","element":"span"},{"href":"#id-97","text":"(A.12)","element":"a"},{"text":". We consider the fourth term in ","element":"span"},{"href":"#id-137","text":"(A.66)","element":"a"}],[{"id":"id-140","style":{"width":"81%"},"width":1530,"height":277,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-6.png","element":"img"}],[{"text":"where we use symmetry of ","element":"span"},{"style":{"height":15.5},"width":93.28,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-7.png","element":"img","alt":"�Ωsym","inline":true},{"text":", Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"ii)(iv) for the first and second inequality, and Assumption ","element":"span"},{"href":"#id-48","text":"6, ","element":"a"},{"text":"and Theorem 1 ","element":"span"},{"href":"#id-139","text":"(A.68) ","element":"a"},{"text":"for the rates. Also analyze the fifth term in ","element":"span"},{"href":"#id-137","text":"(A.66)","element":"a"}],[{"id":"id-141","style":{"width":"90%"},"width":1699,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-8.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"ii) for the inequality, and the rates are by (B.14) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":", Assumption ","element":"span"},{"href":"#id-48","text":"6, ","element":"a"},{"text":"and ","element":"span"},{"style":{"height":16},"width":110.52,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-9.png","element":"img","alt":" ∥Ω∥l∞","inline":true,"padRight":true},{"text":":= max","element":"span"},{"style":{"height":18.43},"width":211.36,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-10.png","element":"img","alt":"1≤j≤p ∥Ω′j∥1","inline":true,"padRight":true},{"text":"= max","element":"span"},{"style":{"height":28.8},"width":434.88,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-11.png","element":"img","alt":"1≤j≤p ∥Ωj∥1 = O�¯s1/2�","inline":true},{"text":"as in ","element":"span"},{"href":"#id-97","text":"(A.12)","element":"a"},{"text":". The slowest rate is the","element":"span"}],[{"text":"maximum of the rates ","element":"span"},{"href":"#id-140","text":"(A.71) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-141","text":"(A.72) ","element":"a"},{"text":"above. So,","element":"span"}],[{"style":{"width":"79%"},"width":1484,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-12.png","element":"img"}],[{"text":"Then, by norm inequality tying spectral norm to ","element":"span"},{"style":{"height":16},"width":83.36,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-13.png","element":"img","alt":" ∥.∥∞","inline":true,"padRight":true},{"text":"norm in p.365 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013)","element":"a"},{"text":", and since ","element":"span"},{"style":{"height":16.42},"width":344.12,"height":41.04,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-14.png","element":"img","alt":"�B′ �Ωsym �B − B′ΩB","inline":true,"padRight":true},{"text":"is ","element":"span"},{"style":{"height":10.8},"width":121.44,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-15.png","element":"img","alt":" K × K","inline":true,"padRight":true},{"text":"matrix","element":"span"}],[{"id":"id-142","style":{"width":"96%"},"width":1811,"height":195,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-16.png","element":"img"}],[{"text":"(ii). ","element":"span"},{"text":"Since ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-17.png","element":"img","alt":"�","inline":true},{"text":"cov(","element":"span"},{"style":{"height":25.42},"width":110.36,"height":63.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-18.png","element":"img","alt":"f t)−1,","inline":true,"padRight":true},{"text":"(cov(","element":"span"},{"style":{"height":17.55},"width":112.96,"height":43.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-19.png","element":"img","alt":"f t))−1","inline":true,"padRight":true},{"text":"does not involve the precision matrix estimator, we proceed as in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":", Lemma B5(ii). Specifically (B.20) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011) ","element":"a"},{"text":"provide","element":"span"}],[{"style":{"width":"45%"},"width":843,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/57-20.png","element":"img"}],[{"text":"Using ","element":"span"},{"href":"#id-142","text":"(A.74) ","element":"a"},{"text":"and the equation above we develop a larger bound","element":"span"}],[{"id":"id-149","style":{"height":16},"width":20,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-0.png","element":"img","alt":"∥","inline":true},{"text":"([ ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-1.png","element":"img","alt":"�","inline":true},{"text":"cov(","element":"span"},{"style":{"height":18.32},"width":108.64,"height":45.8,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-2.png","element":"img","alt":"f t)]−1","inline":true,"padRight":true},{"text":"+ ","element":"span"},{"style":{"height":16.7},"width":231.16,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-3.png","element":"img","alt":"�B′ �Ωsym �B)−","inline":true},{"text":"([cov(","element":"span"},{"style":{"height":19.02},"width":433.16,"height":47.56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-4.png","element":"img","alt":"f t)]−1 +B′ΩB)∥l2 = Op","inline":true}],[{"style":{"width":"42%"},"width":794,"height":48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-5.png","element":"img"}],[{"text":"Note that","element":"span"}],[{"id":"id-143","style":{"width":"99%"},"width":1867,"height":312,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-6.png","element":"img"}],[{"text":"Then using Lemma A.1(i) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":", with ","element":"span"},{"href":"#id-135","text":"(A.65) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-143","text":"(A.76)","element":"a"}],[{"id":"id-144","style":{"width":"69%"},"width":1294,"height":79,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-7.png","element":"img"}],[{"text":"wpa1 with ","element":"span"},{"style":{"height":12},"width":145.36,"height":30,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-8.png","element":"img","alt":" rn << p","inline":true,"padRight":true},{"text":"as in Assumption ","element":"span"},{"href":"#id-12","text":"7. ","element":"a"},{"text":"By ","element":"span"},{"href":"#id-144","text":"(A.77)","element":"a"},{"text":", and seeing that for invertible matrix ","element":"span"},{"text":"A","element":"span"},{"text":", Eigmax(","element":"span"},{"style":{"height":14.51},"width":76,"height":36.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-9.png","element":"img","alt":"A−1","inline":true},{"text":") =","element":"span"}],[{"text":"1","element":"span"},{"text":"/","element":"span"},{"text":"Eigmin(","element":"span"},{"text":"A","element":"span"},{"text":"),","element":"span"}],[{"style":{"width":"70%"},"width":1329,"height":160,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-10.png","element":"img"}],[{"text":"We restate the definitions of major terms that are used.","element":"span"}],[{"id":"id-148","style":{"width":"96%"},"width":1808,"height":289,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-11.png","element":"img"}],[{"text":"and","element":"span"}],[{"id":"id-165","style":{"width":"75%"},"width":1414,"height":107,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-12.png","element":"img"}],[{"text":"We have the next lemma which will be instrumental in proving Theorem 2.","element":"span"}],[{"id":"id-145","style":{"width":"63%"},"width":1187,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-13.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-145","text":"A.13","element":"a"},{"text":". Start with, by adding and subtracting and triangle inequality","element":"span"}],[{"id":"id-146","style":{"width":"78%"},"width":1478,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-14.png","element":"img"}],[{"text":"Consider the first term in ","element":"span"},{"href":"#id-146","text":"(A.82)","element":"a"}],[{"style":{"width":"88%"},"width":1661,"height":297,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/58-15.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"i) for the first inequality, Lemma ","element":"span"},{"href":"#id-147","text":"A.10-","element":"a"},{"href":"#id-136","text":"A.12, ","element":"a"},{"text":"and (B.14) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":":","element":"span"}],[{"style":{"width":"99%"},"width":1863,"height":321,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-0.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"i) for the first inequality, and for the rates use Lemma ","element":"span"},{"href":"#id-147","text":"A.10-","element":"a"},{"href":"#id-136","text":"A.12, ","element":"a"},{"text":"and Assumption ","element":"span"},{"href":"#id-48","text":"6 ","element":"a"},{"text":"which shows that factor loadings are uniformly bounded away from infinity. Analyze the third term in ","element":"span"},{"href":"#id-146","text":"(A.82)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"90%"},"width":1685,"height":186,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-1.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"i) for the first inequality, Lemma ","element":"span"},{"href":"#id-147","text":"A.10-","element":"a"},{"href":"#id-136","text":"A.12, ","element":"a"},{"text":"and (B.14) of ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011)","element":"a"},{"text":":","element":"span"}],[{"id":"id-151","style":{"width":"99%"},"width":1864,"height":186,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-2.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"i). ","element":"span"},{"text":"We have from ","element":"span"},{"href":"#id-148","text":"(A.78)","element":"a"},{"href":"#id-148","text":"(A.79) ","element":"a"},{"text":"and by submultiplicativity of ","element":"span"},{"style":{"height":13.11},"width":28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-3.png","element":"img","alt":" l2","inline":true,"padRight":true},{"text":"matrix norm","element":"span"}],[{"text":"(spectral norm)","element":"span"}],[{"id":"id-150","style":{"width":"89%"},"width":1672,"height":336,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-4.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-136","text":"A.12, ","element":"a"},{"text":"and ","element":"span"},{"style":{"height":16},"width":291.28,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-5.png","element":"img","alt":" ∥G∥l2 = O(rn/p","inline":true},{"text":") by Lemma ","element":"span"},{"href":"#id-134","text":"A.11, ","element":"a"},{"href":"#id-149","text":"(A.75)","element":"a"},{"text":". Substitute ","element":"span"},{"href":"#id-150","text":"(A.87) ","element":"a"},{"text":"into ","element":"span"},{"href":"#id-151","text":"(A.86) ","element":"a"},{"text":"via Lemma ","element":"span"},{"href":"#id-147","text":"A.10","element":"a"}],[{"style":{"width":"84%"},"width":1586,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-6.png","element":"img"}],[{"text":"Since the last rate is the slowest among all on the right side of ","element":"span"},{"href":"#id-146","text":"(A.82) ","element":"a"},{"text":"we have the desired result.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-7.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-88","text":"2","element":"a"},{"text":". From ","element":"span"},{"href":"#id-47","text":"(21)","element":"a"},{"text":", and using triangle inequality","element":"span"}],[{"id":"id-152","style":{"width":"92%"},"width":1735,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-8.png","element":"img"}],[{"text":"We consider second right side term in ","element":"span"},{"href":"#id-152","text":"(A.89)","element":"a"},{"text":". Add and subtract ","element":"span"},{"style":{"height":17.23},"width":113.16,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-9.png","element":"img","alt":" Ω′j �L�Ω","inline":true,"padRight":true},{"text":"via triangle inequality","element":"span"}],[{"id":"id-153","style":{"width":"86%"},"width":1618,"height":78,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/59-10.png","element":"img"}],[{"style":{"width":"0%"},"width":7,"height":2,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-0.png","element":"img"}],[{"text":"We analyze the first term on the right side of ","element":"span"},{"href":"#id-153","text":"(A.90) ","element":"a"},{"text":"and try to simplify by adding and subtracting (","element":"span"},{"style":{"height":15.5},"width":90.52,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-1.png","element":"img","alt":"�Ωj −","inline":true}],[{"style":{"height":16.7},"width":139.56,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-2.png","element":"img","alt":"Ωj)′ �LΩ","inline":true},{"text":", and triangle inequality","element":"span"}],[{"id":"id-154","style":{"width":"91%"},"width":1717,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-3.png","element":"img"}],[{"text":"Then on the first right side term in ","element":"span"},{"href":"#id-154","text":"(A.91) ","element":"a"},{"text":"add and subtract (","element":"span"},{"style":{"height":16.7},"width":335.4,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-4.png","element":"img","alt":"�Ωj − Ωj)′L(�Ω − Ω","inline":true},{"text":") via triangle inequality","element":"span"}],[{"style":{"width":"96%"},"width":1802,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-5.png","element":"img"}],[{"text":"Now for the second right side term in ","element":"span"},{"href":"#id-154","text":"(A.91) ","element":"a"},{"text":"add and subtract (","element":"span"},{"style":{"height":16.7},"width":237.96,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-6.png","element":"img","alt":"�Ωj − Ωj)′LΩ","inline":true,"padRight":true},{"text":"via triangle inequality","element":"span"}],[{"style":{"width":"78%"},"width":1462,"height":72,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-7.png","element":"img"}],[{"text":"Substitute the last two inequalities into ","element":"span"},{"href":"#id-154","text":"(A.91)","element":"a"}],[{"id":"id-155","style":{"width":"80%"},"width":1511,"height":390,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-8.png","element":"img"}],[{"text":"Now in ","element":"span"},{"href":"#id-153","text":"(A.90) ","element":"a"},{"text":"we consider the second term on the right side, add and subtract ","element":"span"},{"style":{"height":17.04},"width":113.16,"height":42.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-9.png","element":"img","alt":" Ω′jL�Ω","inline":true,"padRight":true},{"text":"via triangle inequality","element":"span"}],[{"id":"id-156","style":{"width":"99%"},"width":1868,"height":471,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-10.png","element":"img"}],[{"text":"Combine ","element":"span"},{"href":"#id-155","text":"(A.92)","element":"a"},{"href":"#id-156","text":"(A.94) ","element":"a"},{"text":"into ","element":"span"},{"href":"#id-153","text":"(A.90) ","element":"a"},{"text":"right side to have","element":"span"}],[{"id":"id-157","style":{"width":"93%"},"width":1754,"height":347,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-11.png","element":"img"}],[{"text":"To consider all the terms in ","element":"span"},{"href":"#id-157","text":"(A.95) ","element":"a"},{"text":"we need to find some rates about terms. In that respect,","element":"span"}],[{"id":"id-161","style":{"width":"74%"},"width":1393,"height":227,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/60-12.png","element":"img"}],[{"text":"where we use definition of ","element":"span"},{"text":"L ","element":"span"},{"text":"for the first equality in ","element":"span"},{"href":"#id-148","text":"(A.80)","element":"a"},{"text":", ","element":"span"},{"text":"G ","element":"span"},{"text":"is defined in ","element":"span"},{"href":"#id-148","text":"(A.79)","element":"a"},{"text":", and we use submultiplicativity of ","element":"span"},{"style":{"height":13.1},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-0.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"norm for the first inequality, and the relation between spectral norm and ","element":"span"},{"style":{"height":13.1},"width":44,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-1.png","element":"img","alt":" l∞","inline":true,"padRight":true},{"text":"norm from p.365 of ","element":"span"},{"href":"#id-31","text":"Horn and Johnson ","element":"a"},{"href":"#id-31","text":"(2013) ","element":"a"},{"text":"for the second inequality, and the rates are from ","element":"span"},{"href":"#id-133","text":"(A.64)","element":"a"},{"text":", Lem","element":"span"},{"href":"#id-8","text":"ma A.10, Lemma","element":"a"}],[{"href":"#id-134","text":"A.11 ","element":"a"},{"text":"and ","element":"span"},{"text":"G ","element":"span"},{"text":"definition. Next we need the following, by using the same analysis in (B.55) of ","element":"span"},{"href":"#id-8","text":"Caner and Kock ","element":"a"},{"href":"#id-8","text":"(2018) ","element":"a"},{"text":"via strict stationary of the data, or ","element":"span"},{"href":"#id-97","text":"(A.12) ","element":"a"},{"text":"here","element":"span"}],[{"id":"id-162","style":{"width":"71%"},"width":1344,"height":67,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-2.png","element":"img"}],[{"text":"We consider each term on the right side of ","element":"span"},{"href":"#id-157","text":"(A.95)","element":"a"},{"text":".","element":"span"}],[{"id":"id-158","style":{"height":16.7},"width":783.16,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-3.png","element":"img","alt":"max1≤j≤p∥(�Ωj − Ωj)′(�L − L)(�Ω − Ω)∥1 ≤","inline":true,"padRight":true},{"text":"[ max","element":"span"},{"style":{"height":26.9},"width":507,"height":67.24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-4.png","element":"img","alt":"1≤j≤p ∥�Ωj − Ωj∥21]∥�L − L∥l∞","inline":true,"padRight":true},{"text":"(A.98) = ","element":"span"},{"text":"[","element":"span"},{"style":{"height":15.5},"width":47.24,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-5.png","element":"img","alt":"Op","inline":true,"padRight":true},{"text":"(¯","element":"span"},{"style":{"height":18.83},"width":232.28,"height":47.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-6.png","element":"img","alt":"sλn)]2Op(ln),","inline":true,"padRight":true},{"text":"(A.99)","element":"span"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"iii), and","element":"span"}],[{"id":"id-159","style":{"width":"74%"},"width":1400,"height":77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-7.png","element":"img"}],[{"text":"for the inequality in ","element":"span"},{"href":"#id-158","text":"(A.98) ","element":"a"},{"text":"and use Lemma ","element":"span"},{"href":"#id-145","text":"A.13, ","element":"a"},{"text":"and Theorem ","element":"span"},{"href":"#id-43","text":"1 ","element":"a"},{"text":"for the rates.","element":"span"}],[{"text":"We consider the second term on the right side of ","element":"span"},{"href":"#id-157","text":"(A.95)","element":"a"},{"text":".","element":"span"}],[{"id":"id-160","style":{"width":"80%"},"width":1510,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-8.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"iii), and ","element":"span"},{"href":"#id-159","text":"(A.100) ","element":"a"},{"text":"for the inequality in ","element":"span"},{"href":"#id-160","text":"(A.101) ","element":"a"},{"text":"and use ","element":"span"},{"href":"#id-161","text":"(A.96)","element":"a"},{"text":", and Theorem ","element":"span"},{"href":"#id-43","text":"1 ","element":"a"},{"text":"for","element":"span"}],[{"text":"the rates. We analyze the third term on the right side of ","element":"span"},{"href":"#id-157","text":"(A.95)","element":"a"}],[{"style":{"width":"83%"},"width":1572,"height":141,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-9.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"iii) for the first inequality, and the rates are by ","element":"span"},{"href":"#id-162","text":"(A.97)","element":"a"},{"text":", Lemma ","element":"span"},{"href":"#id-145","text":"A.13, ","element":"a"},{"text":"Theorem 1.","element":"span"}],[{"text":"Now consider the fourth term on the right side of ","element":"span"},{"href":"#id-157","text":"(A.95)","element":"a"}],[{"style":{"width":"78%"},"width":1476,"height":164,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-10.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"iii) for the inequality, and Theorem ","element":"span"},{"href":"#id-43","text":"1, ","element":"a"},{"href":"#id-161","text":"(A.96)","element":"a"},{"href":"#id-162","text":"(A.97) ","element":"a"},{"text":"for the rate. Now consider the fifth term on the right side of ","element":"span"},{"href":"#id-157","text":"(A.95)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"82%"},"width":1542,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-11.png","element":"img"}],[{"text":"where we use lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"iii) for the first inequality, and Theorem ","element":"span"},{"href":"#id-43","text":"1, ","element":"a"},{"text":"Lemma ","element":"span"},{"href":"#id-145","text":"A.13, ","element":"a"},{"href":"#id-162","text":"(A.97) ","element":"a"},{"text":"for the rates. Consider the sixth term on the right side of ","element":"span"},{"href":"#id-157","text":"(A.95)","element":"a"}],[{"id":"id-164","style":{"width":"79%"},"width":1487,"height":73,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/61-12.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"iii) for the inequality, and use ","element":"span"},{"href":"#id-162","text":"(A.97)","element":"a"},{"text":", and Lemma ","element":"span"},{"href":"#id-145","text":"A.13 ","element":"a"},{"text":"for the rates. Now analyze","element":"span"}],[{"text":"the seventh term on the right side of ","element":"span"},{"href":"#id-157","text":"(A.95)","element":"a"}],[{"id":"id-163","style":{"width":"77%"},"width":1450,"height":141,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-0.png","element":"img"}],[{"text":"where we use Lemma ","element":"span"},{"href":"#id-138","text":"A.9(","element":"a"},{"text":"iii) for the inequality, and for the rates we use ","element":"span"},{"href":"#id-161","text":"(A.96)","element":"a"},{"href":"#id-162","text":"(A.97) ","element":"a"},{"text":"Theorem 1. Note that among all ","element":"span"},{"href":"#id-158","text":"(A.99)","element":"a"},{"text":"-","element":"span"},{"href":"#id-163","text":"(A.106)","element":"a"},{"text":", the slowest rate is by ","element":"span"},{"href":"#id-164","text":"(A.105) ","element":"a"},{"text":"by the definition of ","element":"span"},{"style":{"height":13.1},"width":32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-1.png","element":"img","alt":" ln","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-165","text":"(A.81) ","element":"a"},{"text":"and by Assumption","element":"span"}],[{"id":"id-166","style":{"width":"99%"},"width":1867,"height":218,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-2.png","element":"img"}],[{"text":"This ends the proof of (i) with using Theorem 1 and ","element":"span"},{"href":"#id-166","text":"(A.107) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-152","text":"(A.89)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"53%"},"width":1006,"height":100,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-3.png","element":"img"}],[{"text":"as in ","element":"span"},{"href":"#id-17","text":"Fan et al. ","element":"a"},{"href":"#id-17","text":"(2011) ","element":"a"},{"text":"with ","element":"span"},{"style":{"height":10},"width":86.4,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-4.png","element":"img","alt":" yt, ut","inline":true,"padRight":true},{"text":"being ","element":"span"},{"style":{"height":11.2},"width":61.72,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-5.png","element":"img","alt":" p ×","inline":true,"padRight":true},{"text":"1 vector of asset returns, and errors respectively at time ","element":"span"},{"text":"t ","element":"span"},{"text":"= 1","element":"span"},{"style":{"height":10},"width":119.04,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-6.png","element":"img","alt":", · · · , n","inline":true},{"text":".","element":"span"}],[{"style":{"width":"40%"},"width":765,"height":125,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-7.png","element":"img"}],[{"text":"by Assumption ","element":"span"},{"href":"#id-38","text":"1. ","element":"a"},{"text":"Consider","element":"span"}],[{"style":{"width":"56%"},"width":1064,"height":396,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-8.png","element":"img"}],[{"text":"Clearly, by the proof of Lemma ","element":"span"},{"href":"#id-89","text":"A.1(","element":"a"},{"text":"i) here we have ","element":"span"},{"style":{"height":16},"width":408.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-9.png","element":"img","alt":" ∥Ax∥∞ ≤ ∥A∥l∞∥x∥∞","inline":true,"padRight":true},{"text":"for a generic vector ","element":"span"},{"text":"x","element":"span"},{"text":", and a matrix ","element":"span"},{"text":"A","element":"span"},{"text":". Then, by Lemma ","element":"span"},{"href":"#id-147","text":"A.10(","element":"a"},{"text":"iii) and Theorem A.1 we get the rate.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-10.png","element":"img"}],[{"text":"Part 4","element":"span"}],[{"text":"First, we start with a maximal eigenvalue bound which will be used in the proof of Theorem 8. Here, we","element":"span"}],[{"text":"provide the rate for maximal eigenvalue of covariance matrix of returns ","element":"span"},{"style":{"height":15.5},"width":49.12,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-11.png","element":"img","alt":" Σy","inline":true},{"text":". See that","element":"span"}],[{"text":"Eigmax(","element":"span"},{"style":{"height":16.7},"width":110.2,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-12.png","element":"img","alt":"Σy) ≤","inline":true,"padRight":true},{"text":"Eigmax[","element":"span"},{"text":"B","element":"span"},{"text":"cov(","element":"span"},{"style":{"height":16},"width":93.2,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-13.png","element":"img","alt":"f)B′","inline":true},{"text":"] + Eigmax(","element":"span"},{"style":{"height":16},"width":112.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-14.png","element":"img","alt":"Σn) ≤","inline":true,"padRight":true},{"text":"Eigmax(cov(","element":"span"},{"text":"f","element":"span"},{"text":"))Eigmax(","element":"span"},{"style":{"height":10.8},"width":86.96,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-15.png","element":"img","alt":"BB′","inline":true},{"text":") + Eigmax(","element":"span"},{"style":{"height":16},"width":81.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-16.png","element":"img","alt":"Σn).","inline":true}],[{"text":"Since by Assumption ","element":"span"},{"href":"#id-12","text":"7, ","element":"a"},{"style":{"height":16},"width":134.08,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-17.png","element":"img","alt":" rn/p →","inline":true,"padRight":true},{"text":"0, and ","element":"span"},{"href":"#id-69","text":"Eigmax(","element":"a"},{"href":"#id-69","style":{"height":16},"width":199.04,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-18.png","element":"img","alt":"Σn) ≤ Crn","inline":true},{"text":", with the above inequality and specifically by","element":"span"}],[{"id":"id-198","style":{"width":"99%"},"width":1868,"height":162,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/62-19.png","element":"img"}],[{"text":"This i","element":"span"},{"href":"#id-69","text":"s true for c","element":"a"},{"href":"#id-69","text":"ov(","element":"a"},{"text":"f","element":"span"},{"text":") = ","element":"span"},{"style":{"height":13.1},"width":40.52,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-0.png","element":"img","alt":" Ik","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-69","text":"Fan et al. ","element":"a"},{"href":"#id-69","text":"(2013)","element":"a"},{"text":". The result holds for general cov(","element":"span"},{"text":"f","element":"span"},{"text":") as discussed in section","element":"span"}],[{"style":{"width":"44%"},"width":842,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-1.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-167","text":"3","element":"a"},{"text":". First, we start with definitions of ","element":"span"},{"style":{"height":11.6},"width":35,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-2.png","element":"img","alt":"�A","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.24},"width":208.68,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-3.png","element":"img","alt":" 1′p�Γ1p/p, �F","inline":true,"padRight":true},{"text":":= 1","element":"span"},{"style":{"height":18.24},"width":174.2,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-4.png","element":"img","alt":"′p�Γ�µ/p, A","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":201.04,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-5.png","element":"img","alt":" 1′pΣ−1y 1p/p","inline":true},{"text":", ","element":"span"},{"text":"F ","element":"span"},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":187.6,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-6.png","element":"img","alt":" 1′pΣ−1y µ/p","inline":true},{"text":".","element":"span"}],[{"id":"id-168","style":{"width":"74%"},"width":1404,"height":281,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-7.png","element":"img"}],[{"text":"Now consider the numerator in ","element":"span"},{"href":"#id-168","text":"(A.109)","element":"a"},{"text":":","element":"span"}],[{"id":"id-169","style":{"width":"88%"},"width":1664,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-8.png","element":"img"}],[{"text":"Analyze the first term on the right side of ","element":"span"},{"href":"#id-169","text":"(A.110)","element":"a"},{"text":":","element":"span"}],[{"id":"id-172","style":{"width":"72%"},"width":1359,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-9.png","element":"img"}],[{"text":"Then, by Lemma ","element":"span"},{"href":"#id-64","text":"B.3 ","element":"a"},{"text":"in Supplement B, via Assumption ","element":"span"},{"href":"#id-58","text":"8","element":"a"}],[{"id":"id-170","style":{"width":"63%"},"width":1183,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-10.png","element":"img"}],[{"text":"Then,","element":"span"}],[{"id":"id-171","style":{"width":"69%"},"width":1309,"height":124,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-11.png","element":"img"}],[{"text":"where we use ","element":"span"},{"href":"#id-170","text":"(A.112) ","element":"a"},{"text":"and Lemma ","element":"span"},{"href":"#id-105","text":"B.5 ","element":"a"},{"text":"in Supplement B. By ","element":"span"},{"href":"#id-170","text":"(A.112)","element":"a"},{"href":"#id-171","text":"(A.113) ","element":"a"},{"text":"and Lemma ","element":"span"},{"href":"#id-105","text":"B.5 ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-172","text":"(A.111)","element":"a"},{"text":", we have ","element":"span"},{"style":{"height":14.13},"width":47.2,"height":35.32,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-12.png","element":"img","alt":"�F 2","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":16.7},"width":127.16,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-13.png","element":"img","alt":" Op(K).","inline":true,"padRight":true},{"text":"(A.114)","element":"span"}],[{"text":"Then, by Lemma ","element":"span"},{"href":"#id-173","text":"B.2 ","element":"a"},{"text":"in Supplement B and ","element":"span"},{"href":"#id-174","text":"(A.114)","element":"a"},{"text":",","element":"span"}],[{"id":"id-174","style":{"width":"71%"},"width":1338,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-14.png","element":"img"}],[{"text":"Then, the second term on the right side of ","element":"span"},{"href":"#id-169","text":"(A.110) ","element":"a"},{"text":"is","element":"span"}],[{"id":"id-175","style":{"width":"82%"},"width":1545,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-15.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-170","text":"(A.112)","element":"a"},{"href":"#id-171","text":"(A.113) ","element":"a"},{"text":"and Lemma ","element":"span"},{"href":"#id-173","text":"B.2, ","element":"a"},{"text":"Lemma ","element":"span"},{"href":"#id-105","text":"B.5 ","element":"a"},{"text":"in Supplement B, and the last equality is by Assumption ","element":"span"},{"href":"#id-58","text":"8. ","element":"a"},{"text":"Use ","element":"span"},{"href":"#id-174","text":"(A.115)","element":"a"},{"href":"#id-175","text":"(A.116) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-169","text":"(A.110) ","element":"a"},{"text":"with Assumption ","element":"span"},{"href":"#id-58","text":"8","element":"a"}],[{"id":"id-176","style":{"width":"67%"},"width":1256,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-16.png","element":"img"}],[{"text":"Now consider the denominator in ","element":"span"},{"href":"#id-168","text":"(A.109)","element":"a"},{"text":". Note that","element":"span"}],[{"style":{"width":"45%"},"width":846,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/63-17.png","element":"img"}],[{"text":"So by Assumption ","element":"span"},{"href":"#id-58","text":"8(","element":"a"},{"text":"ii)","element":"span"}],[{"id":"id-177","style":{"width":"57%"},"width":1073,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-0.png","element":"img"}],[{"text":"Next","element":"span"}],[{"id":"id-178","style":{"width":"65%"},"width":1222,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-1.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-174","text":"(A.115) ","element":"a"},{"text":"and Assumption ","element":"span"},{"href":"#id-58","text":"8. ","element":"a"},{"text":"Combine ","element":"span"},{"href":"#id-176","text":"(A.117) ","element":"a"},{"text":"with ","element":"span"},{"href":"#id-177","text":"(A.118)","element":"a"},{"href":"#id-178","text":"(A.119) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-168","text":"(A.109) ","element":"a"},{"text":"to obtain the desired result.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-2.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-179","text":"4","element":"a"},{"text":". To ease the notation in the proofs, set ","element":"span"},{"style":{"height":13.36},"width":161.92,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-3.png","element":"img","alt":" AD − F 2","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":17.2},"width":253.6,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-4.png","element":"img","alt":" z, Aρ21 − 2Fρ1","inline":true,"padRight":true},{"text":"+ ","element":"span"},{"text":"D ","element":"span"},{"text":"= ","element":"span"},{"text":"v","element":"span"},{"text":". The ","element":"span"},{"text":"estimates will be ","element":"span"},{"style":{"height":6.8},"width":21.44,"height":17,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-5.png","element":"img","alt":" �z","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":16.56},"width":206.72,"height":41.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-6.png","element":"img","alt":"�A �D − �F 2, �v","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":17.39},"width":288.36,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-7.png","element":"img","alt":"�Aρ21 − 2 �Fρ1 + �D","inline":true},{"text":". Then,","element":"span"}],[{"id":"id-180","style":{"width":"68%"},"width":1284,"height":257,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-8.png","element":"img"}],[{"text":"First, analyze the denominator of ","element":"span"},{"href":"#id-180","text":"(A.120)","element":"a"},{"text":".","element":"span"}],[{"id":"id-183","style":{"width":"70%"},"width":1319,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-9.png","element":"img"}],[{"text":"Then, by Lemma ","element":"span"},{"href":"#id-173","text":"B.2-","element":"a"},{"href":"#id-120","text":"B.4 ","element":"a"},{"text":"in Supplement B, triangle inequality and ","element":"span"},{"style":{"height":10},"width":36.64,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-10.png","element":"img","alt":" ρ1","inline":true,"padRight":true},{"text":"being bounded away from zero and","element":"span"}],[{"text":"finite, by Assumption ","element":"span"},{"href":"#id-58","text":"8,","element":"a"}],[{"id":"id-181","style":{"width":"81%"},"width":1531,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-11.png","element":"img"}],[{"text":"We also know that by the conditions in theorem statement ","element":"span"},{"style":{"height":15.76},"width":362.68,"height":39.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-12.png","element":"img","alt":" z = AD−F 2 ≥ C1 >","inline":true,"padRight":true},{"text":"0, and ","element":"span"},{"style":{"height":17.2},"width":377.08,"height":43,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-13.png","element":"img","alt":" v = Aρ21−2Fρ1+D ≥","inline":true},{"style":{"height":13.1},"width":88.6,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-14.png","element":"img","alt":"C1 >","inline":true,"padRight":true},{"text":"0. Then, see that by Lemma ","element":"span"},{"href":"#id-105","text":"B.5 ","element":"a"},{"text":"in Supplement B","element":"span"}],[{"id":"id-182","style":{"width":"64%"},"width":1209,"height":45,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-15.png","element":"img"}],[{"text":"Thus, by ","element":"span"},{"href":"#id-181","text":"(A.122)","element":"a"},{"href":"#id-182","text":"(A.123) ","element":"a"},{"text":"and ","element":"span"},{"style":{"height":14},"width":372.76,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-16.png","element":"img","alt":" z ≥ C1 > 0, v ≥ C1 >","inline":true,"padRight":true},{"text":"0 with Assumption ","element":"span"},{"href":"#id-58","text":"8: ","element":"a"},{"style":{"height":15.66},"width":158.08,"height":39.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-17.png","element":"img","alt":" K3¯sln →","inline":true,"padRight":true},{"text":"0 in ","element":"span"},{"href":"#id-183","text":"(A.121)","element":"a"},{"text":", we have","element":"span"}],[{"id":"id-187","style":{"width":"59%"},"width":1122,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-18.png","element":"img"}],[{"text":"Consider the numerator in ","element":"span"},{"href":"#id-180","text":"(A.120)","element":"a"},{"text":":","element":"span"}],[{"id":"id-186","style":{"width":"75%"},"width":1409,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-19.png","element":"img"}],[{"text":"By Lemma ","element":"span"},{"href":"#id-111","text":"B.6 ","element":"a"},{"text":"in Supplement B, and Assumption ","element":"span"},{"href":"#id-58","text":"8","element":"a"}],[{"id":"id-184","style":{"width":"76%"},"width":1436,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-20.png","element":"img"}],[{"text":"Clearly, by Lemma ","element":"span"},{"href":"#id-105","text":"B.5 ","element":"a"},{"text":"in Supplement B and triangle inequality with ","element":"span"},{"style":{"height":10},"width":36.64,"height":25,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-21.png","element":"img","alt":" ρ1","inline":true,"padRight":true},{"text":"being finite,","element":"span"}],[{"id":"id-185","style":{"width":"64%"},"width":1214,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-22.png","element":"img"}],[{"text":"Then, use ","element":"span"},{"href":"#id-181","text":"(A.122)","element":"a"},{"href":"#id-182","text":"(A.123)","element":"a"},{"href":"#id-184","text":"(A.126)","element":"a"},{"href":"#id-185","text":"(A.127) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-186","text":"(A.125) ","element":"a"},{"text":"by Assumption ","element":"span"},{"href":"#id-58","text":"8","element":"a"}],[{"id":"id-188","style":{"width":"64%"},"width":1201,"height":47,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/64-23.png","element":"img"}],[{"text":"Use ","element":"span"},{"href":"#id-187","text":"(A.124)","element":"a"},{"href":"#id-188","text":"(A.128) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-180","text":"(A.120) ","element":"a"},{"text":"to obtain the desired result.","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":298,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-0.png","element":"img"}],[{"text":"Lemma ","element":"span"},{"href":"#id-120","text":"B.4 ","element":"a"},{"text":"in Supplement B shows that","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":377,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-1.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-62","text":"6","element":"a"},{"text":". Note that by the definition of ","element":"span"},{"style":{"height":13.11},"width":114.32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-2.png","element":"img","alt":" MSRc","inline":true,"padRight":true},{"text":"in ","element":"span"},{"href":"#id-122","text":"(C.2) ","element":"a"},{"text":"and ","element":"span"},{"text":"A, F, D ","element":"span"},{"text":"terms,","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":675,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-3.png","element":"img"}],[{"text":"by Lemma ","element":"span"},{"href":"#id-173","text":"B.2 ","element":"a"},{"text":"in Supplement B. Then by Assumption ","element":"span"},{"href":"#id-58","text":"8","element":"a"}],[{"style":{"width":"70%"},"width":1312,"height":87,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-4.png","element":"img"}],[{"text":"Thus, clearly we obtain, since ","element":"span"},{"style":{"height":16},"width":313.88,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-5.png","element":"img","alt":" | �A| ≥ A − | �A − A|","inline":true},{"text":",","element":"span"}],[{"style":{"width":"49%"},"width":931,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-6.png","element":"img"}],[{"text":"which implies for the denominator","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":490,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/65-7.png","element":"img"}],[{"style":{"width":"99%"},"width":1867,"height":634,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-0.png","element":"img"}],[{"text":"where the rate is the slowest among the three right-hand-side terms.","element":"span"}],[{"style":{"width":"7%"},"width":136,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-1.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-65","text":"7","element":"a"},{"text":". Note that we define ","element":"span"},{"style":{"height":20.94},"width":135.04,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-2.png","element":"img","alt":" Γ : Σ−1y","inline":true,"padRight":true},{"text":". We need to start with","element":"span"}],[{"id":"id-190","style":{"width":"72%"},"width":1357,"height":195,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-3.png","element":"img"}],[{"text":"Define the event ","element":"span"},{"style":{"height":18.43},"width":548.96,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-4.png","element":"img","alt":" E1 = {|1′p�Γ�µ/p− 1′pΓµ/p| ≤ ǫ}","inline":true},{"text":", where ","element":"span"},{"style":{"height":9.6},"width":58.36,"height":24,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-5.png","element":"img","alt":" ǫ >","inline":true,"padRight":true},{"text":"0. We condition the proof on event ","element":"span"},{"style":{"height":13.1},"width":45.28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-6.png","element":"img","alt":" E1","inline":true},{"text":", then ","element":"span"},{"text":"at the end of the proof we show that ","element":"span"},{"style":{"height":16},"width":153.76,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-7.png","element":"img","alt":" P(E1) →","inline":true,"padRight":true},{"text":"1. Start with the condition ","element":"span"},{"style":{"height":18.24},"width":345.88,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-8.png","element":"img","alt":" 1′pΓµ/p ≥ C > 2ǫ >","inline":true,"padRight":true},{"text":"0;","element":"span"}],[{"style":{"width":"67%"},"width":1257,"height":396,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-9.png","element":"img"}],[{"text":"where we use ","element":"span"},{"style":{"height":13.1},"width":45.28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-10.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":"in the second inequality and the condition for the third inequality. This clearly shows that at event ","element":"span"},{"style":{"height":13.1},"width":45.28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-11.png","element":"img","alt":" E1","inline":true},{"text":", when the condition ","element":"span"},{"style":{"height":18.43},"width":352.6,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-12.png","element":"img","alt":" 1′pΓµ/p ≥ C > 2ǫ >","inline":true,"padRight":true},{"text":"0 holds, we have ","element":"span"},{"style":{"height":18.43},"width":248.44,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-13.png","element":"img","alt":" 1′p�Γ�µ/p > ǫ >","inline":true,"padRight":true},{"text":"0. So","element":"span"}],[{"id":"id-189","style":{"width":"99%"},"width":1868,"height":616,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/66-14.png","element":"img"}],[{"text":"Then, in ","element":"span"},{"href":"#id-189","text":"(A.140)","element":"a"},{"text":", using the condition ","element":"span"},{"style":{"height":18.43},"width":415,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-0.png","element":"img","alt":" 1′pΓµ/p ≤ −C < −2ǫ <","inline":true,"padRight":true},{"text":"0 (note that this also implies ","element":"span"},{"style":{"height":18.43},"width":178.84,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-1.png","element":"img","alt":" 1′pΓµ/p <","inline":true,"padRight":true},{"text":"0)","element":"span"}],[{"style":{"width":"37%"},"width":705,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-2.png","element":"img"}],[{"text":"which implies that, with ","element":"span"},{"style":{"height":11.6},"width":124.64,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-3.png","element":"img","alt":" C > 2ǫ","inline":true},{"text":", adding ","element":"span"},{"style":{"height":0},"width":20,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-4.png","element":"img","alt":" ǫ","inline":true,"padRight":true},{"text":"to all sides above yields","element":"span"}],[{"style":{"width":"99%"},"width":1869,"height":259,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-5.png","element":"img"}],[{"text":"as in the maximum Sharpe Ratios in Theorem ","element":"span"},{"href":"#id-62","text":"6. ","element":"a"},{"text":"Clearly under event ","element":"span"},{"style":{"height":13.1},"width":45.28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-6.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":"with ","element":"span"},{"style":{"height":18.24},"width":383.8,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-7.png","element":"img","alt":" 1′pΓµ/p ≥ C > 2ǫ >","inline":true,"padRight":true},{"text":"0, ","element":"span"},{"href":"#id-190","text":"(A.137) ","element":"a"},{"text":"is rewritten as","element":"span"}],[{"style":{"width":"66%"},"width":1246,"height":142,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-8.png","element":"img"}],[{"text":"where we use Theorem ","element":"span"},{"href":"#id-63","text":"5. ","element":"a"},{"text":"Under event ","element":"span"},{"style":{"height":13.1},"width":45.28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-9.png","element":"img","alt":" E1","inline":true},{"text":", with 1","element":"span"},{"style":{"height":18.43},"width":384.76,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-10.png","element":"img","alt":"′pΓµ/p ≤ −C < −2ǫ <","inline":true,"padRight":true},{"text":"0, ","element":"span"},{"href":"#id-190","text":"(A.137) ","element":"a"},{"text":"is rewritten as ","element":"span"},{"style":{"height":58.21},"width":13,"height":145.52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-11.png","element":"img","alt":"��","inline":true},{"text":"( ","element":"span"},{"style":{"height":46.16},"width":358.96,"height":115.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-12.png","element":"img","alt":"�MSRc2− MSR2c)/pMSR2c/p","inline":true}],[{"text":"where we use Theorem ","element":"span"},{"href":"#id-62","text":"6.","element":"a"}],[{"text":"Note that we can rewrite the event ","element":"span"},{"style":{"height":13.1},"width":45.28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-13.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":16},"width":244.64,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-14.png","element":"img","alt":" {| �F − F| ≤ ǫ}","inline":true},{"text":", with ","element":"span"},{"style":{"height":16},"width":204.8,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-15.png","element":"img","alt":" ǫ = O(K¯sln","inline":true},{"text":"). Note that event ","element":"span"},{"style":{"height":13.1},"width":45.28,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-16.png","element":"img","alt":" E1","inline":true,"padRight":true},{"text":"occurs with probability approaching one by Lemma ","element":"span"},{"href":"#id-64","text":"B.3 ","element":"a"},{"text":"in Supplement B, so we have proven the desired result.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-17.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-66","text":"8","element":"a"},{"text":". (A.2) of ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019) ","element":"a"},{"text":"shows that the squared ratio of the estimated maximum out-of-sample Sharpe Ratio to the theoretical ratio can be written as","element":"span"}],[{"id":"id-195","style":{"width":"67%"},"width":1264,"height":187,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-18.png","element":"img"}],[{"text":"The proof will consider the numerator and the denominator of the squared maximum out-of-sample","element":"span"}],[{"text":"Sharpe Ratio. We start with the numerator using the definition, ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-19.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-20.png","element":"img","alt":" Σ−1y","inline":true}],[{"id":"id-191","style":{"width":"62%"},"width":1172,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-21.png","element":"img"}],[{"text":"Consider the fraction on the right-hand side. Start with the numerator in ","element":"span"},{"href":"#id-191","text":"(A.145)","element":"a"},{"text":".","element":"span"}],[{"id":"id-193","style":{"width":"87%"},"width":1634,"height":480,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/67-22.png","element":"img"}],[{"text":"where we use ","element":"span"},{"href":"#id-99","text":"(B.18)","element":"a"},{"text":", ","element":"span"},{"href":"#id-192","text":"(B.19)","element":"a"},{"text":", and ","element":"span"},{"href":"#id-100","text":"(B.20) ","element":"a"},{"text":"for the rates and the dominant rate in the last equality is by","element":"span"}],[{"text":"Assumption ","element":"span"},{"href":"#id-58","text":"8 ","element":"a"},{"text":"and ","element":"span"},{"style":{"height":13.1},"width":32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-0.png","element":"img","alt":" ln","inline":true,"padRight":true},{"text":"definition ","element":"span"},{"href":"#id-12","text":"(22)","element":"a"},{"text":". By Assumption ","element":"span"},{"href":"#id-58","text":"8(","element":"a"},{"text":"ii)","element":"span"}],[{"id":"id-194","style":{"width":"56%"},"width":1058,"height":93,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-1.png","element":"img"}],[{"text":"Then, by ","element":"span"},{"href":"#id-193","text":"(A.146)","element":"a"},{"href":"#id-194","text":"(A.147) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-191","text":"(A.145)","element":"a"}],[{"id":"id-205","style":{"width":"73%"},"width":1377,"height":104,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-2.png","element":"img"}],[{"text":"We now attempt to show that the denominator in ","element":"span"},{"href":"#id-195","text":"(A.144)","element":"a"}],[{"id":"id-204","style":{"width":"56%"},"width":1067,"height":115,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-3.png","element":"img"}],[{"text":"In that respect, bearing in mind that ","element":"span"},{"style":{"height":20.94},"width":154.72,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-4.png","element":"img","alt":" Γ = Σ−1y","inline":true,"padRight":true},{"text":"is symmetric","element":"span"}],[{"id":"id-196","style":{"width":"99%"},"width":1869,"height":286,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-5.png","element":"img"}],[{"text":"Using ","element":"span"},{"href":"#id-196","text":"(A.151)","element":"a"}],[{"id":"id-197","style":{"width":"89%"},"width":1673,"height":416,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-6.png","element":"img"}],[{"text":"First, we consider ","element":"span"},{"href":"#id-197","text":"(A.152)","element":"a"},{"text":".","element":"span"}],[{"id":"id-199","style":{"width":"85%"},"width":1600,"height":434,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-7.png","element":"img"}],[{"text":"where we use H¨older’s inequality for the third inequality and Theorem ","element":"span"},{"href":"#id-88","text":"2 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-198","text":"(A.108)","element":"a"},{"text":", ","element":"span"},{"href":"#id-94","text":"(B.8) ","element":"a"},{"text":"for the rate. Now, consider ","element":"span"},{"href":"#id-197","text":"(A.153)","element":"a"},{"text":", and by definition ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-8.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.95},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/68-9.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":".","element":"span"}],[{"style":{"width":"88%"},"width":1657,"height":456,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-0.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-101","text":"(B.16)","element":"a"},{"href":"#id-192","text":"(B.19) ","element":"a"},{"text":"for the second equality, and the dominant rate in third equality can be seen from Assumption","element":"span"}],[{"href":"#id-58","text":"8. ","element":"a"},{"text":"Next, consider ","element":"span"},{"href":"#id-197","text":"(A.154)","element":"a"},{"text":", and recall that ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-1.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-2.png","element":"img","alt":" Σ−1y","inline":true}],[{"id":"id-201","style":{"width":"91%"},"width":1720,"height":296,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-3.png","element":"img"}],[{"text":"where we use ","element":"span"},{"href":"#id-192","text":"(B.19)","element":"a"},{"href":"#id-100","text":"(B.20) ","element":"a"},{"text":"for the second equality, and the dominant rate in the third equality can be seen","element":"span"}],[{"text":"from Assumption ","element":"span"},{"href":"#id-58","text":"8. ","element":"a"},{"text":"Consider now ","element":"span"},{"href":"#id-197","text":"(A.155) ","element":"a"},{"text":"by the symmetry of ","element":"span"},{"style":{"height":20.95},"width":154.72,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-4.png","element":"img","alt":" Γ = Σ−1y","inline":true}],[{"style":{"width":"97%"},"width":1817,"height":140,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-5.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-98","text":"(B.17)","element":"a"},{"text":". Next, analyze ","element":"span"},{"href":"#id-197","text":"(A.156) ","element":"a"},{"text":"by the symmetricity of ","element":"span"},{"style":{"height":20.94},"width":154.72,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-6.png","element":"img","alt":" Γ = Σ−1y","inline":true}],[{"id":"id-200","style":{"width":"91%"},"width":1711,"height":97,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-7.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-99","text":"(B.18)","element":"a"},{"text":". Combine the rates and terms ","element":"span"},{"href":"#id-199","text":"(A.157)","element":"a"},{"text":"-","element":"span"},{"href":"#id-200","text":"(A.161) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-197","text":"(A.152)","element":"a"},{"text":"-","element":"span"},{"href":"#id-197","text":"(A.156) ","element":"a"},{"text":"to obtain","element":"span"}],[{"id":"id-202","style":{"width":"68%"},"width":1290,"height":58,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-8.png","element":"img"}],[{"text":"by the dominant rate in ","element":"span"},{"href":"#id-201","text":"(A.159)","element":"a"},{"text":", as seen in Assumption ","element":"span"},{"href":"#id-66","text":"9: ","element":"a"},{"style":{"height":14},"width":133.12,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-9.png","element":"img","alt":" p¯sln →","inline":true,"padRight":true},{"text":"0 in ","element":"span"},{"href":"#id-199","text":"(A.157)","element":"a"},{"text":", and ","element":"span"},{"style":{"height":13.11},"width":32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-10.png","element":"img","alt":" ln","inline":true,"padRight":true},{"text":"definition in Assumption ","element":"span"},{"href":"#id-12","text":"7.","element":"a"}],[{"id":"id-203","style":{"width":"96%"},"width":1807,"height":137,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-11.png","element":"img"}],[{"text":"Combine ","element":"span"},{"href":"#id-202","text":"(A.162)","element":"a"},{"href":"#id-203","text":"(A.163)","element":"a"},{"text":", in the second right side term in ","element":"span"},{"href":"#id-196","text":"(A.150) ","element":"a"},{"text":"via Assumption ","element":"span"},{"href":"#id-58","text":"8","element":"a"}],[{"id":"id-221","style":{"width":"72%"},"width":1366,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-12.png","element":"img"}],[{"text":"Therefore, we show ","element":"span"},{"href":"#id-204","text":"(A.149) ","element":"a"},{"text":"via ","element":"span"},{"href":"#id-196","text":"(A.150)","element":"a"},{"text":". Then, combine ","element":"span"},{"href":"#id-205","text":"(A.148)","element":"a"},{"href":"#id-204","text":"(A.149) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-195","text":"(A.144) ","element":"a"},{"text":"to obtain the desired result.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/69-13.png","element":"img"}]]},{"heading":"Supplement B","paragraphs":[[{"text":"Here, we provide results that are used in proofs of Section 4. We provide a matrix norm inequality. Let ","element":"span"},{"text":"x ","element":"span"},{"text":"be a generic vector, which is ","element":"span"},{"style":{"height":11.2},"width":56.92,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-0.png","element":"img","alt":" p×","inline":true,"padRight":true},{"text":"1. ","element":"span"},{"text":"M ","element":"span"},{"text":"is a square matrix of dimension ","element":"span"},{"text":"p","element":"span"},{"text":", where ","element":"span"},{"style":{"height":17.23},"width":64.4,"height":43.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-1.png","element":"img","alt":" M ′j","inline":true,"padRight":true},{"text":"is the ","element":"span"},{"text":"j","element":"span"},{"text":"th row of dimension ","element":"span"},{"text":"1 ","element":"span"},{"style":{"height":11.2},"width":61.84,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-2.png","element":"img","alt":" × p","inline":true},{"text":", and ","element":"span"},{"style":{"height":15.5},"width":62.92,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-3.png","element":"img","alt":" M j","inline":true,"padRight":true},{"text":"is the transpose of this row vector.","element":"span"}],[{"id":"id-89","style":{"width":"99%"},"width":1868,"height":316,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-4.png","element":"img"}],[{"text":"where we use H¨older’s inequality to obtain each inequality.","element":"span"}],[{"style":{"width":"55%"},"width":1047,"height":77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-5.png","element":"img"}],[{"text":"Recall the definition of ","element":"span"},{"text":"A ","element":"span"},{"text":":= ","element":"span"},{"style":{"height":18.24},"width":149.2,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-6.png","element":"img","alt":" 1′pΓ1p/p","inline":true,"padRight":true},{"text":"and ","element":"span"},{"style":{"height":11.6},"width":30,"height":29,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-7.png","element":"img","alt":" �A","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.24},"width":149.2,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-8.png","element":"img","alt":" 1′p�Γ1p/p","inline":true},{"text":", and ¯","element":"span"},{"style":{"height":13.1},"width":50.72,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-9.png","element":"img","alt":"sln","inline":true,"padRight":true},{"text":"is the rate of convergence in Theorem ","element":"span"},{"href":"#id-88","text":"2 ","element":"a"},{"text":"in main text, and defined in Assumption ","element":"span"},{"href":"#id-12","text":"7 ","element":"a"},{"text":"with the property ¯","element":"span"},{"style":{"height":13.11},"width":103.36,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-10.png","element":"img","alt":"sln →","inline":true,"padRight":true},{"text":"0.","element":"span"}],[{"id":"id-173","style":{"width":"99%"},"width":1868,"height":444,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-11.png","element":"img"}],[{"text":"where H¨older’s inequality is used in the first inequality, Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"is used for the second inequality, and the last equality is obtained by using Theorem ","element":"span"},{"href":"#id-88","text":"2 ","element":"a"},{"text":"and imposing Assumption ","element":"span"},{"href":"#id-12","text":"7.","element":"a"}],[{"style":{"width":"63%"},"width":1183,"height":77,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-12.png","element":"img"}],[{"text":"Before the next Lemma, we define ","element":"span"},{"style":{"height":10.8},"width":31,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-13.png","element":"img","alt":"�F","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":18.43},"width":139.12,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-14.png","element":"img","alt":" 1′p�Γ�µ/p","inline":true},{"text":", and ","element":"span"},{"text":"F ","element":"span"},{"text":":= ","element":"span"},{"style":{"height":18.43},"width":139.12,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-15.png","element":"img","alt":" 1′pΓµ/p","inline":true},{"text":".","element":"span"}],[{"id":"id-64","style":{"width":"63%"},"width":1186,"height":161,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-16.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-64","text":"B.3","element":"a"},{"text":". We can decompose ","element":"span"},{"style":{"height":10.8},"width":31,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-17.png","element":"img","alt":"�F","inline":true,"padRight":true},{"text":"by simple addition and subtraction into","element":"span"}],[{"style":{"width":"65%"},"width":1230,"height":199,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-18.png","element":"img"}],[{"text":"Now, we analyze each of the terms above.","element":"span"}],[{"style":{"width":"83%"},"width":1572,"height":259,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/70-19.png","element":"img"}],[{"text":"where we use H¨older’s inequality in the first inequality and Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"in the second inequality above, and","element":"span"}],[{"style":{"width":"99%"},"width":1872,"height":342,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-0.png","element":"img"}],[{"text":"where for the rates we use ","element":"span"},{"href":"#id-161","text":"(A.96)","element":"a"},{"href":"#id-162","text":"(A.97) ","element":"a"},{"text":"and since ","element":"span"},{"text":"K ","element":"span"},{"text":"is nondecreasing in ","element":"span"},{"text":"n","element":"span"},{"text":". Note that ","element":"span"},{"style":{"height":16},"width":156.96,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-1.png","element":"img","alt":" µ = E[yt","inline":true},{"text":"] = ","element":"span"},{"style":{"height":16},"width":105.6,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-2.png","element":"img","alt":" BE[ft","inline":true},{"text":"].","element":"span"}],[{"text":"So with ","element":"span"},{"style":{"height":15.51},"width":56.84,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-3.png","element":"img","alt":" bj,k","inline":true,"padRight":true},{"text":"representing ","element":"span"},{"text":"j, k","element":"span"},{"text":"th element of ","element":"span"},{"style":{"height":14},"width":158.36,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-4.png","element":"img","alt":" p × k: B","inline":true,"padRight":true},{"text":"matrix, and ","element":"span"},{"style":{"height":16.71},"width":95.72,"height":41.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-5.png","element":"img","alt":" E[ft,k","inline":true},{"text":"] representing ","element":"span"},{"text":"k","element":"span"},{"text":"th element of ","element":"span"},{"style":{"height":10.8},"width":77.56,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-6.png","element":"img","alt":" K ×","inline":true,"padRight":true},{"text":"1 vector ","element":"span"},{"style":{"height":16},"width":81.12,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-7.png","element":"img","alt":" E(f t","inline":true},{"text":") we have that","element":"span"}],[{"style":{"width":"76%"},"width":1430,"height":199,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-8.png","element":"img"}],[{"text":"where the rate is by Assumption ","element":"span"},{"href":"#id-41","text":"4, ","element":"a"},{"href":"#id-48","text":"6. ","element":"a"},{"text":"Therefore, we consider ","element":"span"},{"href":"#id-206","text":"(B.4) ","element":"a"},{"text":"above.","element":"span"}],[{"style":{"width":"64%"},"width":1210,"height":56,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-9.png","element":"img"}],[{"text":"where we use the same analysis that leads to ","element":"span"},{"href":"#id-92","text":"(B.6)","element":"a"},{"text":", and the rate is from Theorem ","element":"span"},{"href":"#id-88","text":"2, ","element":"a"},{"href":"#id-94","text":"(B.8)","element":"a"},{"text":". Now consider ","element":"span"},{"href":"#id-206","text":"(B.5)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"83%"},"width":1564,"height":251,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-10.png","element":"img"}],[{"text":"where we use H¨older’s inequality in the first inequality and Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"in the second inequality above, and the rate is from Theorem ","element":"span"},{"href":"#id-88","text":"2, ","element":"a"},{"href":"#id-207","text":"(B.7)","element":"a"},{"text":". Combine ","element":"span"},{"href":"#id-92","text":"(B.6)","element":"a"},{"href":"#id-94","text":"(B.9)","element":"a"},{"href":"#id-93","text":"(B.10) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-206","text":"(B.3)","element":"a"},{"text":"-","element":"span"},{"href":"#id-206","text":"(B.5)","element":"a"},{"text":", and note that the largest rate is coming from ","element":"span"},{"href":"#id-94","text":"(B.9) ","element":"a"},{"text":"by ¯","element":"span"},{"style":{"height":13.1},"width":50.72,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-11.png","element":"img","alt":"sln","inline":true,"padRight":true},{"text":"definition in Assumption ","element":"span"},{"href":"#id-12","text":"7.","element":"a"}],[{"style":{"width":"56%"},"width":1061,"height":80,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-12.png","element":"img"}],[{"text":"Note that ","element":"span"},{"text":"D ","element":"span"},{"text":":= ","element":"span"},{"style":{"height":16},"width":137.68,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-13.png","element":"img","alt":" µ′Γµ/p","inline":true},{"text":", and its estimator is ","element":"span"},{"style":{"height":10.8},"width":33,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-14.png","element":"img","alt":"�D","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":16},"width":137.2,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-15.png","element":"img","alt":" �µ′�Γ�µ/p","inline":true},{"text":".","element":"span"}],[{"id":"id-120","style":{"width":"63%"},"width":1198,"height":134,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-16.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-120","text":"B.4","element":"a"},{"text":". By simple addition and subtraction,","element":"span"}],[{"style":{"width":"68%"},"width":1286,"height":337,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/71-17.png","element":"img"}],[{"text":"Consider the first right side term above","element":"span"}],[{"style":{"width":"87%"},"width":1629,"height":258,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/72-0.png","element":"img"}],[{"text":"where H¨older’s inequality is used for the first inequality above, and the inequality Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"for the second inequality above, and for the rates we use Theorem ","element":"span"},{"href":"#id-88","text":"2. ","element":"a"},{"text":"We continue with ","element":"span"},{"href":"#id-95","text":"(B.12)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"87%"},"width":1634,"height":251,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/72-1.png","element":"img"}],[{"text":"where H¨older’s inequality is used for the first inequality above, and the inequality Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"for the second","element":"span"}],[{"text":"inequality above, and for the rates, we use Theorem ","element":"span"},{"href":"#id-88","text":"2 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-207","text":"(B.7)","element":"a"},{"text":". Then, we consider ","element":"span"},{"href":"#id-95","text":"(B.13)","element":"a"}],[{"style":{"width":"86%"},"width":1628,"height":250,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/72-2.png","element":"img"}],[{"text":"where H¨older’s inequality is used for the first inequality above, and the inequality Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"for the second inequality above, and for the rates, we use Theorem ","element":"span"},{"href":"#id-88","text":"2 ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-94","text":"(B.8)","element":"a"},{"text":". Then, we consider ","element":"span"},{"href":"#id-95","text":"(B.14)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"85%"},"width":1597,"height":348,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/72-3.png","element":"img"}],[{"text":"where H¨older’s inequality is used for the first inequality above, and the inequality Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"for the second inequality above, for the third inequality above, we use ","element":"span"},{"href":"#id-94","text":"(B.8)","element":"a"},{"text":", and for the rates, we use Theorem ","element":"span"},{"href":"#id-88","text":"2. ","element":"a"},{"text":"Then, we consider ","element":"span"},{"href":"#id-95","text":"(B.15)","element":"a"},{"text":":","element":"span"}],[{"style":{"width":"72%"},"width":1360,"height":298,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/72-4.png","element":"img"}],[{"text":"where H¨older’s inequality is used for the first inequality above, and the inequality Lemma ","element":"span"},{"href":"#id-89","text":"B.1 ","element":"a"},{"text":"for the second","element":"span"}],[{"text":"inequality above, for the third inequality above, we use ","element":"span"},{"href":"#id-94","text":"(B.8)","element":"a"},{"text":", and for the rate, we use Theorem ","element":"span"},{"href":"#id-88","text":"2. ","element":"a"},{"text":"Note that in ","element":"span"},{"href":"#id-95","text":"(B.11)","element":"a"},{"text":"-","element":"span"},{"href":"#id-95","text":"(B.15) ","element":"a"},{"text":"the rate in ","element":"span"},{"href":"#id-100","text":"(B.20) ","element":"a"},{"text":"is the slowest due to ","element":"span"},{"style":{"height":13.11},"width":32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/72-5.png","element":"img","alt":" ln","inline":true,"padRight":true},{"text":"definition in ","element":"span"},{"href":"#id-12","text":"(22) ","element":"a"},{"text":"to obtain","element":"span"}],[{"style":{"width":"63%"},"width":1195,"height":52,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/72-6.png","element":"img"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-0.png","element":"img"}],[{"text":"The following lemma establishes orders for the terms in the optimal weight, A, B, D. Note that both ","element":"span"},{"text":"A, D ","element":"span"},{"text":"are positive by Assumption 2 and uniformly bounded away from zero.","element":"span"}],[{"id":"id-105","style":{"width":"56%"},"width":1066,"height":209,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-1.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-105","text":"B.5","element":"a"},{"text":". Note that ","element":"span"},{"style":{"height":18.24},"width":275.32,"height":45.6,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-2.png","element":"img","alt":" A = 1′pΓ1p/p ≤","inline":true,"padRight":true},{"text":"Eigmax(","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-3.png","element":"img","alt":"Γ","inline":true},{"text":"). Then by p.221 of ","element":"span"},{"href":"#id-208","referenceIndex":1,"text":"Abadir and Magnus ","element":"a"},{"href":"#id-208","referenceIndex":1,"text":"(2005)","element":"a"},{"text":", ","element":"span"},{"text":"(Exercise 8.27.b in ","element":"span"},{"href":"#id-208","referenceIndex":1,"text":"Abadir and Magnus ","element":"a"},{"href":"#id-208","referenceIndex":1,"text":"(2005)","element":"a"},{"href":"#id-208","referenceIndex":1,"text":"), ","element":"a"},{"style":{"height":10.8},"width":68.12,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-4.png","element":"img","alt":" ΩB","inline":true},{"text":"[","element":"span"},{"href":"#id-208","referenceIndex":1,"text":"(cov(","element":"a"},{"style":{"height":17.36},"width":112.96,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-5.png","element":"img","alt":"f t))−1","inline":true,"padRight":true},{"text":"+ ","element":"span"},{"style":{"height":17.36},"width":251.88,"height":43.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-6.png","element":"img","alt":" B′ΩB]−1B′Ω","inline":true,"padRight":true},{"text":"is positive semidefinite,","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":514,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-7.png","element":"img"}],[{"text":"by (6.3) of ","element":"span"},{"href":"#id-209","text":"Fan et al. ","element":"a"},{"href":"#id-209","text":"(2008)","element":"a"},{"text":", ","element":"span"},{"style":{"height":19.31},"width":100.4,"height":48.28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-8.png","element":"img","alt":" ∥B∥2l2","inline":true,"padRight":true},{"text":"= ","element":"span"},{"text":"O","element":"span"},{"text":"(","element":"span"},{"text":"p","element":"span"},{"text":") under Assumption ","element":"span"},{"href":"#id-48","text":"6, ","element":"a"},{"text":"and by Assumption ","element":"span"},{"href":"#id-41","text":"4, ","element":"a"},{"style":{"height":17.39},"width":145.6,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-9.png","element":"img","alt":" ∥E[f t]∥22","inline":true,"padRight":true},{"text":"= ","element":"span"},{"text":"O","element":"span"},{"text":"(","element":"span"},{"text":"K","element":"span"},{"text":"), ","element":"span"},{"text":"since ","element":"span"},{"style":{"height":14.83},"width":151,"height":37.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-10.png","element":"img","alt":" f t : K ×","inline":true,"padRight":true},{"text":"1 vector of factors. By ","element":"span"},{"href":"#id-210","text":"(B.22)","element":"a"},{"href":"#id-210","text":"(B.23)","element":"a"}],[{"style":{"width":"73%"},"width":1372,"height":46,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-11.png","element":"img"}],[{"text":"For the term F, the proof can be obtained by using the Cauchy-Schwartz inequality first and the same analysis as for terms A and D.","element":"span"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-12.png","element":"img"}],[{"text":"Next, we need the following technical lemma, which provides the limit and the rate for the denominator in the optimal portfolio.","element":"span"}],[{"id":"id-111","style":{"width":"72%"},"width":1357,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-13.png","element":"img"}],[{"text":"Proof of Lemma ","element":"span"},{"href":"#id-111","text":"B.6","element":"a"},{"text":". Note that by simple addition and subtraction,","element":"span"}],[{"style":{"width":"54%"},"width":1026,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-14.png","element":"img"}],[{"text":"Then, using this last expression and simplifying, ","element":"span"},{"text":"A, D ","element":"span"},{"text":"being both positive,","element":"span"}],[{"style":{"width":"88%"},"width":1657,"height":339,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-15.png","element":"img"}],[{"text":"where we use ","element":"span"},{"href":"#id-122","text":"(B.2)","element":"a"},{"text":", Lemma ","element":"span"},{"href":"#id-64","text":"B.3, ","element":"a"},{"href":"#id-211","text":"(B.21)","element":"a"},{"text":", Lemma ","element":"span"},{"href":"#id-105","text":"B.5, ","element":"a"},{"text":"and Assumption ","element":"span"},{"href":"#id-58","text":"8.","element":"a"}],[{"style":{"width":"7%"},"width":134,"height":36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/73-16.png","element":"img"}]]},{"heading":"Supplement C","paragraphs":[[{"text":"This part covers the proofs for Corollaries 1-3 in the main text. ","element":"span"},{"text":"Proof of Corollary 1","element":"span"},{"text":". Rewrite the ratio of the Sharpe Ratio estimate to its target in the following way","element":"span"}],[{"style":{"width":"99%"},"width":1868,"height":173,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-0.png","element":"img"}],[{"text":"Consider the numerator in ","element":"span"},{"href":"#id-122","text":"(C.1)","element":"a"},{"text":".","element":"span"}],[{"style":{"width":"39%"},"width":743,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-1.png","element":"img"}],[{"text":"Then by Holder’s inequality and Lemma B.1","element":"span"}],[{"style":{"width":"81%"},"width":1519,"height":339,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-2.png","element":"img"}],[{"text":"and the rates are by ","element":"span"},{"href":"#id-94","text":"(B.8)","element":"a"},{"text":", Theorem 2. Since ","element":"span"},{"style":{"height":18.43},"width":285.4,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-3.png","element":"img","alt":" |1′pΓµ|/p ≥ C >","inline":true,"padRight":true},{"text":"0 by Assumption, using ","element":"span"},{"href":"#id-122","text":"(C.2) ","element":"a"},{"text":"we have","element":"span"}],[{"style":{"width":"63%"},"width":1183,"height":112,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-4.png","element":"img"}],[{"text":"Analyze the denominator in ","element":"span"},{"href":"#id-122","text":"(C.1)","element":"a"},{"text":",","element":"span"}],[{"style":{"width":"73%"},"width":1372,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-5.png","element":"img"}],[{"text":"Next, see that by adding and subtracting and via triangle inequality in ","element":"span"},{"href":"#id-212","text":"(C.4) ","element":"a"},{"text":"numerator","element":"span"}],[{"style":{"width":"77%"},"width":1451,"height":193,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-6.png","element":"img"}],[{"text":"Consider the first term in right side of ","element":"span"},{"href":"#id-212","text":"(C.5)","element":"a"}],[{"style":{"width":"84%"},"width":1580,"height":240,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-7.png","element":"img"}],[{"text":"by Theorem 2, ","element":"span"},{"href":"#id-198","text":"(A.108)","element":"a"},{"text":", and Assumption 9. Then in ","element":"span"},{"href":"#id-212","text":"(C.5)","element":"a"},{"text":", take the second right side term, with ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-8.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-9.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":",","element":"span"}],[{"style":{"width":"78%"},"width":1475,"height":193,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/74-10.png","element":"img"}],[{"text":"by Lemma B.2, Assumption 9. Use ","element":"span"},{"href":"#id-92","text":"(C.6)","element":"a"},{"href":"#id-207","text":"(C.7) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-212","text":"(C.4)","element":"a"},{"href":"#id-212","text":"(C.5) ","element":"a"},{"text":"by Assumption 8(ii)","element":"span"}],[{"id":"id-94","style":{"width":"99%"},"width":1866,"height":355,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-0.png","element":"img"}],[{"text":"and","element":"span"}],[{"id":"id-93","style":{"width":"64%"},"width":1209,"height":41,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-1.png","element":"img"}],[{"text":"The estimate of the portfolio return","element":"span"}],[{"id":"id-95","style":{"width":"65%"},"width":1231,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-2.png","element":"img"}],[{"text":"The target portfolio return","element":"span"}],[{"id":"id-97","style":{"width":"65%"},"width":1230,"height":86,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-3.png","element":"img"}],[{"text":"The estimate of variance of the portfolio, with ","element":"span"},{"style":{"height":15.5},"width":49.12,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-4.png","element":"img","alt":" Σy","inline":true,"padRight":true},{"text":"constant, is","element":"span"}],[{"style":{"width":"84%"},"width":1579,"height":156,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-5.png","element":"img"}],[{"text":"Target variance is","element":"span"}],[{"id":"id-106","style":{"width":"84%"},"width":1580,"height":149,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-6.png","element":"img"}],[{"text":"Start with the estimate of square of the Sharpe Ratio:","element":"span"}],[{"style":{"width":"78%"},"width":1469,"height":161,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-7.png","element":"img"}],[{"text":"Then the target Sharpe Ratio is:","element":"span"}],[{"style":{"width":"77%"},"width":1446,"height":133,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-8.png","element":"img"}],[{"text":"Take the ratio of the estimate to the target Sharpe Ratio, and scaling variances by ","element":"span"},{"text":"p","element":"span"}],[{"id":"id-213","style":{"width":"113%"},"width":2126,"height":307,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/75-9.png","element":"img"}],[{"text":"Start with the terms in numerator in ","element":"span"},{"href":"#id-213","text":"(C.15) ","element":"a"},{"text":"which will be upper bounded by","element":"span"}],[{"id":"id-101","style":{"width":"88%"},"width":1663,"height":125,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-0.png","element":"img"}],[{"text":"First term on the right side of ","element":"span"},{"href":"#id-101","text":"(C.16)","element":"a"},{"text":", and ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-1.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":"is symmetric","element":"span"}],[{"id":"id-98","style":{"width":"76%"},"width":1423,"height":126,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-2.png","element":"img"}],[{"text":"Take the first term on the right side of ","element":"span"},{"href":"#id-98","text":"(C.17)","element":"a"}],[{"id":"id-99","style":{"width":"71%"},"width":1337,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-3.png","element":"img"}],[{"text":"Then by Lemma B.3-B.6","element":"span"}],[{"id":"id-192","style":{"width":"73%"},"width":1373,"height":74,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-4.png","element":"img"}],[{"text":"Next by ","element":"span"},{"href":"#id-122","text":"(C.2)","element":"a"},{"text":", Lemma B.3, ","element":"span"},{"text":"F ","element":"span"},{"text":":= ","element":"span"},{"style":{"height":18.43},"width":139.12,"height":46.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-5.png","element":"img","alt":" 1′pΓµ/p","inline":true}],[{"id":"id-100","style":{"width":"71%"},"width":1331,"height":169,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-6.png","element":"img"}],[{"text":"where the last equality is by Assumption 8, ","element":"span"},{"style":{"height":13.1},"width":140.32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-7.png","element":"img","alt":" K¯sln →","inline":true,"padRight":true},{"text":"0. Combine ","element":"span"},{"href":"#id-192","text":"(C.19)","element":"a"},{"href":"#id-100","text":"(C.20) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-99","text":"(C.18)","element":"a"}],[{"id":"id-211","style":{"width":"68%"},"width":1290,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-8.png","element":"img"}],[{"text":"Then consider the second term on right side of ","element":"span"},{"href":"#id-98","text":"(C.17)","element":"a"}],[{"id":"id-210","style":{"width":"99%"},"width":1868,"height":336,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-9.png","element":"img"}],[{"text":"and ","element":"span"},{"href":"#id-122","text":"(C.2)","element":"a"},{"text":". So use ","element":"span"},{"href":"#id-211","text":"(C.21)","element":"a"},{"href":"#id-210","text":"(C.22) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-98","text":"(C.17)","element":"a"}],[{"id":"id-102","style":{"width":"68%"},"width":1284,"height":54,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-10.png","element":"img"}],[{"text":"Consider the second term in ","element":"span"},{"href":"#id-101","text":"(C.16)","element":"a"}],[{"id":"id-214","style":{"width":"78%"},"width":1474,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-11.png","element":"img"}],[{"text":"Take the first term on the right side of ","element":"span"},{"href":"#id-214","text":"(C.25)","element":"a"}],[{"id":"id-104","style":{"width":"72%"},"width":1348,"height":106,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/76-12.png","element":"img"}],[{"text":"Analyze ","element":"span"},{"href":"#id-104","text":"(C.26) ","element":"a"},{"text":"in the same way as in ","element":"span"},{"href":"#id-192","text":"(C.19) ","element":"a"},{"text":"use, ","element":"span"},{"href":"#id-93","text":"(C.10)","element":"a"},{"text":", Lemma B.2-B.3,","element":"span"}],[{"id":"id-103","style":{"width":"59%"},"width":1116,"height":42,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-0.png","element":"img"}],[{"text":"Then use ","element":"span"},{"text":"D ","element":"span"},{"text":":= ","element":"span"},{"style":{"height":16},"width":126.16,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-1.png","element":"img","alt":" µ′Γµ/p","inline":true},{"text":", and Lemma B.5 ","element":"span"},{"text":"D ","element":"span"},{"text":"= ","element":"span"},{"text":"O","element":"span"},{"text":"(","element":"span"},{"text":"K","element":"span"},{"text":") with ","element":"span"},{"href":"#id-191","text":"(A.145)","element":"a"},{"href":"#id-193","text":"(A.146)","element":"a"},{"text":", by Assumption 8","element":"span"}],[{"style":{"width":"27%"},"width":512,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-2.png","element":"img"}],[{"text":"Next use the last two rates in ","element":"span"},{"href":"#id-104","text":"(C.26)","element":"a"}],[{"id":"id-107","style":{"width":"69%"},"width":1303,"height":50,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-3.png","element":"img"}],[{"text":"Then take the second term on the right side in ","element":"span"},{"href":"#id-214","text":"(C.25)","element":"a"}],[{"id":"id-109","style":{"width":"99%"},"width":1868,"height":269,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-4.png","element":"img"}],[{"text":"by Lemma B.5, and ","element":"span"},{"style":{"height":9.1},"width":33.76,"height":22.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-5.png","element":"img","alt":" r2","inline":true,"padRight":true},{"text":"definition, and we use ","element":"span"},{"href":"#id-193","text":"(A.146) ","element":"a"},{"text":"for the other rate in ","element":"span"},{"href":"#id-107","text":"(C.29)","element":"a"},{"text":". Combine ","element":"span"},{"href":"#id-107","text":"(C.28)","element":"a"},{"href":"#id-107","text":"(C.29) ","element":"a"},{"text":"for ","element":"span"},{"href":"#id-214","text":"(C.25)","element":"a"}],[{"id":"id-108","style":{"width":"70%"},"width":1312,"height":49,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-6.png","element":"img"}],[{"text":"Use ","element":"span"},{"href":"#id-102","text":"(C.24)","element":"a"},{"href":"#id-108","text":"(C.31) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-101","text":"(C.16)","element":"a"},{"text":", by Assumption 8","element":"span"}],[{"style":{"width":"86%"},"width":1616,"height":394,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-7.png","element":"img"}],[{"text":"Next numerator in ","element":"span"},{"href":"#id-213","text":"(C.15) ","element":"a"},{"text":"can be written (without squaring)","element":"span"}],[{"style":{"width":"96%"},"width":1806,"height":282,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-8.png","element":"img"}],[{"text":"We analyze the terms in the denominator of ","element":"span"},{"href":"#id-213","text":"(C.15)","element":"a"}],[{"id":"id-117","style":{"width":"81%"},"width":1532,"height":391,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/77-9.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-117","text":"(C.33) ","element":"a"},{"text":"consider the first term on the right side by adding and subtracting","element":"span"}],[{"id":"id-123","style":{"width":"89%"},"width":1674,"height":280,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-0.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-123","text":"(C.34) ","element":"a"},{"text":"the first right side term will be considered by adding and subtracting","element":"span"}],[{"id":"id-110","style":{"width":"85%"},"width":1598,"height":280,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-1.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-110","text":"(C.35)","element":"a"},{"text":", by ","element":"span"},{"href":"#id-192","text":"(C.19)","element":"a"},{"text":", and by Lemma B.5 ","element":"span"},{"style":{"height":16},"width":193.92,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-2.png","element":"img","alt":" |r1| = O(K","inline":true},{"text":"), and Assumption 8","element":"span"}],[{"id":"id-215","style":{"width":"71%"},"width":1332,"height":119,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-3.png","element":"img"}],[{"text":"Using ","element":"span"},{"href":"#id-212","text":"(C.5)","element":"a"},{"text":"-","element":"span"},{"href":"#id-207","text":"(C.7) ","element":"a"},{"text":"with ","element":"span"},{"href":"#id-215","text":"(C.36) ","element":"a"},{"text":"in the first term on the right side of ","element":"span"},{"href":"#id-110","text":"(C.35)","element":"a"},{"text":",","element":"span"}],[{"id":"id-112","style":{"width":"80%"},"width":1515,"height":145,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-4.png","element":"img"}],[{"text":"Next use Lemma B.5 with ","element":"span"},{"href":"#id-215","text":"(C.36) ","element":"a"},{"text":"on the second right side term in ","element":"span"},{"href":"#id-110","text":"(C.35)","element":"a"}],[{"id":"id-113","style":{"width":"66%"},"width":1254,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-5.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-123","text":"(C.34) ","element":"a"},{"text":"the first right side term, use ","element":"span"},{"href":"#id-112","text":"(C.37)","element":"a"},{"href":"#id-113","text":"(C.38)","element":"a"},{"text":", and since ¯","element":"span"},{"style":{"height":14.83},"width":77.08,"height":37.08,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-6.png","element":"img","alt":"sl=n o","inline":true},{"text":"(1) by Assumption 9","element":"span"}],[{"id":"id-216","style":{"width":"67%"},"width":1271,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-7.png","element":"img"}],[{"text":"Analyze the second term on the right side of ","element":"span"},{"href":"#id-123","text":"(C.34)","element":"a"},{"text":", by ","element":"span"},{"href":"#id-212","text":"(C.5)","element":"a"},{"text":"-","element":"span"},{"href":"#id-207","text":"(C.7) ","element":"a"},{"text":"with Lemma B.5 ","element":"span"},{"style":{"height":16},"width":196.32,"height":40,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-8.png","element":"img","alt":" |r1| = O(K","inline":true},{"text":"), and","element":"span"}],[{"id":"id-51","style":{"width":"99%"},"width":1868,"height":212,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-9.png","element":"img"}],[{"text":"Clearly by ","element":"span"},{"href":"#id-216","text":"(C.39)","element":"a"},{"href":"#id-51","text":"(C.40) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-123","text":"(C.34)","element":"a"},{"text":", by Assumption 8","element":"span"}],[{"id":"id-52","style":{"width":"78%"},"width":1464,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-10.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-117","text":"(C.33) ","element":"a"},{"text":"consider the second term on the right side which will be upper bounded by adding and sub-","element":"span"}],[{"text":"tracting and triangle inequality","element":"span"}],[{"id":"id-217","style":{"width":"83%"},"width":1564,"height":257,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/78-11.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-217","text":"(C.42) ","element":"a"},{"text":"the first term on the right side will be analyzed by adding and subtracting","element":"span"}],[{"id":"id-218","style":{"width":"78%"},"width":1477,"height":234,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-0.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-218","text":"(C.43) ","element":"a"},{"text":"consider by ","element":"span"},{"href":"#id-103","text":"(C.27)","element":"a"},{"href":"#id-109","text":"(C.30) ","element":"a"},{"text":"and Assumption 8","element":"span"}],[{"id":"id-219","style":{"width":"72%"},"width":1365,"height":118,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-1.png","element":"img"}],[{"text":"Then by ","element":"span"},{"href":"#id-219","text":"(C.44)","element":"a"},{"href":"#id-202","text":"(A.162) ","element":"a"},{"text":"on the first right side term ","element":"span"},{"href":"#id-218","text":"(C.43)","element":"a"}],[{"id":"id-116","style":{"width":"86%"},"width":1611,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-2.png","element":"img"}],[{"text":"Then using Lemma B.5 for second term on the right side of ","element":"span"},{"href":"#id-218","text":"(C.43) ","element":"a"},{"text":"in combination with ","element":"span"},{"href":"#id-219","text":"(C.44)","element":"a"},{"href":"#id-116","text":"(C.45) ","element":"a"},{"text":"in","element":"span"}],[{"href":"#id-218","text":"(C.43)","element":"a"}],[{"id":"id-114","style":{"width":"85%"},"width":1606,"height":117,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-3.png","element":"img"}],[{"text":"since ","element":"span"},{"style":{"height":16.46},"width":211,"height":41.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-4.png","element":"img","alt":" K5/2¯sln = o","inline":true},{"text":"(1) by Assumption 8. Next consider the second term on right side of ","element":"span"},{"href":"#id-217","text":"(C.42)","element":"a"},{"text":", with ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-5.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-6.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":",","element":"span"}],[{"text":"and ","element":"span"},{"href":"#id-109","text":"(C.30) ","element":"a"},{"text":"with ","element":"span"},{"href":"#id-116","text":"(C.45)","element":"a"}],[{"id":"id-115","style":{"width":"93%"},"width":1745,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-7.png","element":"img"}],[{"text":"by Assumption 8. Use ","element":"span"},{"href":"#id-114","text":"(C.46)","element":"a"},{"href":"#id-115","text":"(C.47) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-217","text":"(C.42) ","element":"a"},{"text":"to have","element":"span"}],[{"id":"id-220","style":{"width":"93%"},"width":1745,"height":120,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-8.png","element":"img"}],[{"text":"by Assumption 8.","element":"span"}],[{"text":"Consider the third right side term in ","element":"span"},{"href":"#id-117","text":"(C.33)","element":"a"},{"text":", by adding and subtracting and triangle inequality","element":"span"}],[{"id":"id-119","style":{"width":"89%"},"width":1680,"height":258,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-9.png","element":"img"}],[{"text":"Consider the first right side term in ","element":"span"},{"href":"#id-119","text":"(C.49)","element":"a"}],[{"id":"id-121","style":{"width":"84%"},"width":1577,"height":257,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/79-10.png","element":"img"}],[{"text":"In ","element":"span"},{"href":"#id-121","text":"(C.50)","element":"a"},{"text":", by ","element":"span"},{"id":"id-126","href":"#id-192","text":"(C.19)","element":"a"},{"href":"#id-210","text":"(C.23)","element":"a"},{"href":"#id-103","text":"(C.27)","element":"a"},{"href":"#id-109","text":"(C.30) ","element":"a"},{"text":"and Assumption 8","element":"span"}],[{"id":"id-125","style":{"width":"99%"},"width":1868,"height":781,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/80-0.png","element":"img"}],[{"text":"Consider the first term on the right side of ","element":"span"},{"href":"#id-125","text":"(C.52) ","element":"a"},{"text":"via Cauchy Schwartz inequality","element":"span"}],[{"id":"id-124","style":{"width":"99%"},"width":1854,"height":797,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/80-1.png","element":"img"}],[{"text":"by the same analysis in ","element":"span"},{"href":"#id-94","text":"(B.9)","element":"a"},{"text":". Fourth term on the right side of ","element":"span"},{"href":"#id-125","text":"(C.52) ","element":"a"},{"text":"can use the same analysis in ","element":"span"},{"href":"#id-92","text":"(B.6)","element":"a"},{"text":", ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/80-2.png","element":"img","alt":"Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/80-3.png","element":"img","alt":" Σ−1y","inline":true}],[{"id":"id-127","style":{"width":"92%"},"width":1734,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/80-4.png","element":"img"}],[{"text":"Then fifth term on the right side of ","element":"span"},{"href":"#id-125","text":"(C.52) ","element":"a"},{"text":"is","element":"span"}],[{"style":{"width":"91%"},"width":1722,"height":123,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/80-5.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-92","text":"(B.6)","element":"a"},{"text":".","element":"span"}],[{"id":"id-128","style":{"width":"99%"},"width":1868,"height":426,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-0.png","element":"img"}],[{"text":"by ","element":"span"},{"href":"#id-93","text":"(B.10)","element":"a"},{"text":". ","element":"span"},{"text":"Among all right side terms in ","element":"span"},{"href":"#id-125","text":"(C.52)","element":"a"},{"text":", the slowest rate are ","element":"span"},{"href":"#id-124","text":"(C.55)","element":"a"},{"href":"#id-128","text":"(C.58)","element":"a"},{"text":", as can be seen by","element":"span"}],[{"id":"id-129","style":{"width":"99%"},"width":1867,"height":193,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-1.png","element":"img"}],[{"text":"Now, combine ","element":"span"},{"href":"#id-121","text":"(C.50)","element":"a"},{"href":"#id-126","text":"(C.51)","element":"a"},{"href":"#id-129","text":"(C.60) ","element":"a"},{"text":"in the first right side term ","element":"span"},{"href":"#id-119","text":"(C.49)","element":"a"},{"text":", with ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-2.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-3.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":", Lemma B.5 (i.e. ","element":"span"},{"style":{"height":18.16},"width":238.24,"height":45.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-4.png","element":"img","alt":"|F| = O(K1/2","inline":true},{"text":"))","element":"span"}],[{"id":"id-96","style":{"width":"95%"},"width":1789,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-5.png","element":"img"}],[{"text":"where the last rate is by Assumption 8. Consider ","element":"span"},{"href":"#id-129","text":"(C.60) ","element":"a"},{"text":"and ","element":"span"},{"style":{"height":18.16},"width":477.76,"height":45.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-6.png","element":"img","alt":" |r1| = O(K), |r2| = O(K1/2","inline":true},{"text":") by ","element":"span"},{"href":"#id-210","text":"(C.23)","element":"a"},{"href":"#id-109","text":"(C.30)","element":"a"},{"text":",","element":"span"}],[{"text":"substituted into second term in right side of ","element":"span"},{"href":"#id-119","text":"(C.49)","element":"a"}],[{"id":"id-131","style":{"width":"83%"},"width":1561,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-7.png","element":"img"}],[{"text":"So by ","element":"span"},{"href":"#id-96","text":"(C.61)","element":"a"},{"href":"#id-131","text":"(C.62) ","element":"a"},{"text":"in left side term in ","element":"span"},{"href":"#id-119","text":"(C.49)","element":"a"},{"text":", by Assumption 8","element":"span"}],[{"id":"id-132","style":{"width":"80%"},"width":1504,"height":121,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-8.png","element":"img"}],[{"text":"Next clearly by ","element":"span"},{"href":"#id-52","text":"(C.41)","element":"a"},{"href":"#id-220","text":"(C.48)","element":"a"},{"href":"#id-132","text":"(C.63) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-117","text":"(C.33)","element":"a"}],[{"id":"id-133","style":{"width":"92%"},"width":1730,"height":256,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-9.png","element":"img"}],[{"text":"Next the denominator in ","element":"span"},{"href":"#id-213","text":"(C.15) ","element":"a"},{"text":"can be written as","element":"span"}],[{"id":"id-135","style":{"width":"77%"},"width":1458,"height":180,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-10.png","element":"img"}],[{"text":"The ratio in ","element":"span"},{"href":"#id-135","text":"(C.65) ","element":"a"},{"text":"is greater than equal to the following term","element":"span"}],[{"style":{"width":"103%"},"width":1943,"height":98,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-11.png","element":"img"}],[{"id":"id-137","text":"1","element":"span"},{"style":{"height":4.4},"width":31,"height":11,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-12.png","element":"img","alt":"−","inline":true}],[{"style":{"width":"68%"},"width":1290,"height":81,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/81-13.png","element":"img"}],[{"text":"By ","element":"span"},{"style":{"height":14.8},"width":232.2,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-0.png","element":"img","alt":" r1, r2, A, F, D","inline":true,"padRight":true},{"text":"definitions, and Assumption ","element":"span"},{"style":{"height":17.39},"width":783.16,"height":43.48,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-1.png","element":"img","alt":" AD − F 2 ≥ C1 > 0, Aρ21 − 2ρ1F + D ≥ C1 >","inline":true,"padRight":true},{"text":"0","element":"span"}],[{"style":{"width":"98%"},"width":1840,"height":379,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-2.png","element":"img"}],[{"text":"See that by ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-3.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.75},"width":74.08,"height":51.88,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-4.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":"and symmetric Γ","element":"span"}],[{"id":"id-222","style":{"width":"66%"},"width":1250,"height":127,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-5.png","element":"img"}],[{"text":"By ","element":"span"},{"href":"#id-205","text":"(A.148) ","element":"a"},{"text":"in the numerator above","element":"span"}],[{"id":"id-139","style":{"width":"62%"},"width":1165,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-6.png","element":"img"}],[{"text":"By ","element":"span"},{"href":"#id-196","text":"(A.150)","element":"a"},{"href":"#id-221","text":"(A.164) ","element":"a"},{"text":"with Assumption 8(ii) in the denominator of ","element":"span"},{"href":"#id-222","text":"(C.67)","element":"a"}],[{"id":"id-223","style":{"width":"62%"},"width":1173,"height":103,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-7.png","element":"img"}],[{"text":"Use ","element":"span"},{"href":"#id-139","text":"(C.68)","element":"a"},{"href":"#id-223","text":"(C.69) ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-222","text":"(C.67) ","element":"a"},{"text":"to get the result.","element":"span"},{"text":"Q.E.D.","element":"span"}]]},{"heading":"Supplement D","paragraphs":[[{"text":"In this part we consider mean-variance efficiency of large portfolio in an out-of-sample context, and also we add a simulation to show the effects of sparsity on our and other methods.","element":"span"}],[{"text":"Mean-Variance Efficiency","element":"span"}],[{"text":"This Supplement formally shows that we can obtain mean-variance efficiency in an out-of-sample context. ","element":"span"},{"href":"#id-16","referenceIndex":2,"text":"Ao et al. ","element":"a"},{"href":"#id-16","referenceIndex":2,"text":"(2019) ","element":"a"},{"text":"show that this is possible when ","element":"span"},{"style":{"height":13.6},"width":113.76,"height":34,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-8.png","element":"img","alt":" p ≤ n","inline":true},{"text":", when both ","element":"span"},{"text":"p","element":"span"},{"text":", and ","element":"span"},{"text":"n ","element":"span"},{"text":"are large. ","element":"span"},{"text":"That article is a significant contribution since they also demonstrate that other methods before theirs could not obtain that result, and it is a difficult issue to address. We are interested in maximized out-of-sample expected return ","element":"span"},{"style":{"height":10.8},"width":132.6,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-9.png","element":"img","alt":" µ′wmos","inline":true,"padRight":true},{"text":"and its estimate ","element":"span"},{"style":{"height":10.8},"width":133.08,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-10.png","element":"img","alt":" µ′ �wmos","inline":true},{"text":". Additionally, we are interested in the out-of-sample variance of the portfolio returns ","element":"span"},{"style":{"height":15.5},"width":234.84,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-11.png","element":"img","alt":" w′mosΣywmos","inline":true,"padRight":true},{"text":"and its estimate ","element":"span"},{"style":{"height":15.5},"width":241.08,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-12.png","element":"img","alt":" �w′mosΣy �wmos","inline":true},{"text":". Note also that by the formula for weights ","element":"span"},{"style":{"height":15.5},"width":241.08,"height":38.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-13.png","element":"img","alt":"w′mosΣywmos","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":13.36},"width":40,"height":33.4,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-14.png","element":"img","alt":" σ2","inline":true},{"text":", given ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-15.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-16.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":".","element":"span"}],[{"text":"Below, we show that our estimates based on nodewise regression are consistent, and furthermore, we also provide the rate of convergence results.","element":"span"}],[{"id":"id-43","style":{"width":"69%"},"width":1309,"height":374,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/82-17.png","element":"img"}],[{"text":"Proof of Theorem ","element":"span"},{"href":"#id-43","text":"D.1","element":"a"},{"text":". (i). Start with definition of weights, and its estimators","element":"span"}],[{"id":"id-122","style":{"width":"99%"},"width":1867,"height":682,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-0.png","element":"img"}],[{"text":"Next, we have","element":"span"}],[{"id":"id-206","style":{"width":"71%"},"width":1342,"height":236,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-1.png","element":"img"}],[{"text":"where we divided both the numerator and denominator by ","element":"span"},{"text":"p","element":"span"},{"text":", and","element":"span"}],[{"style":{"width":"38%"},"width":721,"height":51,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-2.png","element":"img"}],[{"text":"By ","element":"span"},{"href":"#id-194","text":"(A.147)","element":"a"},{"text":",","element":"span"},{"href":"#id-206","text":"(D.3)","element":"a"},{"text":", Lemma B.4 in the Supplement B, and ","element":"span"},{"style":{"height":15.66},"width":179.32,"height":39.16,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-3.png","element":"img","alt":" K2¯sln = o","inline":true},{"text":"(1) via Assumption ","element":"span"},{"href":"#id-58","text":"8 ","element":"a"},{"text":"in the denominator below in ","element":"span"},{"href":"#id-212","text":"(D.4)","element":"a"}],[{"id":"id-212","style":{"width":"99%"},"width":1867,"height":310,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-4.png","element":"img"}],[{"text":"Now, use Assumption ","element":"span"},{"href":"#id-58","text":"8 ","element":"a"},{"text":"in ","element":"span"},{"href":"#id-122","text":"(D.2)","element":"a"},{"href":"#id-212","text":"(D.5) ","element":"a"},{"text":"and ","element":"span"},{"href":"#id-122","text":"(D.1) ","element":"a"},{"text":"to obtain the desired result.","element":"span"}],[{"style":{"width":"7%"},"width":136,"height":37,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-5.png","element":"img"}],[{"text":"(ii). Now, we analyze the risk. See that","element":"span"}],[{"style":{"width":"63%"},"width":1196,"height":169,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-6.png","element":"img"}],[{"text":"where we multiplied and divided by ","element":"span"},{"style":{"height":14},"width":95.2,"height":35,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-7.png","element":"img","alt":" µ′Γµ","inline":true},{"text":", which is positive by ","element":"span"},{"href":"#id-194","text":"(A.147)","element":"a"},{"text":". ","element":"span"},{"text":"By ","element":"span"},{"href":"#id-221","text":"(A.164)","element":"a"},{"text":", since ","element":"span"},{"style":{"height":10.8},"width":28,"height":27,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-8.png","element":"img","alt":" Γ","inline":true,"padRight":true},{"text":":= ","element":"span"},{"style":{"height":20.94},"width":74.08,"height":52.36,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-9.png","element":"img","alt":" Σ−1y","inline":true,"padRight":true},{"text":",","element":"span"}],[{"id":"id-92","style":{"width":"99%"},"width":1867,"height":185,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/83-10.png","element":"img"}],[{"text":"Additionally, by Lemma ","element":"span"},{"href":"#id-120","text":"B.4 ","element":"a"},{"text":"in Supplement B and ","element":"span"},{"href":"#id-194","text":"(A.147)","element":"a"}],[{"id":"id-207","style":{"width":"82%"},"width":1550,"height":122,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/84-0.png","element":"img"}],[{"text":"By ","element":"span"},{"href":"#id-92","text":"(D.6)","element":"a"},{"text":", ","element":"span"},{"href":"#id-207","text":"(D.7) ","element":"a"},{"text":"and Assumption ","element":"span"},{"href":"#id-58","text":"8,","element":"a"}],[{"style":{"width":"68%"},"width":1288,"height":143,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/84-1.png","element":"img"}],[{"text":"Effects of Sparsity","element":"span"}],[{"text":"This section of the Supplement show a small simulation with a Block Diagonal covariance matrix for the idiosyncratic part of the dgp. The dgp is the same from section 5 but with ","element":"span"},{"style":{"height":13.11},"width":56.12,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/84-2.png","element":"img","alt":"�Σn","inline":true,"padRight":true},{"text":"= ","element":"span"},{"style":{"height":13.11},"width":95.32,"height":32.76,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/84-3.png","element":"img","alt":"�Σn ⊙","inline":true,"padRight":true},{"text":"BLDiag(","element":"span"},{"text":"b","element":"span"},{"text":"), where BLDiag(","element":"span"},{"text":"b","element":"span"},{"text":") is the ","element":"span"},{"style":{"height":11.2},"width":95.44,"height":28,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/84-4.png","element":"img","alt":" p × p","inline":true,"padRight":true},{"text":"block diagonal matrix with ","element":"span"},{"text":"b ","element":"span"},{"text":"blocks of ones. Moreover, this simulation was only performed for ","element":"span"},{"text":"n ","element":"span"},{"text":"= 200 and for the plug-in models with block sizes of 5, 15 and 50. The objective is to look at the behavior of Nodewise Regression on different sparsity levels in the covariance matrix. We analyze two questions whether our methods are doing well compared to others when the model is","element":"span"}],[{"text":"less sparse, and then see whether sparsity effects are uniform over analysis of various Sharpe Ratio cases in Section 4.","element":"span"}],[{"text":"First, from Table 6, our methods do well in high dimensional cases, our method has the smallest error in 5 out of 12 cases, POET method has high errors in all cases. In case of low dimensions, non-linear shrinkage is the best method, POET does again poorly. Also in less sparse case of blocks with 50, in high dimensions, we get the least error in 2 cases, and the other 2 cases non-linear shrinkage gets the least errors. Regarding the analysis of our method in various Sharpe Ratio cases, in case of the constrained maximum","element":"span"}],[{"text":"Sharpe Ratio (MSR), our errors are smaller with increased block size. To give an example, NW-GIC has 0.379 in high dimensional case with 5 as block size, and this decreases to 0.100 with block size of 50. In case","element":"span"}],[{"text":"of Markowitz portfolio we see that increasing the block size does not affect our errors much differently. Our method is affected by non-sparsity in maximum out-of-sample Sharpe Ratio as predicted by our Theorem 8.","element":"span"}],[{"text":"Table 6: Simulation Results – Block Diagonal DGP with Real Factors","element":"figcaption","subtype":"caption"}],[{"style":{"width":"100%"},"width":1874,"height":1023,"src":"https://cdn.bytez.com/mobilePapers/v2/arxiv/2002.01800/images/85-0.png","element":"img"}],[{"text":"The table shows the simulation results for the block DGP. Each simulation was done with 100 iterations. We used a single sample size of ","element":"span"},{"text":"n ","element":"span"},{"text":"= 200 and the number of stocks was either ","element":"span"},{"text":"n/","element":"span"},{"text":"2 or 1","element":"span"},{"text":".","element":"span"},{"text":"5","element":"span"},{"text":"n ","element":"span"},{"text":"for the low-dimensional and the high-dimensional case, respectively. Each block of rows shows the results for a different block size (5, 15, 50) in the block diagonal DGP. The values in each cell show the average absolute estimation error for estimating the square of the Sharpe Ratio.","element":"span"}]]}],"_version":"3.3.2"},"paperNode":"$28:props:children:props:children:0:props:product"}]]