Selected technical reports

Mark Tygert's homepage

Selected technical reports

Kamalika Chaudhuri, Chuan Guo, Laurens van der Maaten, Saeed Mahloujifar, and Mark Tygert, "Guarantees of confidentiality via Hammersley-Chapman-Robbins bounds," Technical Report 2404.02866, arXiv, April 2024: pdf.
This article replaces Fisher information privacy with a nearly uniformly superior alternative, using Hammersley-Chapman-Robbins bounds rather than Cramér-Rao.
Isabel Kloumann, Hannah Korevaar, Chris McConnell, Mark Tygert, and Jessica Zhao, "Cumulative differences between paired samples," Technical Report 2305.11323, arXiv, May 2023: pdf.
This article details cumulative statistics for matched pairs.
Mark Tygert, "Controlling for multiple covariates," Technical Report 2112.00672, arXiv, December 2021: pdf.
This article proposes a fully non-parametric method for conditioning on multiple covariates when assessing differences between subpopulations (or between a subpopulation and the full population).
Isabel Kloumann and Mark Tygert, "An optimizable scalar objective value cannot be objective and should not be the sole objective," Technical Report 2006.02577, arXiv, June 2020: pdf.
This article concerns the ethics and morality of algorithms and computational systems, and has been circulating internally at Facebook for the past couple years. The paper reviews many Nobel laureates' work, as well as the work of other prominent scientists such as Richard Dawkins, Andrei Kolmogorov, Vilfredo Pareto, and John Von Neumann. The article argues that the standard approach to modern machine learning and artificial intelligence is bound to be biased and unfair, and that longstanding traditions in the professions of law, justice, politics, and medicine should help.
Mark Tygert, "Poor starting points in machine learning," Technical Report 1602.02823, arXiv, January 2016: pdf.
This article advocates starting with a higher-order method and finishing with a lowest-order method in many settings for machine learning, when generalization is important.
William Perkins, Mark Tygert, and Rachel Ward, "Computer-enabled metrics of statistical significance for discrete data," May 2014: pdf.
This monograph collects together all our work on significance testing.
Mark Tygert and Rachel Ward, "Testing goodness-of-fit for logistic regression," Technical Report 1306.0959, arXiv, June 2013: pdf.
This article resolves many issues with the standard Hosmer-Lemeshow tests.
William Perkins, Mark Tygert, and Rachel Ward, "Significance testing without truth," Technical Report 1301.1208, arXiv, January 2013: pdf.
This article has major antecedents in the work of D. R. Cox.
Jacob Carruth, Mark Tygert, and Rachel Ward, "A comparison of the discrete Kolmogorov-Smirnov statistic and the Euclidean distance," Technical Report 1206.6367, arXiv, June 2012: pdf.
This article provides a guide to choosing between the discrete Kolmogorov-Smirnov statistic and the root-mean-square.
William Perkins, Gary Simon, and Mark Tygert, "Computing the asymptotic power of a Euclidean-distance test for goodness-of-fit," Technical Report 1206.6378, arXiv, June 2012: pdf.
This article provides an efficient numerical method for plotting the asymptotic power function of a root-mean-square test for goodness-of-fit in the limit of large numbers of observations (as a function of the significance level). This follows up on our earlier paper, "χ² and classical exact tests often wildly misreport significance; the remedy lies in computers," which is available below.
Mark Tygert, "Testing the significance of assuming homogeneity in contingency-tables/cross-tabulations," Technical Report 1201.1421, arXiv, January 2012: pdf.
This article analyzes homogeneity in contingency-tables/cross-tabulations using the approach of our earlier paper, "χ² and classical exact tests often wildly misreport significance; the remedy lies in computers," which is available below.
William Perkins, Mark Tygert, and Rachel Ward, "χ² and classical exact tests often wildly misreport significance; the remedy lies in computers," Technical Report 1108.4126, arXiv, August 2011; updated, abbreviated version: pdf; extended version: pdf.
This article is the leading and largest salvo in our crusade against the Pearson χ² test. This is the place to start.
William Perkins, Mark Tygert, and Rachel Ward, "Computing the confidence levels for a root-mean-square test of goodness-of-fit, II," Technical Report 1009.2260, arXiv, September 2010: pdf, ps.
This article extends its predecessor (which is available here); the models in the new paper involve parameter estimation.
Mark Tygert, "Analogues for Bessel functions of the Christoffel-Darboux identity," Technical Report 1351, Yale University, Department of Computer Science, March 2006: pdf, ps.
Many thanks to Professor F. W. J. Olver (R.I.P.) of the University of Maryland for pointing out formula 57.21.1 in E. R. Hansen's A Table of Series and Products, which provides a more general formulation of one of the analogues (thus obviating the need for publishing this technical report).