Statistical Methods¶

This section documents the mathematical foundations behind MORIE’s estimators. The methods are dataset-agnostic — they apply to any suitably-shaped tabular input, including the OTIS placement records, TPS incident feeds, CPADS survey data, and any other dataset that matches the estimator’s signature.

Causal Estimands
- Summary
- Average Treatment Effect (ATE)
- Average Treatment Effect on the Treated (ATT)
- Average Treatment Effect on the Controls (ATC)
- Group Average Treatment Effect (GATE)
- Conditional Average Treatment Effect (CATE)
- Local Average Treatment Effect (LATE)
- Doubly-Robust AIPW (ATE)
- References
Causal Inference
- Potential outcomes framework
- Inverse Probability Weighting (IPW) — Hájek estimator
- Average Treatment Effect on the Treated (ATT)
- Average Treatment Effect on the Controls (ATC)
- Augmented IPW (AIPW) — Doubly Robust
- G-computation (Outcome Regression)
- eBAC-selection-adjusted IPW
- Sensitivity Analysis
- Average Treatment Effect on the Treated (ATT)
- Average Treatment Effect on the Controls (ATC)
- Group Average Treatment Effect (GATE)
- Conditional Average Treatment Effect (CATE)
- Local Average Treatment Effect (LATE / IV)
- Interactive Regression Model (IRM)
- References
Propensity Scores
- Estimation
- Diagnostics
- CPADS covariates
- References
Double Machine Learning (DML)
- Partially Linear Regression
- Cross-fitting
- Neyman orthogonality
- MORIE implementation
- Interactive Regression Model (IRM)
- Partially Linear IV (PLIV) — LATE estimation
- Nuisance learner defaults
- References
eBAC — Estimated Blood Alcohol Concentration
- Widmark formula
- MORIE variants
- Python API
- eBAC-IPW module
- References
Survey-Weighted Statistics
- Design weights
- Weighted mean and proportion
- Linearization variance
- R usage
- Python usage
- References
Survey Sampling
- Simple Random Sampling
- Stratified Random Sampling
- Cluster Sampling
- Probability Proportional to Size (PPS)
- Horvitz-Thompson and Hájek Estimators
- Bootstrap and Jackknife Variance Estimation
- Effective Sample Size
- Calibration / Raking
- References
Dataset-Agnostic Analysis
- Levels of Measurement
- Dataset Profiling
- Analysis Plan Suggestion
- Usage Example
- References
Psychometric Methods
- Reliability
- Factor Analysis Prerequisites
- Construct Validity
- MAPQ Validated Results
Ontario Restrictive Confinement (OTIS)
- Data
- Alert-State Encoding
- Methods
- Usage
- References
OTIS Linkage Constraints — Read Before Doing Any Individual-Level Analysis
- What the official dictionary says (verified 2026-05-08)
- What this means in practice
- Empirical confirmation
- Which analyses are valid
- Which analyses are INVALID — never trust their output
- How we got here (2026-05-08)
- Sources
MRM modules
- Vocabulary
- MRM (10 estimators)
- Per-row formulations (panel data)
- Aggregate formulations (count outcomes)
- Doob chi-square companion
- Constraints
SIU IAP — Federal Structured Intervention Unit Implementation Advisory Panel
- Panel composition (relevant to morie)
- Two distinct panels
- Reports indexed
- Programmatic access
- See also
Sprott-Doob CRIMSL + Schulich Law SIU analyses
- The four reports
- Headline replicated findings
- Mandela classifier
- χ² verification
- Mandela-RF — applying the classifier to OTIS provincial data
- Position in the MRM stack
- Citations
Doob T-539-20 Federal Court affidavit replication
- Three CCRSO 2018 tables hardcoded
- Decoupling test
- Note on Figures 1-4
- Position in the MRM stack
- Citation
Toronto Police Service (TPS) Statistics
- Modules
- Datasets
- Quick start
- References
Hawkes Self-Exciting Point Processes
- Modules
- Mathematical content
- Asymptotic theory
- Application
- Reference
Spatial Statistics
- Global Autocorrelation
- Local Indicators of Spatial Association (LISA)
- Geostatistical Interpolation
- Geographically Weighted Regression (GWR)
- Spatial Weight Matrices
- Point Pattern Analysis
- Density-Based Clustering
- Kulldorff Space-Time Scan
- Visualisation Helpers
- Spatio-Temporal Extensions
- References
Statistical Physics of Crime
- Methods
- Companion methods (under spatial)
Key Empirical Findings
- Spatial autocorrelation (Toronto Police Service, 2024)
- Hawkes self-exciting point processes (post-2014 TPS)
- OTIS placements (Ontario Tracking and Information System)
- How to reproduce
TurboQuant — Vector Quantization
- Weight Quantization vs KV-Cache Quantization
- References
- Algorithm
- Theoretical Bounds
- Experimental Results
- Implementation
- Codebook Resolution
- Reproduce
- Critical Notes
MORIE Inference Engine
- Architecture
- Components
- MLX Integration — Apple Silicon GPU
- C Kernel Acceleration
- GGUF Loader — Verified Results
- Tokenizer
- Engine — Forward Pass
- KV-Cache Compression — Benchmark Results
- References
Signal Processing & Biomedical Analysis
- Digital Filters
- Spectral Analysis
- Fractal Complexity
- ECG and Heart Rate Variability
- Phonocardiogram (PCG) Analysis
- References
Homomorphic Deconvolution & Cepstral Analysis
- Real Cepstrum
- Complex Cepstrum
- Liftering and Deconvolution
- Application: PCG S1/S2 Decomposition
- References
Post-Quantum Cryptography (Research)
- ML-KEM-768 (FIPS 203)
- ChaCha20-Poly1305 (RFC 8439)
- HKDF-SHA256 (RFC 5869)
- Hybrid KEM-DEM Construction
- CLI Usage
- Keystore
- Security Considerations
Population Genetics
- Sequence-Level Metrics
- Population Differentiation
- Linkage Disequilibrium
- Genome-Wide Association Studies
- Epidemiological Applications
- References
Polyglot REPL
- Language Detection
- Compiled Languages
- Variable Bridging
- Modes
- LLM Chat Integration
- Headless Mode
- Callable Helpers
Deployment
- From source
- Docker
- Ollama Sidecar
- systemd Service
- Install Methods
- Cross-Platform Notes

Quick reference¶

Each entry below names the estimator, the estimand it targets, and the Python function that produces it.

Causal estimators¶

run_propensity_ipw_analysis — IPW (Hájek), ATE
estimate_att — IPW (Hájek), ATT
estimate_atc — IPW (Hájek), ATC
estimate_aipw — AIPW (doubly robust), ATE
estimate_gate — GATE (AIPW per group)
estimate_cate — T-learner / S-learner, CATE (per unit)
estimate_late — 2SLS / Wald IV, LATE
estimate_ate — DML–PLR, ATE
estimate_irm — DML–IRM, heterogeneous ATE
estimate_pliv — DML–PLIV, LATE
estimate_ate_gcomputation — G-computation, ATE
run_ebac_selection_ipw_analysis — eBAC-IPW, selection-adjusted ATE
e_value — E-value sensitivity bound
sensitivity_rosenbaum — Rosenbaum bounds

Logistic / model comparison¶

run_weighted_logistic_analysis — weighted logistic, OR
compare_nested_logistic_models — nested-model LRT

Survey + descriptive statistics¶

morie.survey helpers — survey-weighted CIs and prevalence
horvitz_thompson_total — HT estimator, population total
hajek_mean — Hájek estimator, population mean

Power + Bayes¶

run_power_design_module — N required for a given design
Beta-binomial Bayes — posterior mean / CI (see morie.causal)

Psychometrics¶

crba — Cronbach’s α (internal consistency)
mcdo — McDonald’s ω (reliability)
kmo — KMO sampling adequacy
bart — Bartlett’s sphericity (factorability)
paran — Parallel analysis (factor retention)
crel — Composite reliability
ave — Average Variance Extracted (convergent validity)

OTIS-specific¶

rplace — Regional placement counts / proportions
astcmb — Alert-state combination encoding
volat — Regional volatility (movement metric)
otdml — DML IRM on correctional data, ATE / ATT

MORIE

Table of Contents

Related Topics