publications
in reverse chronological order; * indicates equal contribution.
2026
-
Byzantine-Robust Optimization under (L_0, L_1)-Smoothness. In Conference on Parsimony and Learning (CPAL), Proceedings Track, 2026
-
Where Does Warm-Up Come From? Adaptive Scheduling for Norm-Constrained Optimizers. arXiv preprint arXiv:2602.05813, 2026
2025
-
Who to Trust? Aggregating Client Knowledge in Logit-Based Federated Learning. arXiv preprint arXiv:2509.15147, 2025
-
Preconditioned Norms: A Unified Framework for Steepest Descent, Quasi-Newton and Adaptive Methods. arXiv preprint arXiv:2510.10777, 2025
-
Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling. arXiv preprint arXiv:2508.16745, 2025
-
Overspecified Mixture Discriminant Analysis: Exponential Convergence, Statistical Guarantees, and Remote Sensing Applications. arXiv preprint arXiv:2510.27056, 2025
-
Fourier Approximation of magMEMS Oscillations: Neural Network Space Handling. Micromachines, 2025
-
Classifier Performance on Long-Tail Distributions. Statistical Analysis and Data Mining: An ASA Data Science Journal, 2025
-
Convergence of the EM Algorithm in KL Distance for Overspecified Gaussian Mixtures. Statistical Papers, 2025
-
Optimizing dynamic pull-in threshold and periodic trajectories for magnetically actuated MEMS (magMEMS) in wearable sensors. Frontiers in Physics, 2025
-
Gradient Descent Fails to Learn High-frequency Functions and Modular Arithmetic. Machine Learning, 2025
2024
-
Intractability of Learning the Discrete Logarithm with Gradient-Based Methods. In Asian Conference on Machine Learning (ACML), 2024
2023
-
A Central Asian Food Dataset for Personalized Dietary Interventions. Nutrients, 2023
-
Long Tail Theory Under Gaussian Mixtures. In European Conference on Artificial Intelligence (ECAI), 2023
-
Application of Image Processing in Evaluation of Hydraulic Fracturing with Liquid Nitrogen: A Case Study of Coal Samples from Karaganda Basin. Applied Sciences, 2023
-
Empirical Analysis of the AdaBoost’s Error Bound. arXiv preprint arXiv:2302.00880, 2023