Preprints
- G. Lugosi, G. Neu:
Online-to-PAC
Conversions: Generalization Bounds via Regret Analysis. Under
review. [slides]
- B. Abélčs, E. Clerico, G. Neu: Generalization bounds for
mixing processes via delayed online-to-PAC conversions. Under review.
- B. Abélčs, E. Clerico, G. Neu: Online-to-PAC generalization bounds under graph-mixing dependencies. Under review.
- G. Neu, N.
Okolo: Offline RL via Feature-Occupancy Gradient Ascent. Under review.
- S. Calo, A. Jonsson, G.
Neu, L. Schwartz, J. Segovia-Aguas: Bisimulation Metrics are
Optimal Transport Distances, and Can be Computed Efficiently. To appear in Advances in Neural Information
Processing Systems 367(NeurIPS), 2024.
Journal papers
- G. Lugosi, M. G. Markakis, G. Neu: On the hardness of
inventory management with censored demand data. In INFORMS Journal
on Optimization, 2023.
- G. Neu and G.
Bartók: Importance
weighting without importance weights: An efficient algorithm for
combinatorial semi-bandits. In Journal on Machine Learning Research
(JMLR), vol. 17(154), pp. 1-21, 2016.
- L. Devroye, G. Lugosi and G. Neu: Random-Walk
Perturbations for Online Combinatorial Optimization. In IEEE
Transactions on Information Theory,
vol. 61, pp. 4099-4106, 2015.
- G. Neu, A.
György, Cs. Szepesvári and A. Antos: Online Markov
Decision
Processes under Bandit Feedback. In IEEE Transactions on Automatic
Control, vol. 59., pp. 676-691, 2014.
- A.
György and G.
Neu: Near-Optimal
Rates for Limited-Delay Universal Lossy Source
Coding. In IEEE Transactions on Information Theory, vol. 60, pp.
2823-2834, 2014.
- G. Neu and
Cs.
Szepesvári: Training
Parsers by Inverse Reinforcement Learning. In Machine
Learning, vol. 77(2), pp. 303-337, 2009.
Refereed conference papers
- G. Neu, M.
Papini, L. Schwartz: Optimistic
Information-Directed Sampling. In Proceedings of the 36th
Annual Conference on Learning Theory (COLT), pp. 3970-4006, 2024.
- G. Neu, N.
Okolo: Dealing with
Unbounded Gradients in Stochastic Saddle-Point Optimizaiton. In proceedings of the 41st International Conference on
Machine Learning (ICML), pp. 37508-37530, 2024.
- G. Gabbianelli, G.
Neu, N. Okolo, M. Papini: Offline Primal-Dual
Reinforcement Learning for Linear MDPs. In Proceedings of
the Twenty-seventh International Conference on Artificial Intelligence
and
Statistics (AISTATS), pp.
3169-3177, 2024.
- G. Neu, J.
Olkhovskaya, S. Vakili: Adversarial
Contextual Bandits Go Kernelized. In Proceedings of the 34th
International Conference
on
Algorithmic Learning Theory (ALT), pp.
907-929, 2024.
- G. Gabbianelli, G.
Neu, M. Papini: Importance-Weighted
Offline Learning Done Right. In Proceedings of the 34th
International Conference
on
Algorithmic Learning Theory (ALT), pp.
614-634, 2024.
- J. Olkhovskaya, J.
Mayo, T. van Erven, G. Neu,
C.-Y. Wei: First-and
Second-Order Bounds for Adversarial Linear Contextual Bandits. In
Advances in Neural Information
Processing Systems 36
(NeurIPS), 2023.
- A. Moulin, G. Neu: Optimistic planning
via regularized dynamic programming. In Proceedings of the
40th International Conference on Machine Learning (ICML), pp.
25337-25357, 2023.
- L. Zierahn, D. van der
Hoeven, N. Cesa-Bianchi, G. Neu:
Nonstochastic
Contextual Combinatorial Bandits. In Proceedings of
the Twenty-sixth International Conference on Artificial Intelligence
and
Statistics (AISTATS), pp. 8771-8813, 2023.
- G. Neu, N. Okolo: Efficient Global
Planning in Large MDPs via Stochastic Primal-Dual Optimization. In
Proceedings of the 34th International Conference
on
Algorithmic Learning Theory (ALT), pp. 1101-1123, 2023.
- G. Gabbianelli, G. Neu, M. Papini: Online learning with
off-policy feedback. In Proceedings of the 34th International
Conference
on
Algorithmic Learning Theory (ALT), pp. 620-641, 2023.
- L. Viano, A. Kamoutsi,
G. Neu, I.
Krawczuk, V. Cevher: Proximal Point
Imitation Learning. In Advances in Neural Information
Processing Systems 35
(NeurIPS), 2022.
- G. Neu, J. Olkhovskaya, M. Papini,
L. Schwartz: Lifting
the information ratio: An information-theoretic analysis of Thompson
sampling for Contextual bandits. In Advances in Neural Information
Processing Systems 35
(NeurIPS), 2022.
- G. Lugosi and G. Neu: Generalization bounds
via convex analysis. In Proceedings of the 34th Annual Conference
on Learning Theory (COLT), pp. 3524-3546, 2022. [slides]
- G.
Neu and J. Olkhovskaya: Online learning in
MDPs with linear function approximation and bandit feedback. In
Advances in Neural Information
Processing Systems 34
(NeurIPS), pp. 10407-10417, 2021.
- G. Neu, G. K. Dziugaite, M.
Haghifam, D. M. Roy: Information-Theoretic
Generalization Bounds for Stochastic Gradient Descent. In
Proceedings of the 33nd Annual Conference on Learning Theory (COLT),
pp. 3526-3545, 2021.
- J. Bas-Serrano, S.
Curi, A. Krause and G. Neu: Logistic Q-Learning.
In Proceedings of
the Twenty-fourth International Conference on Artificial Intelligence
and
Statistics (AISTATS), pp. 3610-3618, 2021. [slides]
- G. Neu and C. Pike-Burke: A Unifying View of
Optimism in Episodic Reinforcement Learning. In Advances in Neural
Information
Processing Systems 33
(NeurIPS), pp. 1392-1403, 2020. [slides]
- G. Neu and J. Olkhovskaya: Efficient and robust
algorithms for adversarial linear contextual bandits. In
Proceedings of the 32nd Annual Conference on Learning Theory (COLT),
pp. 3049-3068, 2020.
- G. Neu and N. Zhivotovskiy: Fast rates for online
prediction with abstention. In Proceedings of the 32nd Annual
Conference on Learning Theory (COLT), pp. 3030–3048, 2020.
- J. Bas-Serrano and G. Neu: Faster saddle-point
optimization for solving large-scale Markov decision processes. In
Conference on Learning for Dynamics and Control (L4DC), pp. 413–423,
2020.
- N. Mücke, G. Neu and L. Rosasco: Beating SGD
saturation with tail-averaging and minibatching. In Advances in
Neural Information
Processing Systems 32
(NeurIPS), pp. 12568-12577, 2019.
- C. Riquelme, H.
Penedones, D. Vincent, H. Maennel, S. Gelly, T. Mann, A. Barreto and G. Neu: Adaptive Temporal-Difference
Learning for Policy Evaluation with Per-State Uncertainty Estimates.
In Advances in Neural Information
Processing Systems 32
(NeurIPS), pp. 11872-11882, 2019.
- W. Kotłowski and G. Neu: Bandit Principal
Component Analysis. In Proceedings of the 32nd Annual Conference on
Learning Theory (COLT), pp. 1994-2024, 2019. [slides]
- G. Lugosi, G. Neu and J. Olkhovskaya : Online influence
maximization with local observations. In Proceedings of the 30th
International Conference
on
Algorithmic Learning Theory (ALT), pp. 557-580, 2019.
- G. Neu and L. Rosasco: Iterate averaging as
regularization for stochastic gradient descent. In Proceedings of
the 31st Annual Conference on Learning Theory (COLT), pp. 3222-3242,
2018.
- N. Cesa-Bianchi, C.
Gentile, G. Lugosi and G. Neu:
Boltzmann
exploration done right. In Advances in Neural Information
Processing Systems 30
(NeurIPS), pp. 6284-6293, 2017. [poster]
- G. Neu and V. Gómez: Fast rates for online
learning in Linearly Solvable Markov Decision Processes. In
Proceedings of the 30th Annual Conference on Learning Theory (COLT),
pp. 1567-1588, 2017. [slides]
- T. Liu, G. Lugosi, G. Neu and D. Tao: Algorithmic
stability and hypothesis complexity. In Proceedings of the
34th International Conference on Machine Learning (ICML), pp. 2159-2167, 2017.
- T. Kocák, G. Neu and M. Valko: Online
learning with Erdős-Rényi side-observation graphs. In Proceedings
of
the 32nd Conference on
Uncertainty in Artificial Intelligence (UAI), pp.
339-346, 2016.
- T. Kocák, G. Neu and M. Valko: Online learning with
noisy side observations. In Proceedings of
the Nineteenth International Conference on Artificial Intelligence and
Statistics (AISTATS), pp. 1186-1194, 2016.
- G. Neu: Explore no more:
Improved high-probability regret bounds for non-stochastic bandits.
In Advances in Neural Information
Processing Systems
28
(NeurIPS), pp. 3150-3158, 2015. [poster] [slides]
- G. Neu: First-order regret
bounds for combinatorial semi-bandits. In Proceedings of the 28th
Annual Conference on Learning Theory (COLT), pp. 1360-1375, 2015.[poster] [slides]
- G. Neu and M. Valko: Online Combinatorial
Optimization with Stochastic Decision Sets and Adversarial Losses.
In Advances in Neural Information Processing Systems
27
(NeurIPS), pp. 2780-2788, 2014. [poster] [slides]
- T. Kocák, G.
Neu, M. Valko and R. Munos: Efficient Learning
by Implicit Exploration
in Bandit Problems with Side Observations. In Advances in Neural
Information Processing Systems
27
(NeurIPS), pp. 613-621, 2014. [poster] [slides]
- A. Sani, G. Neu and A. Lazaric: Exploiting Easy Data
in Online Optimization. In Advances in Neural Information
Processing Systems
27
(NeurIPS), pp. 810-818, 2014. [poster] [spotlight] [talk]
- A. Zimin and G.
Neu: Online
Learning in Episodic Markov Decision Processes by Relative Entropy
Policy Search. In Advances in Neural Information Processing Systems
26
(NeurIPS), pp. 1583-1591, 2013. [poster] [slides]
- G. Neu and G.
Bartók: An
Efficient Algorithm for Learning with Semi-Bandit
Feedback. In Proceedings of the 24th International Conference
on
Algorithmic Learning Theory (ALT), pp. 234-248, 2013. [poster] [slides] Full version in JMLR '16.
- L. Devroye, G.
Lugosi and G. Neu: Prediction
by Random-Walk Perturbation. In
Proceedings of the 26th Annual Conference on Learning Theory (COLT),
pp. 460-473, 2013. [slides] Full version in IEEE T-IT '15.
- G.
Neu, A.
György, and Cs. Szepesvári: The Adversarial
Stochastic Shortest Path
Problem with Unknown Transition Probabilities. In Proceedings of
the
Fifteenth International Conference on Artificial Intelligence and
Statistics (AISTATS), pp. 805-813, 2012. [supplement] [poster]
- A. György and G.
Neu: Near-Optimal
Rates
for Limited-Delay Universal Lossy Source Coding. In 2011 IEEE
International Symposium
on
Information Theory, pp. 2218-2222, 2011.Full
version in IEEE T-IT '14.
- G. Neu, A.
György, Cs. Szepesvári and A. Antos: Online Markov
Decision
Processes under Bandit Feedback. In Advances in Neural Information
Processing Systems 23 (NeurIPS), pp. 1804-1812, 2010. [poster] [spotlight] Full version in IEEE TAC '14.
- G. Neu, A.
György, and Cs. Szepesvári: The Online Loop-free
Stochastic
Shortest-Path Problem. In Proceedings of The 23rd Conference on
Learning Theory (COLT), pp. 231-243, 2010.
- G. Neu and Cs.
Szepesvári: Apprenticeship
Learning using Inverse Reinforcement
Learning and Gradient Methods.
In Proceedings of the 23rd
Conference on
Uncertainty in Artificial Intelligence (UAI), pp. 295-302, 2007.
Other
A note on author order
My
default rule is to list author names alphabetically, and most of my
papers published since 2015 follow this convention. In case you are
curious about the degree of contribution of each author for a specific
paper, do not hesitate to reach out to me or any of my coauthors.
|