GSoC 2025 with Optimagic: Adding More Optimizer Interfaces to Optimagic


Intro 📝

This is my final report and blog post for my Google Summer of Code 2025 project, titled ‘Adding More Optimizer Interfaces to Optimagic’, carried out on the Optimagic project under the NumFOCUS organization.

Motivation

I am a postgraduate student in Mathematics. What interests me are topics from topology and numerical optimization, and I worked on numerical methods and boundary value problems (BVPs) for my dissertation. Working with optimagic gave me the perfect reason and opportunity to dive into optimization techniques (not the boring ones my coursework covered) and introduced me to the state-of-the-art algorithms used in real-world applications.

Why Optimization Needs a Little Magic

Optimization is challenging because real-world problems are complex. Functions can be non-linear, non-differentiable, or have multiple optima, making it tough to pinpoint the best solution. For instance, in machine learning, black-box functions where only inputs and outputs are known can be noisy, high-dimensional, or feature steep valleys and flat plateaus, which challenge most algorithms. Some problems require finding global optima, while others need local solutions, and choosing the wrong algorithm can lead to poor results. No single algorithm solves every optimization problem. Depending on the problem’s characteristics, one algorithm may outperform another.

Moreover, switching between algorithms often involves rewriting code between libraries, which is frustrating for researchers and developers focused on solving their problems.

Optimagic addresses these challenges by offering a unified interface for a wide range of optimizers, from gradient-based to derivative-free. Users can switch algorithms seamlessly without modifying their code.

Optimagic: A Unified Interface to Optimizers

Optimagic Logo

Optimagic allows users to experiment with any supported optimizer through a consistent, scipy-like interface. Simply change the algorithm, and optimagic handles the rest. A few highlights:

Flexibility at its core.

PyTrees enable Optimagic to handle a wide variety of input formats, making it highly flexible.

Consistency is the key

Optimagic standardizes parameter names across optimizers. For example, all stopping criteria start with stopping_, like stopping_maxiter (maximum number of iterations), and all convergence criteria begin with convergence_, like convergence_ftol_rel (relative function tolerance).
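As a rough sketch of what this looks like in practice (the algorithm names scipy_lbfgsb and scipy_neldermead and the algo_options argument are standard optimagic usage, but exact option support varies by algorithm, so check the docs):

import numpy as np
import optimagic as om

def sphere(x):
    return x @ x

start = np.arange(3, dtype=float)

# Stopping criteria share the stopping_ prefix and convergence criteria share
# the convergence_ prefix, whatever the underlying optimizer calls them internally.
res_lbfgsb = om.minimize(
    fun=sphere,
    params=start,
    algorithm="scipy_lbfgsb",
    algo_options={"stopping_maxiter": 1_000, "convergence_ftol_rel": 1e-8},
)

res_neldermead = om.minimize(
    fun=sphere,
    params=start,
    algorithm="scipy_neldermead",
    algo_options={"stopping_maxiter": 1_000, "convergence_ftol_abs": 1e-8},
)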

Code Contributions 💻

The primary objective was to add more optimizers to Optimagic, along with some additional changes. These are detailed below.

Optimizers from Nevergrad (Merged)

This was a pretty big PR and I worked on this from Week 1 to Week 4.

Why Nevergrad?

Nevergrad offers a robust set of derivative-free optimization algorithms. Integrating these into Optimagic allows users to leverage cutting-edge methods through the same API. In machine learning and AI, black-box functions, where only inputs and outputs are known and the internals are non-differentiable and non-convex, benefit greatly from derivative-free methods, which explore the function landscape effectively to find optima. The following optimizers from Nevergrad can now be used in optimagic.

  • Covariance Matrix Adaptation Evolution Strategy (CMA-ES)
    CMA-ES has many parameters, and I analyzed them to identify those most critical to its adaptation process.

  • OnePlusOne Evolution Strategy
    A simple (1+1) evolution strategy: one parent generates one offspring per iteration, and the better of the two is kept.

  • Random Search
    A one-shot method for sampling the search space which serves as the baseline.

  • Sampling Search
    An improved variant of Random Search.

  • Differential Evolution
    An evolutionary method for global optimization.

  • Bayesian Optimization
    A wrapper around the bayes_optim package.

  • Estimation of Distribution Algorithm (EDA)
    A probabilistic method for adaptive sampling.

  • Test-Based Population-Size Adaptation (TBPSA)
    An algorithm for continuous noisy optimization.

  • Estimation of Multivariate Normal Algorithm (EMNA)
    A probabilistic method for adaptive sampling.

  • NGOPT Optimizers
    NGOpt (Nevergrad Optimizer) is the optimizer selection wizard of Nevergrad. Nevergrad’s meta-optimizers dynamically switch between algorithms based on the function landscape and optimization history.

  • META Optimizers
    These combine derivative-free global optimizers with derivative-based local optimizers to refine solutions.

Note

We skipped SPSA from Nevergrad due to failing tests and lack of tunable parameters. I also excluded optimizers like ConfSplit, since similar functionality can be achieved through multiple optimization runs.

Example usage

import numpy as np
import optimagic as om

om.minimize(
    fun=lambda x: x @ x,
    params=np.arange(3, dtype=float),
    bounds=om.Bounds(lower=np.full(3, -5), upper=np.full(3, 5)),
    algorithm=om.algos.nevergrad_meta(optimizer="BFGSCMAPlus"),
)

Adding needs_bounds and supports_infinite_bounds fields in the AlgoInfo (Merged)

During Week 5 and Week 6, I worked on this PR. There was an open issue on this topic.

While global optimizers typically require bounds, optimizers from Nevergrad can operate without them, in which case they sample from a normal distribution with a given standard deviation. Local algorithms often run unbounded, while global ones need bounds. I researched which algorithms could run without bounds and which supported infinite bounds. As a result, we added two new fields to the AlgoInfo class: needs_bounds and supports_infinite_bounds.
Example code snippet

from optimagic.algorithms import AVAILABLE_ALGORITHMS

# All algorithms that can handle bounds at all.
algos_with_bounds_support = [
    algo
    for name, algo in AVAILABLE_ALGORITHMS.items()
    if algo.algo_info.supports_bounds
]
# Of those, the ones that cannot run without bounds ...
my_selection = [
    algo for algo in algos_with_bounds_support if algo.algo_info.needs_bounds
]
# ... and the ones that also accept infinite bounds.
my_selection2 = [
    algo
    for algo in algos_with_bounds_support
    if algo.algo_info.supports_infinite_bounds
]

Migrate Nevergrad optimizers to new documentation style (Open)

Previously, documentation lived in algorithms.md while the optimizers themselves lived in optimizers.py. I migrated the Nevergrad optimizers to Optimagic’s new documentation style, embedding the details in class and parameter docstrings. Along the way, I also cleaned up remaining loose ends and linked the related issues.

Wrap Local Optimizers from Gradient-Free Optimizers (Open)

After discussing with my mentor, we decided to wrap optimizers from this library, and I worked on this and the following PR from Week 7 to Week 11.

These optimizers work on a discrete search space, so bounds have to be translated into a grid; I used helper functions for this process. I also added documentation and tests to ensure reliability.
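To make this concrete, here is a rough sketch of the idea. The helper below is illustrative and not Optimagic’s actual converter code; it only shows how continuous box bounds can be turned into the dict-of-grids search space that Gradient-Free Optimizers expects.

import numpy as np
from gradient_free_optimizers import HillClimbingOptimizer

def bounds_to_search_space(lower, upper, n_points=200):
    """Discretize continuous box bounds into one grid per dimension."""
    return {
        f"x{i}": np.linspace(lo, hi, n_points)
        for i, (lo, hi) in enumerate(zip(lower, upper))
    }

search_space = bounds_to_search_space(np.full(3, -5.0), np.full(3, 5.0))

def objective(para):
    # Gradient-Free Optimizers maximizes the score, so negate the sphere function.
    x = np.array([para["x0"], para["x1"], para["x2"]])
    return -(x @ x)

opt = HillClimbingOptimizer(search_space)
opt.search(objective, n_iter=1_000)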

A challenge was that the library’s global optimizers struggled to pass tests due to low accuracy and search-space limitations, which is why this PR covers only the local algorithms.

These are all local algorithms:

  • Hill Climbing
    A local optimization method that iteratively moves to better neighboring solutions.

  • Stochastic Hill Climbing
    A variant of hill climbing that incorporates randomness to escape local optima.

  • Simulated Annealing
    A probabilistic technique that mimics the cooling process to find global optima, though used here for local optimization.

  • Repulsing Hill Climbing
    A variation of hill climbing that avoids revisiting solutions.

  • Downhill Simplex Optimization
    A derivative-free method using a simplex to navigate the search space.

  • Powell’s Method
    A local optimization technique using conjugate directions.

I am very thankful to the developer of Gradient-Free Optimizers for patiently helping me understand how things work and for clarifying my doubts through the issue ‘No improvement even after many iterations with some algorithms’.

Wrap Population-Based Optimizers from Gradient-Free Optimizers (Open)

Thanks to the exposed converter, population-based algorithms can now be initialized with an initial population (see the sketch after the list below). The following algorithms are now available in optimagic.

  • Particle Swarm Optimization
    A population-based method inspired by the social behavior of flocks.

  • Spiral Optimization
    A population-based method that searches the space in a spiral pattern.

  • Genetic Algorithm
    A population-based method using principles of natural selection (genes).

  • Evolution Strategy
    A population-based method that evolves a population of solutions (actual values).

  • Differential Evolution
    A population-based method for global optimization, also implemented in Nevergrad.
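To give an idea of what initializing with a population means at the level of the underlying library, here is a hedged sketch using Gradient-Free Optimizers directly. The population and initialize={"warm_start": ...} arguments reflect my reading of the GFO docs and are not Optimagic’s wrapper interface.

import numpy as np
from gradient_free_optimizers import ParticleSwarmOptimizer

search_space = {
    "x0": np.linspace(-5, 5, 21),
    "x1": np.linspace(-5, 5, 21),
}

def objective(para):
    # GFO maximizes, so return the negative of the sphere function.
    return -(para["x0"] ** 2 + para["x1"] ** 2)

# Explicit starting points (on the grid) that seed the swarm before the search begins.
initial_population = [
    {"x0": 1.0, "x1": -2.0},
    {"x0": 3.0, "x1": 0.5},
]

opt = ParticleSwarmOptimizer(
    search_space,
    population=10,
    initialize={"warm_start": initial_population},
)
opt.search(objective, n_iter=500)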

Rework test_many_algorithms

We refactored the test suite to include dynamic tests for needs_bounds and supports_infinite_bounds. We also introduced a dictionary to specify precision requirements for certain algorithms.
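For flavor, here is a sketch of what such a dynamic test can look like; the test body, the CUSTOM_DECIMALS dictionary, and its keys are illustrative rather than the actual test suite.

import numpy as np
import pytest

import optimagic as om
from optimagic.algorithms import AVAILABLE_ALGORITHMS

# Illustrative precision overrides: stochastic global optimizers get a looser
# tolerance than the default (keys are algorithm names, values are decimals).
CUSTOM_DECIMALS = {"nevergrad_randomsearch": 1}

BOUNDED_ALGORITHMS = [
    name for name, algo in AVAILABLE_ALGORITHMS.items() if algo.algo_info.needs_bounds
]

@pytest.mark.parametrize("name", BOUNDED_ALGORITHMS)
def test_bounded_algorithm_solves_sphere(name):
    res = om.minimize(
        fun=lambda x: x @ x,
        params=np.full(2, 2.0),
        algorithm=name,
        bounds=om.Bounds(lower=np.full(2, -5.0), upper=np.full(2, 5.0)),
    )
    decimal = CUSTOM_DECIMALS.get(name, 3)
    np.testing.assert_array_almost_equal(res.params, np.zeros(2), decimal=decimal)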

New Example in class SphereExampleInternalOptimizationProblemWithConverter

PyTrees enable Optimagic to handle diverse inputs. Previously, a test object for PyTrees was missing, so I added an example in internal_optimization_problem.py to support testing with dictionary inputs via converter functions.

Example snippet

from optimagic.optimization.internal_optimization_problem import (
    SphereExampleInternalOptimizationProblemWithConverter,
)
from optimagic.typing import AggregationLevel

problem = SphereExampleInternalOptimizationProblemWithConverter(
    solver_type=AggregationLevel.LEAST_SQUARES
)

# Convert dictionary (pytree) params to the internal flat representation.
problem.converter.params_to_internal({"x0": 2, "x1": 3})

# Convert a dictionary-of-dictionaries derivative to the internal representation.
problem.converter.derivative_to_internal(
    {
        "x0": {"x0": 2, "x1": 3},
        "x1": {"x0": 2, "x1": 3},
    },
    [2],
)

Add L-BFGS optimizer from pyensmallen (Open)

ensmallen is a fast and flexible C++ library for numerical optimization, and pyensmallen provides Python bindings for it. This work is pending due to inactivity in the pyensmallen repository. Once a dependent PR is merged, this will be completed.
Cleaned Report callback PR

Issues raised by me:

What we missed?

The original proposal was to wrap pyensmallen, KNITRO, PRIMA, and Nevergrad. Work on pyensmallen is still pending due to an insufficient response from the repository maintainer. We also had to drop KNITRO, which requires a paid license, and PRIMA, which has a Python interface but is not published on PyPI. After discussing with my mentor, we adjusted the project goals to accommodate these setbacks.

What we achieved?

We successfully wrapped all optimizers from Nevergrad, a very popular library, enabling Optimagic users to access its powerful derivative-free algorithms. We also wrapped several optimizers from the Gradient-Free Optimizers library, with some work still pending, which I will continue to pursue. In the meantime, we made progress on other tasks, such as adding the new needs_bounds and supports_infinite_bounds fields to AlgoInfo and making small improvements here and there. Despite some setbacks, we made substantial progress, and we can proudly say that Optimagic now offers a more comprehensive toolkit for tackling optimization challenges.

Future Work

Wrap Grid Search and SMBO-Based Optimizers from Gradient-Free Optimizers

I plan to wrap Grid Search and other optimizers, including:

  • Bayesian Optimization

  • Tree-Structured Parzen Estimators

  • Forest Optimization

Maintenance 🛠️

  • Support for nonlinear constraints with optimizers from Nevergrad

  • Document particular portfolio and meta optimizers in nevergrad_meta and nevergrad_ngopt that are not sufficiently documented in the Nevergrad documentation but whose details can be sourced from other papers.

  • Support for nonlinear constraints with optimizers from Gradient-Free Optimizers

Acknowledgements

I am deeply grateful to the following individuals and institutions for their support:

Firstly, I thank my GSoC mentors Janos Gabler and Tim Mensinger for their warm welcome, openness to new ideas, and for fostering a constructive and engaging discussion environment. Janos has always emphasised quality over quantity, which has instilled best practices in me, and for that I am deeply grateful.

I also appreciate the contributions of community members who provided valuable feedback and comments on my pull requests.

Finally, I express my gratitude to NumFOCUS and the Google Summer of Code program for providing the opportunity and financial support, enabling me to pursue my academic interests and enhance my technical skills with minimal constraints.