ENH: optimize._chandrupatla: add array API support #20689

mdhaber · 2024-05-10T09:01:08Z

Reference issue

What does this implement/fix?

This adds array API support to scipy.optimize._chandrupatla and paves the way toward adding array API support to the other elementwise iterative methods (e.g. _differentiate, _tanhsinh, _nsum, _chandrupatla_minimize, and the bracket finders). I'll propose making the rootfinder, minimizer, and bracket finders public shortly.
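For readers unfamiliar with the pattern, here is a minimal sketch of what "array API support" means for a user-supplied function: the function looks up the namespace of its input and calls only functions from that namespace. The helper `get_namespace` is hypothetical and not part of SciPy. It assumes the input exposes the standard `__array_namespace__` attribute and otherwise falls back to NumPy. Libraries that lack the attribute need a compatibility layer, which this sketch does not attempt.

```python
import numpy as np


def get_namespace(x):
    """Hypothetical helper: return the array namespace of `x`.

    Assumption: arrays that follow the array API standard expose
    `__array_namespace__`; anything else is treated as a NumPy array.
    """
    if hasattr(x, "__array_namespace__"):
        return x.__array_namespace__()
    return np


def residual(x, p):
    # Only namespace functions are used, so the same code can serve
    # any backend whose namespace provides `cos`.
    xp = get_namespace(x)
    return xp.cos(x) - p


x = np.asarray([0.0, np.pi / 2, np.pi])
vals = residual(x, 0.5)
```

The benchmark code in this description takes a simpler route and sets `xp` explicitly for each backend.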

Additional information

The performance of this function compared to the existing bracketing rootfinders is worst when function evaluations are inexpensive, so I compared the performance of:

- brentq
- _chandrupatla (NumPy)
- _chandrupatla (PyTorch, CPU)
- _chandrupatla (CuPy)

when finding the root of xp.cos(x) - p for many values of p. The bracket is always $[0, \pi]$. brentq's default absolute x-tolerance is 2e-12, so I set xatol=1e-12 for _chandrupatla to be more than fair, and I confirmed that both solvers meet their respective tolerances. _chandrupatla finds the roots in a single vectorized call whereas brentq loops with a list comprehension.

import time

import cupy as cp
import matplotlib.pyplot as plt
import numpy as np
import torch
from scipy import optimize
from scipy.optimize._chandrupatla import _chandrupatla

rng = np.random.default_rng(1638083107694713882823079058616272161)

# Problem sizes: 1 to 1e6 roots, logarithmically spaced
ns = np.logspace(0, 6, 30)

times = {np: [], cp: [], torch: [], 'brentq': []}

for xp in [np, cp, torch, 'brentq']:
    # brentq is timed with NumPy inputs, one scalar solve per value of p
    brentq = False
    if xp == 'brentq':
        xp = np
        brentq = True

    for n in ns:
        n = int(n)
        a = xp.asarray(0)
        b = xp.asarray(np.pi)
        # Right-hand sides p, uniformly distributed in [-1, 1)
        ps = xp.asarray(rng.random(size=n) * 2 - 1, dtype=xp.float64)

        def f(x, p):
            return xp.cos(x) - p

        if brentq:
            tic = time.perf_counter()
            ref = [optimize.brentq(f, a, b, args=(p,)) for p in ps]
            toc = time.perf_counter()
            times['brentq'].append(toc - tic)
            np.testing.assert_allclose(ref, np.arccos(ps), atol=2e-12)
            continue

        # Single vectorized call that solves for all values of p at once
        tic = time.perf_counter()
        res = _chandrupatla(f, a, b, args=(ps,), xatol=1e-12)
        toc = time.perf_counter()
        times[xp].append(toc - tic)
        np.testing.assert_allclose(cp.asnumpy(res.x), np.arccos(cp.asnumpy(ps)), atol=1e-12)

plt.loglog(ns, times[np], label='np')
plt.loglog(ns, times[cp], label='cp')
plt.loglog(ns, times[torch], label='torch')
plt.loglog(ns, times['brentq'], label='brentq')
plt.xlabel('number of roots')
plt.ylabel('execution time (s)')
plt.title('Root of `xp.cos(x) - p`')
plt.legend()
plt.show()

The function has a lot of overhead due to bells and whistles (e.g. input validation with nice error messages, rich result object, callback function support, etc.). But for solving a lot of equations, the overhead of function calls eventually becomes problematic for brentq.
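To make the contrast between one vectorized call and a per-root Python loop concrete, here is a minimal sketch using plain bisection in NumPy. It is not the Chandrupatla algorithm and not SciPy's implementation. The helper `bisect_elementwise` is hypothetical, and bisection is used only because it is the simplest bracketing method to vectorize.

```python
import numpy as np


def bisect_elementwise(f, a, b, p, xatol=1e-12, maxiter=200):
    """Hypothetical helper: bisection applied to every element of `p` at once."""
    p = np.asarray(p, dtype=np.float64)
    lo = np.full_like(p, a)
    hi = np.full_like(p, b)
    f_lo = f(lo, p)
    for _ in range(maxiter):
        # Stop once every bracket is narrow enough
        if np.all(hi - lo <= 2 * xatol):
            break
        mid = 0.5 * (lo + hi)
        f_mid = f(mid, p)
        # Keep the half of each bracket across which the sign still changes
        same_sign = np.sign(f_mid) == np.sign(f_lo)
        lo = np.where(same_sign, mid, lo)
        f_lo = np.where(same_sign, f_mid, f_lo)
        hi = np.where(same_sign, hi, mid)
    return 0.5 * (lo + hi)


def f(x, p):
    return np.cos(x) - p


ps = np.linspace(-0.9, 0.9, 7)

# One call advances all brackets together
roots = bisect_elementwise(f, 0.0, np.pi, ps)

# The same work done as one Python-level solve per value of p
looped = np.asarray([bisect_elementwise(f, 0.0, np.pi, p) for p in ps])
```

In the vectorized call, the Python-level cost of each iteration is paid once regardless of the number of roots. In the loop it is paid once per root, which is the overhead that eventually dominates for brentq in the plot above.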

In this case, you could probably get better performance with cython_optimize, but for more expensive functions with overhead, like finding the argus(1) distribution ppf (#17719 (comment)), the advantage is much more pronounced.

import time

import matplotlib.pyplot as plt
import numpy as np
from scipy import optimize, stats
from scipy.optimize._chandrupatla import _chandrupatla

# Problem sizes: 1 to 1e4 probabilities, logarithmically spaced
ns = np.logspace(0, 4, 30)

times = {np: [], 'brentq': []}

# Frozen distribution; its ppf is found by solving cdf(x) - p = 0
dist = stats.argus(1)

def f(x, p):
    return dist.cdf(x) - p

for xp in [np, 'brentq']:
    brentq = False
    if xp == 'brentq':
        xp = np
        brentq = True

    for n in ns:
        n = int(n)
        a = 0.001
        b = 0.999
        ps = np.linspace(0.005, 0.995, n)

        if brentq:
            # One scalar solve per probability
            tic = time.perf_counter()
            ref = [optimize.brentq(f, a, b, args=(p,)) for p in ps]
            toc = time.perf_counter()
            times['brentq'].append(toc - tic)
            continue

        # Single vectorized call for all probabilities
        tic = time.perf_counter()
        res = _chandrupatla(f, a, b, args=(ps,), xatol=1e-12)
        toc = time.perf_counter()
        times[xp].append(toc - tic)

plt.loglog(ns, times[np], label='np')
plt.loglog(ns, times['brentq'], label='brentq')
plt.xlabel('number of probabilities')
plt.ylabel('execution time (s)')
plt.title('Root of `argus(1).cdf(x) - p`')
plt.legend()
plt.show()

newton is definitely faster than this function, but it should be - the algorithm converges faster. (The advantage of bracketing methods is that convergence is guaranteed if the bracket is valid.) We can add that as another method with the same framework.
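The trade-off between convergence speed and guaranteed convergence can be illustrated with a small pure-Python sketch that counts iterations for one root of cos(x) - p. Plain bisection stands in for the bracketing family here. brentq and Chandrupatla's method typically need far fewer iterations than bisection, so the gap shown is an upper bound on the difference, not a measurement of this PR's function. Both helpers are illustrative and are not SciPy code.

```python
import math


def bisect(f, a, b, xtol=1e-12):
    """Plain bisection; returns the root estimate and the iteration count."""
    fa = f(a)
    nit = 0
    while b - a > 2 * xtol:
        m = 0.5 * (a + b)
        fm = f(m)
        nit += 1
        # Keep the half-interval across which the sign changes
        if (fm > 0) == (fa > 0):
            a, fa = m, fm
        else:
            b = m
    return 0.5 * (a + b), nit


def newton(f, fprime, x0, xtol=1e-12, maxiter=50):
    """Newton's method; needs a derivative and a good enough starting point."""
    x = x0
    for nit in range(1, maxiter + 1):
        step = f(x) / fprime(x)
        x -= step
        if abs(step) <= xtol:
            return x, nit
    raise RuntimeError("Newton iteration did not converge")


p = 0.3
f = lambda x: math.cos(x) - p
fprime = lambda x: -math.sin(x)

x_bisect, nit_bisect = bisect(f, 0.0, math.pi)
x_newton, nit_newton = newton(f, fprime, math.pi / 2)
```

Bisection halves the bracket on every iteration, so shrinking $[0, \pi]$ to a width of 2e-12 takes on the order of 40 halvings, while Newton's quadratic convergence needs only a handful of steps from a reasonable starting point. Newton can fail from a poor starting point (for example, where the derivative vanishes), whereas the bracketing loop always terminates for a valid bracket.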

~~The tests are quite strict; I'm ironing out a few failures with alternative backends.~~ Done if CI looks good.

The function currently uses fancy indexing assignment. I can investigate working around that for strict array API support later.
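One common way to avoid index-based assignment is to express the update with `where`, which is part of the array API standard. The sketch below is illustrative only. It assumes the assignments in question update a subset of "active" elements, and it is not necessarily the workaround that would be adopted here. All array names are made up for the example.

```python
import numpy as np

x = np.asarray([1.0, 2.0, 3.0, 4.0])
new = np.asarray([10.0, 20.0, 30.0, 40.0])
active = np.asarray([True, False, True, False])

# In-place update through an integer index array ("fancy indexing")
x_indexed = x.copy()
idx = np.nonzero(active)[0]
x_indexed[idx] = new[idx]

# Functional equivalent: select per element, no in-place assignment
x_where = np.where(active, new, x)
```

The `where` form evaluates `new` for every element, including inactive ones, so it trades some redundant work for portability across backends.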

tupui

Nice, I just have a few questions, the rest is pretty straightforward and LGTM 👍

scipy/_lib/_array_api.py

scipy/_lib/_elementwise_iterative_method.py

scipy/optimize/_chandrupatla.py

tupui

Last changes LGTM, letting the CI run and then I think it's good to merge.

mdhaber · 2024-05-13T14:24:14Z

Ok! Remaining failures seem unrelated - a package install conflict and slow tests, hopefully one of which is a temporary glitch.

tupui · 2024-05-13T17:19:48Z

Argh, you have conflicts now.

scipy/optimize/_bracket.py

mdhaber added 3 commits May 9, 2024 11:30

ENH: _lib._elementwise_iterative_method: add array API support

00ac018

ENH: optimize._chandrupatla: add array API support

287acb4

TST: optimize._chandrupatla: test array API support

2f2c909

mdhaber added the enhancement, scipy.optimize, and array types labels May 10, 2024

mdhaber requested a review from tupui May 10, 2024 09:01

github-actions bot added scipy.integrate scipy._lib labels May 10, 2024

mdhaber removed the scipy.integrate label May 10, 2024

tupui self-assigned this May 10, 2024

tupui reviewed May 10, 2024

mdhaber added 6 commits May 12, 2024 11:49

STY: optimize._chandrupatla: fix PEP8

4bbcfff

Merge remote-tracking branch 'upstream/main' into xp_eim

0e5ee28

MAINT: optimize._chandrupatla: fix first torch special case bug

96a2701

MAINT: optimize._chandrupatla: fix torch issue in test_vectorization

277c184

MAINT: optimize._chandrupatla: fix second torch special case issue

cac99cf

TST: optimize._chandrupatla: remove unnecessary test

ed9eec5

mdhaber marked this pull request as ready for review May 12, 2024 20:28

mdhaber requested review from andyfaff and steppi as code owners May 12, 2024 20:28

mdhaber requested a review from tupui May 12, 2024 20:28

tupui approved these changes May 12, 2024

mdhaber added 2 commits May 12, 2024 20:12

MAINT: _lib.xp_clip: resolve array_api_strict complaint

b02d601

MAINT: _lib.xp_clip: fix dtype bug

84765ae

Merge branch 'main' into xp_eim

b3c4769

mdhaber commented May 13, 2024

scipy/optimize/_bracket.py (outdated, resolved)

Update scipy/optimize/_bracket.py

f0c89ea

tupui merged commit be0d426 into scipy:main May 13, 2024
28 of 31 checks passed

tupui added this to the 1.14.0 milestone May 13, 2024

lucascolley mentioned this pull request May 17, 2024

ENH: array types: add JAX support #20085

Merged

3 tasks

mdhaber mentioned this pull request May 26, 2024

ENH: optimize.elementwise: vectorized scalar optimization and rootfinding tools #20800

Open

3 tasks
