How To Produce Ultra-Compact Vector Graphic Plots With Orthogonal Distance Fitting

1. Introduction

are a natural medium for visualizing mathematical functions. With vector graphics, a function is approximated by segments of connected cubic Bézier curves that are rasterized (i.e. converted into pixels) when they are displayed [1]. Because rasterization is delayed, vector graphic images are naturally more portable than pixel-based images as the rendering can be tailored to its display environment. No matter how much you zoom in on a vector graphics plot, you will always see crisp line segments; whereas if you zoom in enough on a rasterized image, you will eventually see the grainy blocks that represent the pixels.

While vector graphics are an excellent format for plotting, producing good vector graphics can be challenging. Consider, for example, how we might adopt this example plot from matplotlib [2] for the function f(t) = exp(−t)cos(−2πt), 0 ≤ t ≤ 5, to produce an SVG image:

import matplotlib.pyplot as plt
import numpy as np
def f(t):
return np.exp(-t) * np.cos(2*np.pi*t)
t = np.arange(0.0, 5.0, 0.02)
plt.plot(t, f(t))
plt.savefig(‘fig.svg’)

Piecewise linear plot of the function exp(−t)cos(−2πt), 0 ≤ t ≤ 5, with 250 segments. Figure by author.

Matplotlib approximates the smooth plot with 250 piecewise connected line segments (shown with the alternating colors). While the plot looks good, it is much larger than it needs to be. If, instead, we were producing a rasterized image like a PNG, the details of how the curve is constructed wouldn’t matter; but with a vector graphic the individual line segments are passed through and preserved in the outputted image. With some adjustments, though, we can improve the size substantially without sacrificing the quality of the image.

The basic primitive of vector graphics is the parametric cubic, represented as a cubic Bézier curve. With piecewise cubic Bézier curves, we have a lot more knobs we can adjust to approximate functions than if we restrict ourselves to piecewise linear or piecewise (nonparametric) cubic segments. Using the orthogonal distance fitting (ODF) algorithm from this paper, we can produce a plot of the function that requires only 8 Bézier segments and is visually indistinguishable from the matplotlib graphic. Below I show the plot’s representation in TikZ. Later I will show how we can easily turn the TikZ command into an SVG image for the web using MetaPost.

\draw (0.00000, 2.50000) .. controls (0.39088, 2.10882) and (0.61233, -1.05415) ..
(1.18750, 0.36510) .. controls (1.60393, 1.31193) and (1.71252, 2.10800) ..
(2.37500, 0.95127) .. controls (2.88922, 0.15365) and (3.15477, 0.95162) ..
(3.56250, 1.11921) .. controls (3.98299, 1.31658) and (4.26123, 0.78781) ..
(4.75000, 0.82415) .. controls (5.02221, 0.81871) and (5.38203, 1.12388) ..
(5.93750, 0.99939) .. controls (5.96426, 1.01981) and (6.36975, 0.82102) ..
(7.12500, 0.95127) .. controls (8.08129, 1.04760) and (7.44859, 0.87986) ..
(9.50000, 0.96171);

Plot of the function exp(−t)cos(−2πt), 0 ≤ t ≤ 5, with Bézier curves fit using ODF. Figure by author.

The algorithm builds on Alvin Penner’s work [3]. Given a parametric function f, it first fits a Chebyshev series to approximate f. For analytic functions, interpolation in Chebyshev nodes provides rapid geometric convergence [4]. This is far better than the typically fourth order convergence you will get with interpolation in cubic splines and much better than the quadratic convergence you will get with piecewise linear interpolation [5]. By interpolating in Chebyshev nodes, we can reduce the number of times we need to evaluate f. Using a trust-region optimizer, the algorithm then looks for a cubic Bézier curve that optimally approximates the target function. If the maximal orthogonal distance between the fitted Bézier curve and the Chebyshev series for f is less than a target threshold, then, great, we’re done; otherwise, we split the domain in two and repeat the process. I detail the steps below.

Algorithm F (fit a Bézier path to a function). Given a parametric function f (t) = (f_x (t), f_y (t)), a ≤ t ≤ b, fit a Bezier path, g, to f so that

F1. Using the algorithm developed by Jared Aurentz and Lloyd Trefethen from [7], fit Chebyshev series f~_x , f~_y to f_x, f_y. When f is analytic or differentiable to a high degree, the fit is usually close to machine precision so going forward we assume that we can use f~ as a proxy for f and that any loss in accuracy will be negligible.

F2. Using a trust region optimizer [8] and Penner’s algorithm [3], fit a Bezier curve g to minimize

More details for this step are provided in §3.

F3. Compute

If M is less than the target threshold, terminate; otherwise, set

and repeat steps F1 through F3 for f_l and f_r until the threshold is reached. See §4 for more details.

In the next section, I describe how to fit arbitrary functions with Algorithm F using the Python package bbai (https://github.com/rnburn/bbai).

2. Functionality Tour

The below code demonstrates the basic task of fitting a function to a specified window:

from bbai.graphics import BezierPath
import numpy as np
def f(t):
return np.exp(-t * t)
path = BezierPath(
dst_xmin=0, dst_xmax=9.5,
dst_ymin=0, dst_ymax=2)
path.fit(f, -2, 2)
print(path.tikz_)

outputs

\draw (0.000, 0.000)..controls (2.684, 0.092) and (3.273, 1.952)
.. (4.750, 2.000)..controls (6.229, 1.951) and (6.815, 0.092)
.. (9.500, 0.000);

Gaussian, fit with two Bézier curves. Figure by author.

By default, the library will scale the plot so that the function just fits in the window defined by dst_xmin, dst_xmax, dst_ymin, and dst_ymax; but that can be changed by also specifying a source window with src_xmin, src_xmax, src_ymin, and src_ymax. The algorithm uses 1.0 × 10^-2 as the default maximum orthogonal distance threshold which, for perspective, is one hundredth of TikZ’s default line width.

We can also fit parametric functions by providing both an x and a y function.

from bbai.graphics import BezierPath
import numpy as np
R, r, d = 5, 3, 5
def fx(t):
return (R-r)*np.cos(t) + d*np.cos((R-r)/r*t)
def fy(t):
return (R-r)*np.sin(t) – d*np.sin((R-r)/r*t)
path = BezierPath(
dst_xmin=0, dst_xmax=2.5,
dst_ymin=0, dst_ymax=2.5)
path.fit(fx, fy, 0, 2*np.pi*r*d/R)
print(path.tikz_)

outputs

\draw (2.500, 1.250)..controls (2.533, 1.060) and (1.400, 0.745)
.. (0.850, 0.578)..controls (-0.049, 0.321) and (-0.145, 0.403)
.. (0.148, 0.876)..controls (0.197, 0.986) and (1.203, 2.351)
.. (1.405, 2.450)..controls (1.594, 2.564) and (1.722, 2.598)
.. (1.716, 1.250)..controls (1.722, -0.033) and (1.609, -0.085)
.. (1.405, 0.049)..controls (1.203, 0.149) and (0.197, 1.514)
.. (0.149, 1.624)..controls (-0.137, 2.086) and (-0.067, 2.185)
.. (0.851, 1.921)..controls (1.203, 1.809) and (2.534, 1.455)
.. (2.5000, 1.2500);

Hypotrochoid, fit with 8 Bézier curves. Figure by author.

3. The Fitting Algorithm

This section breaks down Step F2 in greater detail where we fit a Bézier curve, g, to approximate f~. The algorithm is similar to Penner’s algorithm from [3] but with a few modifications. A cubic Bézier curve is parameterized by four points. In the fitting algorithm, we match the endpoints of f~ which gives us two points or four parameters that we can adjust. Let θ denote the adjustable parameters; let B_θ denote the cubic Bézier curve with parameters θ that matches the endpoints f~(a) and f~(b); and define

Basic calculus tells us that s_t must either be an endpoint or it must satisfy the equation

As B_θ is a parametric cubic, the values of s that satisfy the equation are the roots of a quintic which can be easily solved for by finding the eigenvalues of its associated colleague matrix [4].

We can use a Clenshaw Curtis quadrature to approximate the objective function

The gradient and Hessian of h can be computed using the implicit function theorem. See [3] for the equations. We can now put together the fitting algorithm.

Algorithm B (fit a Bézier curve to a function). Given a parametric function f~(t) = (f~_x (t), f~_y (t)), a ≤ t ≤ b, find parameters θ to minimize h(θ).

B1. If possible, select θ_0 so that B_{θ_0} matches the curvature of f~ at a and b; otherwise, select θ_0 so that B_{θ_0} is the line segment that passes through f(a) and f(b).

B2. Starting from θ_0 and using h, ∇h, and ∇^2 h, step a trust-region optimizer [8] until either we’ve come suitably close to an optimum or we’ve exceeded a predetermined maximum number of steps

The major advantage of using a trust-region optimizer for Step B2 is that it won’t get stuck at saddle points. By using second order information combined with an adaptive “trust region”, a trust-region optimizer can still make progress even when ∇h is near zero and ∇^2h is indefinite.

4. The Max Distance Algorithm

This section breaks down Step F3 in greater detail where given a cubic Bézier curve, g, we compute the maximum orthogonal distance from g to f. With s_t defined as in §3, put

and define the following algorithm:

Algorithm M (find maximum orthogonal distance). Given a parametric function f~(t) = (f~_x(t), f~_y(t)), a ≤ t ≤ b, and a Bézier curve g, solve argmax_t r(t).

M1. Set D_max ← 0.

M2. Using the bisection method, start from the interval [a, b] and find a local maximum t_0 of r. Set D_max ← max(D_max, r(t_0)).

M3. Put s_0 ← s_{t_0} and let s_l0′ , s_l0′′ , s_r0′ , and s_r0′′ denote the left and right derivatives for

Note that the left-hand and right-hand values may differ. Define the functions

and

M4. Find the smallest h_l>0 and h_r>0 such that

If suitable h_l and h_r exist, repeat Steps M2 through M4 for [a, t_0 −h_l] and [t_0 +h_r,b]. Otherwise, return D_max.

Remember from Algorithm F that f~ is represented as a Chebyshev series so it’s an easy step to compute the Chebyshev series representations for r~_l and r~_r. Thus, Step M4 can be accomplished by applying Boyd’s root finding algorithm [6].

The figure below shows the result of the max distance algorithm when trying to fit the function f(t)=(.1+t) sin(t), 0 ≤ t ≤ 2π, with only a single Bézier curve,

\draw (0.000, 1.800)..controls (4.484, 4.660) and (8.448, -3.442)..(9.500, 1.800);

We can see that even though there are multiple local optima, Algorithm M correctly identifies the global optimum.

Max error found by Algorithm M when approximating (.1 + t)sin(t), 0 ≤ t ≤ 2π, (black) with a single Bézier curve (blue). Figure by author.

5. How to Produce SVG Images

In this section, I’ll show how to produce an SVG image from the TikZ path that bbai generates. I’ll also show how we can draw axes and annotate. One way to produce SVG images is embed the TikZ drawing commands into a latex document, run lualatex with the –output-format=dvi option, then use dvisvgm to convert the dvi file to an SVG image, as described in the TikZ manual (see §10.2.4 of [9]). However, if the goal is to produce an SVG graphic, I find it easier to to use the MetaPost tool which can output to SVG directly [10].

MetaPost provides a picture drawing language. Using its command line utility mpost, which should be included as part of a TeX Live installation, we can quickly produce an SVG plot. It accepts the same command for drawing paths as TikZ. Here, for instance, is how we would produce an SVG image for the plot from §1.

% plt.mp
outputformat := “svg”;
outputtemplate := “%j-%c.svg”;
prologues:=3;
beginfig(1);
\draw (0.00000, 2.50000) .. controls (0.39088, 2.10882) and (0.61233, -1.05415) ..
(1.18750, 0.36510) .. controls (1.60393, 1.31193) and (1.71252, 2.10800) ..
(2.37500, 0.95127) .. controls (2.88922, 0.15365) and (3.15477, 0.95162) ..
(3.56250, 1.11921) .. controls (3.98299, 1.31658) and (4.26123, 0.78781) ..
(4.75000, 0.82415) .. controls (5.02221, 0.81871) and (5.38203, 1.12388) ..
(5.93750, 0.99939) .. controls (5.96426, 1.01981) and (6.36975, 0.82102) ..
(7.12500, 0.95127) .. controls (8.08129, 1.04760) and (7.44859, 0.87986) ..
(9.50000, 0.96171);
endfig;
end.

Running the command

mpost plt.mp

will produce an SVG for the path as plt-1.svg. We can use the MetaPost commands drawarrow and label to add some axes. Here is source code after rescaling and adding the axes:

% plt.mp
outputformat := “svg”;
outputtemplate := “%j-%c.svg”;
prologues:=3;
beginfig(1);
path pth;
pth := (0.00000, 2.50000) .. controls (0.39088, 2.10882) and (0.61233, -1.05415) ..
(1.18750, 0.36510) .. controls (1.60393, 1.31193) and (1.71252, 2.10800) ..
(2.37500, 0.95127) .. controls (2.88922, 0.15365) and (3.15477, 0.95162) ..
(3.56250, 1.11921) .. controls (3.98299, 1.31658) and (4.26123, 0.78781) ..
(4.75000, 0.82415) .. controls (5.02221, 0.81871) and (5.38203, 1.12388) ..
(5.93750, 0.99939) .. controls (5.96426, 1.01981) and (6.36975, 0.82102) ..
(7.12500, 0.95127) .. controls (8.08129, 1.04760) and (7.44859, 0.87986) ..
(9.50000, 0.96171);
draw pth scaled 50;
numeric xlim, ylim;
xlim := xpart urcorner currentpicture;
ylim := ypart urcorner currentpicture;
drawarrow (-10, -10) — (xlim, -10);
drawarrow (-10,-10) — (-10, ylim);
label.bot(btex $x$ etex, (xlim, -10));
label.lft(btex $y$ etex, (-10, ylim));
endfig;
end.

Example SVG produced with MetaPost. Figure by author.

6. Benchmarks

In this sections I measure how long it takes me to compute Bezier paths for various functions. I didn’t spend a lot of time optimizing the algorithm’s implementation so I’m sure that these numbers could be improved substantially. The main takeaway should be that the algorithm is at least fast enough to be practical for many common cases. The examples are all taken from [4], and all the functions are fit over the range −1 ≤ t ≤ 1.

Benchmarks for fitting Bézier paths to various functions. Figure by author.

7. Conclusions

For analytic or functions that are differentiable to a high degree, we’ve seen that Algorithm F provides an efficient and practical way to generate a minimal representation as a Bézier path, which can then be used to produce a vector graphic.

One area of future work might be to extend the algorithm to work better for functions with kinks (i.e. points where the function is not analytic).

References and Notes

[1] As far as I’m aware, all of the major vector graphics formats (e.g. SVG, postscript) use cubic Bézier curves as the core primitive. While some of the formats provide other basic graphics like circles, etc, those are all just wrappers on top of cubic Bézier curve approximations.

[2] The example is adapted from https://matplotlib.org/stable/gallery/pyplots/pyplot_two_subplots.html.

[3] Alvin Penner. Fitting a cubic Bézier to a parametric function. The College Mathematics Journal, 50(3): 185–196, 2019.

[4] Lloyd N. Trefethen. Approximation theory and approximation practice. SIAM, 2020.

[5] C. A. Hall. On error bounds for spline interpolation. Journal of Approximation Theory, 1: 209–218, 1968.

[6] John Boyd. Computing zeros on a real interval through Chebyshev expansion and polynomial rootfinding.SIAM Journal on Numerical Analysis, 40(5): 1666–1682, 2003.

[7] Jared Aurentz, Lloyd N. Trefethen. Chopping a Chebyshev series. ACM Transactions on Mathematical Software, 43(4): 1–21, 2017.

[8] Jorge Nocedal, Stephen J. Wright. Numerical optimization, second edition. Springer, 2000.

[9] The TikZ and PGF Packages, 2026. https://tikz.dev.

[10] John D. Hobby, Metapost, 2024. https://www.tug.org/docs/metapost/mpman.pdf.

What's Hot

Walmart’s AI workflows meet the realities of the balance sheet

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

Exploring Income Patterns with Python Pandas, Matplotlib, and Seaborn

Walmart’s AI workflows meet the realities of the balance sheet

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

Exploring Income Patterns with Python Pandas, Matplotlib, and Seaborn

How to Fine-Tune LFM2 Using QLoRA and DPO: A Complete Step-by-Step Coding Tutorial on Google Colab

From Local App to Public Website in Minutes

Anthropic IPO filing marks AI maturing into enterprise utility

Walmart’s AI workflows meet the realities of the balance sheet

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

Exploring Income Patterns with Python Pandas, Matplotlib, and Seaborn

Google Will Now Let You Virtually Try on Clothes With Just a Selfie

What’s in a Name? How to Get Your Domain Right

Speed Across the Galaxy Next Year in Star Wars: Galactic Racer

News

Company

Services

What's Hot

How To Produce Ultra-Compact Vector Graphic Plots With Orthogonal Distance Fitting

1. Introduction

2. Functionality Tour

3. The Fitting Algorithm

4. The Max Distance Algorithm

5. How to Produce SVG Images

6. Benchmarks

7. Conclusions

References and Notes

Related Posts

News

Company

Services

Subscribe to Updates