A Pandoc filter to include figures generated from Python code blocks.
A Pandoc filter to include figures generated from Python code blocks. Keep the document and Python code in the same location. Output is captured and included as a figure.
pandoc-pyplot - A Pandoc filter to generate Matplotlib/Plotly figures directly in documents
pandoc-pyplot
turns Python code present in your documents into embedded figures via Matplotlib or Plotly.
Usage
Markdown
The filter recognizes code blocks with the .pyplot
or .plotly
classes present in Markdown documents. It will run the script in the associated code block in a Python interpreter and capture the generated Matplotlib/Plotly figure.
Here is a basic example using the scripting matplotlib.pyplot
API:
```{.pyplot}
import matplotlib.pyplot as plt
plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```
Putting the above in input.md
, we can then generate the plot and embed it:
pandoc --filter pandoc-pyplot input.md --output output.html
or
pandoc --filter pandoc-pyplot input.md --output output.pdf
or any other output format you want.
LaTeX
The filter works slightly differently in LaTeX documents. In LaTeX, the minted
environment must be used, with the pyplot
class.
\begin{minted}{pyplot}
import matplotlib.pyplot as plt
plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
\end{minted}
Note that you do not need to have minted
installed.
Examples
There are more examples in the source repository, in the \examples
directory.
Features
Captions
You can also specify a caption for your image. This is done using the optional caption
parameter.
Markdown:
```{.pyplot caption="This is a simple figure"}
import matplotlib.pyplot as plt
plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```
LaTex:
\begin{minted}[caption=This is a simple figure]{pyplot}
import matplotlib.pyplot as plt
plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
\end{minted}
Caption formatting is either plain text or Markdown. LaTeX-style math is also support in captions (using dollar signs $...$).
Link to source code and high-resolution figure
In case of an output format that supports links (e.g. HTML), the embedded image generated by pandoc-pyplot
will be a link to the source code which was used to generate the file. Therefore, other people can see what Python code was used to create your figures. A high resolution image will be made available in a caption link.
(New in version 2.1.3.0) For cleaner output (e.g. PDF), you can turn this off via the links=false
key:
Markdown:
```{.pyplot links=false}
...
```
LaTex:
\begin{minted}[links=false]{pyplot}
...
\end{minted}
or via a configuration file.
Including scripts
If you find yourself always repeating some steps, inclusion of scripts is possible using the include
parameter. For example, if you want all plots to have the ggplot
style, you can write a very short preamble style.py
like so:
import matplotlib.pyplot as plt
plt.style.use('ggplot')
and include it in your document as follows:
```{.pyplot include=style.py}
plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```
Which is equivalent to writing the following markdown:
```{.pyplot}
import matplotlib.pyplot as plt
plt.style.use('ggplot')
plt.figure()
plt.plot([0,1,2,3,4], [1,2,3,4,5])
plt.title('This is an example figure')
```
The equivalent LaTeX usage is as follows:
\begin{minted}[include=style.py]{pyplot}
\end{minted}
This include
parameter is perfect for longer documents with many plots. Simply define the style you want in a separate script! You can also import packages this way, or define functions you often use.
Customization of figures beyond what is available in pandoc-pyplot
can also be done through the include
script. For example, if you wanted to figures with a black background, you can do so via matplotlib.pyplot.rcParams
:
import matplotlib.pyplot as plt
plt.rcParams['savefig.facecolor'] = 'k'
...
You can take a look at all available matplotlib
parameters here.
Multiple backends
(new in version 2.2.0.0) Both Matplotlib and Plotly are supported!
To render Plotly figures in Markdown:
```{.plotly caption="This is a Plotly figure"}
import plotly.graph_objects as go
figure = go.Figure(
data=[go.Bar(y=[2, 1, 3])],
)
Here is the LaTeX equivalent:
\begin{minted}[caption=This is a Plotly figure]{plotly}
import plotly.graph_objects as go
figure = go.Figure(
data=[go.Bar(y=[2, 1, 3])],
)
\end{minted}
pandoc-pyplot
will render and capture your figure automagically.
No wasted work
pandoc-pyplot
minimizes work, only generating figures if it absolutely must. Therefore, you can confidently run the filter on very large documents containing dozens of figures --- like a book or a thesis --- and only the figures which have changed will be re-generated.
Compatibility with pandoc-crossref
pandoc-crossref
is a pandoc filter that makes it effortless to cross-reference objects in Markdown documents.
You can use pandoc-crossref
in conjunction with pandoc-pyplot
for the ultimate figure-making pipeline. You can combine both in a figure like so:
```{#fig:myexample .pyplot caption="This is a caption"}
# Insert figure script here
```
As you can see in @fig:myexample, ...
If the above source is located in file myfile.md
, you can render the figure and references by applying pandoc-pyplot
first, and then pandoc-crossref
. For example:
pandoc --filter pandoc-pyplot --filter pandoc-crossref -i myfile.md -o myfile.html
Configurable
(New in version 2.1.0.0) To avoid repetition, pandoc-pyplot
can be configured using simple YAML files. pandoc-pyplot
will look for a .pandoc-pyplot.yml
file in the current working directory:
# You can specify any or all of the following parameters
interpreter: python36
directory: mydirectory/
include: mystyle.py
format: jpeg
links: false
dpi: 150 # Matplotlib only
tight_bbox: true # Matplotlib only
transparent: false # Matplotlib only
flags: [-O, -Wignore]
These values override the default values, which are equivalent to:
# Defaults if no configuration is provided.
# Note that the default interpreter name on MacOS and Unix is 'python3'
# and 'python' on Windows.
interpreter: python
flags: []
directory: generated/
format: png
links: true
dpi: 80
tight_bbox: false
transparent: false
Using pandoc-pyplot --write-example-config
will write the default configuration to a file .pandoc-pyplot.yml
, which you can then customize.
Configuration-only parameters
There are a few parameters that are only available via the configuration file .pandoc-pyplot.yml
:
interpreter
is the name of the interpreter to use. For example,interpreter: python36
;flags
is a list of strings, which are flags that are passed to the python interpreter. For example,flags: [-O, -Wignore]
;- (New in version 2.1.5.0)
tight_bbox
is a boolean that determines whether to usebbox_inches="tight"
or not when saving Matplotlib figures. For example,tight_bbox: true
. See here for details. This is ignored for Plotly figures. - (New in version 2.1.5.0)
transparent
is a boolean that determines whether to make Matplotlib figure background transparent or not. This is useful, for example, for displaying a plot on top of a colored background on a web page. High-resolution figures are not affected. For example,transparent: true
. This is ignored for Plotly figures.
Installation
Binaries
Windows binaries are available on GitHub. Place the executable in a location that is in your PATH to be able to call it.
If you can show me how to generate binaries for other platform using e.g. Azure Pipelines, let me know!
Installers (Windows)
Windows installers are made available thanks to Inno Setup. You can download them from the release page.
From Hackage/Stackage
pandoc-pyplot
is available on Hackage. Using the cabal-install
tool:
cabal update
cabal install pandoc-pyplot
Similarly, pandoc-pyplot
is available on Stackage:
stack update
stack install pandoc-pyplot
From source
Building from source can be done using stack
or cabal
:
git clone https://github.com/LaurentRDC/pandoc-pyplot
cd pandoc-pylot
stack install # Alternatively, `cabal install`
Running the filter
Requirements
This filter requires a Python interpreter and at least Matplotlib or Plotly installed. The name of the Python interpreter to use can be specified in a .pandoc-pyplot.yml
file; by default, pandoc-pyplot
will use the "python"
name on Windows, and "python3"
otherwise.
Use the filter with Pandoc as follows:
pandoc --filter pandoc-pyplot input.md --output output.html
in which case, the output is HTML. Another example with PDF output:
pandoc --filter pandoc-pyplot input.md --output output.pdf
Python exceptions will be printed to screen in case of a problem.
pandoc-pyplot
has a limited command-line interface. Take a look at the help available using the -h
or --help
argument:
pandoc-pyplot --help
Usage as a Haskell library
To include the functionality of pandoc-pyplot
in a Haskell package, you can use the makePlot :: Block -> IO Block
function (for single blocks) or plotTransform :: Pandoc -> IO Pandoc
function (for entire documents). Variations of these functions exist for more advanced configurations. Take a look at the documentation on Hackage.
Usage with Hakyll
This filter was originally designed to be used with Hakyll. In case you want to use the filter with your own Hakyll setup, you can use a transform function that works on entire documents:
import Text.Pandoc.Filter.Pyplot (plotTransform)
import Hakyll
-- Unsafe compiler is required because of the interaction
-- in IO (i.e. running an external Python script).
makePlotPandocCompiler :: Compiler (Item String)
makePlotPandocCompiler =
pandocCompilerWithTransformM
defaultHakyllReaderOptions
defaultHakyllWriterOptions
(unsafeCompiler . plotTransform)
The plotTransformWithConfig
is also available for a more configurable set-up.
Warning
Do not run this filter on unknown documents. There is nothing in pandoc-pyplot
that can stop a Python script from performing evil actions.