Improve table-style forest plot rendering and subgroup support by takua624 · Pull Request #122 · LSYS/forestplot

takua624 · 2026-04-02T20:10:17Z

Summary of changes

Fixed a row-rendering issue where some rows could be cut off or omitted depending on which annotation columns were displayed.
Fixed a header-rendering issue where annoteheaders could be present internally but not actually appear in the final plot.
Reworked y-axis row placement so plotting no longer depends on Matplotlib’s auto-generated categorical ticks, which could become inconsistent with the number of dataframe rows.
Fixed cases where duplicate y-label values caused rows to collapse or disappear silently.
Improved behavior for all-NaN inputs so the function can return an empty plot gracefully instead of failing.
Added support for weight-scaled markers, using marker size to reflect study weight more directly.
Added subtotal diamonds and subtotal statistics display for grouped analyses.
Added arrows for confidence intervals that extend beyond the plotting range.
Fixed x-axis/tick handling for cases where log scaling is needed even when the plot type is not captured cleanly by the previous MH/IV-based logic.

Motivation

The main motivation was to make the package more reliable for dense, publication-style forest plots where:

the number of plotted rows must match the dataframe exactly,
headers and subgroup summaries need to be displayed consistently,
duplicated study labels should not cause silent row loss,
and edge cases such as empty/all-NaN inputs should fail gracefully.

In my use case, these issues became apparent when generating large batches of forest plots from structured meta-analysis data. In particular, relying on auto-generated y-ticks or inferred axis limits was fragile when headers, duplicate labels, or unusual annotation columns were present. My changes make row placement more explicit and deterministic, which substantially improves robustness for complex plots

Notes

Some of these changes were motivated by reproducing real-world Cochrane forest plots, including subgroup summaries and weighted study markers. The fixes around missing headers, row/tick mismatches, duplicate labels, empty-plot handling, weight-based marker sizing, subtotal diamonds, subtotal statistics, and out-of-range CI arrows all came directly from those use cases.

determine ylim to actually reflect the number of rows needed to display the header, especially when there are few rows in the dataframe (e.g., 3 rows)

Handle dataframes with null values.

Add a parameter to enable drawing marker sizes proportional to study weights.

Let user specify which row(s) contains subtotal information, and draw a horizontal diamond to indicate the CI of the subtotal stats, rather than square&whiskers.

allows the user to specify stats of the total effect, and show underneath the total row.

ignore capitalization for rows containing subtotal stats and info

riginally, the y is specified as dataframe[yticklabel]. This works until there are duplicate values in the yticklabel column. In this case, pyplot skips the duplicated values without yielding any warning. When plotting, we should always specify numerical x-y coordinates!!!

enable color flagging rows with suspicious values

enable consistent padding width across rendereres

takua624 added 18 commits July 13, 2025 13:59

display header row properly

cc1f751

determine ylim to actually reflect the number of rows needed to display the header, especially when there are few rows in the dataframe (e.g., 3 rows)

Update graph_utils.py

09fc9ca

Handle dataframes with null values.

Update graph_utils.py

2776d9a

proportional marker size

2122130

Add a parameter to enable drawing marker sizes proportional to study weights.

total diamond

ca31e2d

Let user specify which row(s) contains subtotal information, and draw a horizontal diamond to indicate the CI of the subtotal stats, rather than square&whiskers.

adding total stats info

29b7596

allows the user to specify stats of the total effect, and show underneath the total row.

formatting

91fc1fe

ignore capitalization for rows containing subtotal stats and info

handle long plot titles (ylabels)

87e32a9

flagging

be26def

enable color flagging rows with suspicious values

Update graph_utils.py

a1fd9cf

try to solve memory leak

c92c81d

minor fix

088cbb7

padding problem

ae9b777

enable consistent padding width across rendereres

minor

6f3e421

minor

a19e617

minor

529314b

minor debugging

4aa680c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve table-style forest plot rendering and subgroup support#122

Improve table-style forest plot rendering and subgroup support#122
takua624 wants to merge 18 commits intoLSYS:mainfrom
takua624:main

takua624 commented Apr 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

takua624 commented Apr 2, 2026

Summary of changes

Motivation

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant