Skip to content

Commit 95cb360

Browse files
authored
Merge pull request #4546 from rl-utility-man/patch-4
proposed addition of a df.agg stacked go.bar example
2 parents 4fc1402 + 666b601 commit 95cb360

File tree

1 file changed

+50
-0
lines changed

1 file changed

+50
-0
lines changed

doc/python/bar-charts.md

+50
Original file line numberDiff line numberDiff line change
@@ -304,6 +304,56 @@ fig.update_layout(barmode='stack')
304304
fig.show()
305305
```
306306

307+
### Stacked Bar Chart From Aggregating a DataFrame
308+
309+
Stacked bar charts are a powerful way to present results summarizing categories generated using the Pandas aggregate commands. `pandas.DataFrame.agg` produces a wide data set format incompatible with `px.bar`. Transposing and updating the indexes to achieve `px.bar` compatibility is a somewhat involved option. Here is one straightforward alternative, which presents the aggregated data as a stacked bar using plotly.graph_objects.
310+
```
311+
312+
from plotly import graph_objects as go
313+
import pandas as pd
314+
315+
# Get one year of gapminder data
316+
url = 'https://raw.githubusercontent.com/plotly/datasets/master/gapminderDataFiveYear.csv'
317+
df = pd.read_csv(url)
318+
df = df[df['year']==2007]
319+
df["gdp"]=df["pop"]*df['gdpPercap']
320+
321+
322+
# Build the summary of interest
323+
df_summarized = df.groupby("continent", observed=True).agg("sum").reset_index()
324+
325+
df_summarized["percent of world population"]=100*df_summarized["pop"]/df_summarized["pop"].sum()
326+
df_summarized["percent of world GDP"]=100*df_summarized["gdp"]/df_summarized["gdp"].sum()
327+
328+
329+
df = df_summarized[["continent",
330+
"percent of world population",
331+
"percent of world GDP",
332+
]]
333+
334+
# We now have a wide data frame, but it's in the opposite orientation from the one that px is designed to deal with.
335+
# Transposing it and rebuilding the indexes is an option, but iterating through the DF using graph objects is more succinct.
336+
337+
fig=go.Figure()
338+
for category in df_summarized["continent"].values:
339+
fig.add_trace(go.Bar(
340+
x=df.columns[1:],
341+
# We need to get a pandas series that contains just the values to graph;
342+
# We do so by selecting the right row, selecting the right columns
343+
# and then transposing and using iloc to convert to a series
344+
# Here, we assume that the bar element category variable is in column 0
345+
y=list(df.loc[df["continent"]==category][list(df.columns[1:])].transpose().iloc[:,0]),
346+
name=str(category)
347+
348+
349+
)
350+
)
351+
fig.update_layout(barmode="stack")
352+
353+
fig.show()
354+
```
355+
356+
307357
### Bar Chart with Hover Text
308358

309359
```python

0 commit comments

Comments
 (0)