Skip to content

Conditional Distributions #140

@harriet-mason

Description

@harriet-mason

Hi, me again.

What is your stance on implementing conditional distributions in distributional? I assume you have had this conversation before, but I did search the GitHub issues for the word "conditional" before posting this, and nothing came up.

Specifically, I have been thinking about this example from the ggdibbler vignette:

library(ggdibbler)
library(ggplot2)
library(fable)
#> Loading required package: fabletools
#> Registered S3 method overwritten by 'tsibble':
#>   method               from 
#>   as_tibble.grouped_df dplyr
forecast <- as_tsibble(sunspot.year) |> 
  model(ARIMA(value)) |> 
  forecast(h = "10 years") 

ggplot(forecast) +
  geom_line_sample(aes(x = index, y = value),
                       times=100, alpha=0.5)

Created on 2026-01-01 with reprex v2.1.1

Since ggdibbler just assumes independence between distributions and joins them up with a line, the time series looks more erratic than it probably should. I have considered assuming a conditional distribution if the plot type (e.g. line plot) implies inexchangeability, but I actually don't even know how I would implement that. Conditional sampling would require some heavy-handed assumptions about the model and data that are not always justified.

Since the inexchangeability is a feature of the data itself (in this case, time series), I wonder if this would be better implemented in distributional?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions