dmlc
diff --git a/‎doc/tutorials/categorical.rst‎
Lines changed: 18 additions & 6 deletions b/‎doc/tutorials/categorical.rst‎
Lines changed: 18 additions & 6 deletions
@@ -2,6 +2,12 @@
 Categorical Data
 ################
 
+**Contents**
+
+.. contents::
+  :backlinks: none
+  :local:
+
 Since version 1.5, XGBoost has support for categorical data.  For numerical data, the
 split condition is defined as :math:`value < threshold`, while for categorical data the
 split is defined depending on whether partitioning or onehot encoding is used. For
@@ -140,7 +146,7 @@ Auto-recoding (Data Consistency)
 
 .. versionchanged:: 3.1
 
-  Starting with XGBoost 3.1, the *Python* interface can perform automatic re-coding for
+  Starting with XGBoost 3.1, the **Python** interface can perform automatic re-coding for
   new inputs.
 
 XGBoost accepts parameters to indicate which feature is considered categorical, either
@@ -158,11 +164,11 @@ the input and hence cannot store it in the model. The mapping usually happens in
 users' data engineering pipeline. To ensure the correct result from XGBoost, users need to
 keep the pipeline for transforming data consistent across training and testing data.
 
-Starting with 3.1, the *Python* interface can remember the encoding and perform recoding
+Starting with 3.1, the **Python** interface can remember the encoding and perform recoding
 during inference and training continuation when the input is a dataframe (`pandas`,
 `cuDF`, `polars`, `pyarrow`, `modin`). The feature support focuses on basic usage. It has
-some restrictions on the types of inputs that can be accepted. First, category names
-must have one of the following types:
+some restrictions on the types of inputs that can be accepted. First, category names must
+have one of the following types:
 
 - string
 - integer, from 8-bit to 64-bit, both signed and unsigned are supported.
@@ -224,9 +230,15 @@ of using the native interface:
 No extra step is required for using the scikit-learn interface as long as the inputs are
 dataframes. During training continuation, XGBoost will either extract the categories from
 the previous model or use the categories from the new training dataset if the input model
-doesn't have the information.
+doesn't have the information. As a side note, users can inspect the content of the
+categories by exporting it to arrow arrays. This interface is still experimental:
+
+.. code-block:: python
+
+  categories = booster.get_categories(export_to_arrow=True)
+  print(categories.to_arrow())
 
-For R, the auto-recoding is not yet supported as of 3.1. To provide an example:
+For **R**, the auto-recoding is not yet supported as of 3.1. To provide an example:
 
 .. code-block:: R