You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: docs/publishing.rst
+15-5
Original file line number
Diff line number
Diff line change
@@ -143,13 +143,21 @@ Publish Set IDs will be used to organzize the exported XML files in the resultin
143
143
Published Subsets
144
144
=================
145
145
146
-
Published Subsets are user defined subsets of all currently published Records and Jobs in Combine. They are created by selecting what Publish Set Identifiers to include in the subset (optionally including all Jobs without a Publish Set Identifier). As Combine strives to be a single point of interaction for metadata harvesting, transformation, and publishing, it is expected that users may desire to expose only certain subsets of published records to downstream, non-Combine users. Published Subsets allow for this.
146
+
Published Subsets are user defined subsets of all currently published Records and Jobs in Combine. They are created by selecting a combination of:
147
+
148
+
- **Publish Set Identifiers** to include in the subset
149
+
- all published Jobs **without** a Publish Set Identifier
150
+
- **Organizations, Record Groups, and Jobs** where all *published* Jobs are included
151
+
152
+
As Combine strives to be a single point of interaction for metadata harvesting, transformation, and publishing, it is expected that users may desire to expose only certain subsets of published records to downstream, non-Combine users. Published Subsets allow for this.
147
153
148
154
For example, imagine a single instance of Combine that is used to harvest, transform, QA, and publish metadata in support of a DPLA service hub. It may be convenient to *also* use this instance of Combine in support of a digital collection state portal. While there may be overlap in what Records and Jobs are published to both DPLA and the state portal, there may be some metadata records that should only propagate to one, but not the other.
149
155
150
156
By default, the built-in OAI-PMH server, and flat file exports, expose *all* published Records in Combine. For many use cases, this might be perfectly acceptable. Or, it may be such that careful use of Publish Set Identifiers -- which translate directly to OAI sets -- may be sufficient for managing that downstream consumers only harvest apporpriate records.
151
157
152
-
If, however, this is not the case, and more granular control is need, Published Subsets may be a good option for selecting subsets of published Records, which are then exposed through their own unique OAI-PMH endpoint, or flat file exports. In this scenario, the records bound for DPLA might be available through ``/oai/subset/dpla``, while the records bound for the state portal could be available for harvest from ``/oai/subset/state_portal``.
158
+
If, however, this is not the case, and more granular control is need, Published Subsets may be a good option for selecting subsets of published Records, which are then exposed through their own unique OAI-PMH endpoint, or flat file exports. In this scenario, the records bound for DPLA might be available through the subset ``dpla`` and the OAI endpoint ``/oai/subset/dpla``, while the records bound for the state portal could be available in the subset ``start_portal`` and available for OAI harvest from ``/oai/subset/state_portal``.
159
+
160
+
All Published Subsets also allow the normal exporting of Records (flat XML, S3, etc.).
153
161
154
162
155
163
Viewing Published Subsets
@@ -207,12 +215,14 @@ To create a Published Subset, click "Create Published Subset" at the bottom, whe
207
215
208
216
- This is where published sets are selected to include in this Published Subset. All or none may be included.
209
217
210
-
- **Include Records without Publish Set Identifier**
211
-
212
-
- This toggle will include Jobs/Records that have not been given a Publish Set Identifier in this Published Subset.
218
+
- **Select Organizations, Record Groups, and Jobs**
213
219
220
+
- Select Organizations, Record Groups, and Jobs to include in Published Subset. Selected Organizations and Record Groups will include all *published* Jobs that fall underneath. In the instance where only select Jobs from a Record Group are selected, only those Jobs will be included, *not* the entire Record Group.
221
+
- **Note:** This is particularly helpful if a user wants to add an entire Organization or Record Group to a subset, confident that all Jobs created or deleted, published or unpublished, will collect under this Published Subset.
214
222
223
+
- **Include Records without Publish Set Identifier**
215
224
225
+
- This toggle will include Jobs/Records that have not been given a Publish Set Identifier in this Published Subset.
0 commit comments