# Change Streams

______________________________________________________________________

## Introduction

The YAML and JSON files in this directory are platform-independent tests that drivers can use to prove their conformance
to the Change Streams Spec.

Several prose tests, which are not easily expressed in YAML, are also presented in this file. Those tests will need to
be manually implemented by each driver.

### Subdirectories for Test Formats

This document describes the legacy format for change streams tests. Tests in this legacy format are located under
`./legacy/`.

New change streams tests should be written in the
[unified test format](../../unified-test-format/unified-test-format.md) and placed under `./unified/`.

## Spec Test Format

Each YAML file has the following keys:

- `database_name`: The default database
- `collection_name`: The default collection
- `database2_name`: Another database
- `collection2_name`: Another collection
- `tests`: An array of tests that are to be run independently of each other. Each test will have some of the following
  fields:
  - `description`: The name of the test.
  - `minServerVersion`: The minimum server version to run this test against. If not present, assume there is no minimum
    server version.
  - `maxServerVersion`: Reserved for later use
  - `failPoint`: Optional configureFailPoint command document to run to configure a fail point on the primary server.
  - `target`: The entity on which to run the change stream. Valid values are:
    - `collection`: Watch changes on collection `database_name.collection_name`
    - `database`: Watch changes on database `database_name`
    - `client`: Watch changes on the entire cluster
  - `topology`: An array of server topologies against which to run the test. Valid topologies are `single`,
    `replicaset`, `sharded`, and `load-balanced`.
  - `changeStreamPipeline`: An array of additional aggregation pipeline stages to add to the change stream
  - `changeStreamOptions`: Additional options to add to the changeStream
  - `operations`: Array of documents, each describing an operation. Each document has the following fields:
    - `database`: Database against which to run the operation
    - `collection`: Collection against which to run the operation
    - `name`: Name of the command to run
    - `arguments` (optional): Object of arguments for the command (ex: document to insert)
  - `expectations`: Optional list of command-started events in Extended JSON format
  - `result`: Document with ONE of the following fields:
    - `error`: Describes an error received during the test
    - `success`: An Extended JSON array of documents expected to be received from the changeStream
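
For illustration, a minimal sketch of consuming such a file in Python (assuming PyYAML is available; the file name is
hypothetical):

```python
import yaml  # assumes PyYAML is installed

with open("legacy/change-streams.yml") as f:  # hypothetical file name
    spec = yaml.safe_load(f)

for test in spec["tests"]:
    # Fields not present in a test fall back to the defaults described above.
    print(test["description"], test.get("minServerVersion", "(no minimum)"))
```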

## Spec Test Match Function

The definition of MATCH or MATCHES in the Spec Test Runner is as follows:

- MATCH takes two values, `expected` and `actual`
- Notation is "Assert \[actual\] MATCHES \[expected\]"
- Assertion passes if `expected` is a subset of `actual`, with the value `42` acting as a placeholder for "any value"

Pseudocode implementation of `actual` MATCHES `expected`:

```text
If expected is "42" or 42:
    Assert that actual exists (is not null or undefined)
Else:
    Assert that actual is of the same JSON type as expected
    If expected is a JSON array:
        For every idx/value in expected:
            Assert that actual[idx] MATCHES value
    Else if expected is a JSON object:
        For every key/value in expected:
            Assert that actual[key] MATCHES value
    Else:
        Assert that expected equals actual
```
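
A minimal sketch of this function in Python (illustrative, not normative; type checks follow Python's JSON mapping):

```python
def assert_matches(actual, expected):
    """Assert that `actual` MATCHES `expected` per the pseudocode above."""
    if expected == 42 or expected == "42":
        assert actual is not None  # any existing value is accepted
    elif isinstance(expected, list):
        assert isinstance(actual, list)
        for idx, value in enumerate(expected):
            assert_matches(actual[idx], value)
    elif isinstance(expected, dict):
        assert isinstance(actual, dict)
        for key, value in expected.items():
            # Extra keys in `actual` are allowed: expected is a subset.
            assert_matches(actual[key], value)
    else:
        assert type(actual) is type(expected) and actual == expected
```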

The expected values for `result.success` and `expectations` are written in Extended JSON. Drivers may adopt any of the
following approaches to comparisons, as long as they are consistent:

- Convert `actual` to Extended JSON and compare to `expected`
- Convert `expected` and `actual` to BSON, and compare them
- Convert `expected` and `actual` to native equivalents of JSON, and compare them
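
For example, the first approach might use pymongo's `bson.json_util` (a sketch):

```python
from bson import json_util

def normalize(doc):
    # Round-trip through Extended JSON so BSON-specific types (ObjectId,
    # Timestamp, ...) collapse to a single comparable representation.
    return json_util.loads(json_util.dumps(doc))
```

Both `actual` and `expected` can then be passed through `normalize` before applying the MATCH function above.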

## Spec Test Runner

Before running the tests

- Create a MongoClient `globalClient`, and connect to the server. When executing tests against a sharded cluster,
  `globalClient` must only connect to one mongos. This is because tests that set failpoints will only work consistently
  if both the `configureFailPoint` and failing commands are sent to the same mongos.

For each YAML file, for each element in `tests` (a condensed sketch of this procedure appears after these steps):

- If `topology` does not include the topology of the server instance(s), skip this test.
- Use `globalClient` to
  - Drop the database `database_name`
  - Drop the database `database2_name`
  - Create the database `database_name` and the collection `database_name.collection_name`
  - Create the database `database2_name` and the collection `database2_name.collection2_name`
  - If the `failPoint` field is present, configure the fail point on the primary server. See
    [Server Fail Point](../../transactions/tests/legacy-test-format.md#server-fail-point) in the Transactions spec test
    documentation for more information.
- Create a new MongoClient `client`
- Begin monitoring all APM events for `client`. (If the driver uses global listeners, filter out all events that do not
  originate with `client`). Filter out any "internal" commands (e.g. `hello` or legacy hello)
- Using `client`, create a changeStream `changeStream` against the specified `target`. Use `changeStreamPipeline` and
  `changeStreamOptions` if they are non-empty. Capture any error.
- If there was no error, use `globalClient` and run every operation in `operations` in serial against the server until
  all operations have been executed or an error is thrown. Capture any error.
- If there was no error and `result.error` is set, iterate `changeStream` once and capture any error.
- If there was no error and `result.success` is non-empty, iterate `changeStream` until it returns as many changes as
  there are elements in the `result.success` array or an error is thrown. Capture any error.
- Close `changeStream`
- If there was an error:
  - Assert that an error was expected for the test.
  - Assert that the error MATCHES `result.error`
- Else:
  - Assert that no error was expected for the test
  - Assert that the changes received from `changeStream` MATCH the results in `result.success`
- If there are any `expectations`
  - For each (`expected`, `idx`) in `expectations`
    - If `actual[idx]` is a `killCursors` event, skip it and move to `actual[idx+1]`.
    - Else assert that `actual[idx]` MATCHES `expected`
  - Note: the change stream test command event expectations cover a prefix subset of all command events published by the
    driver. The test runner MUST verify that, if there are N expectations, the first N events published by the driver
    match the expectations, and MUST NOT inspect any subsequent events published by the driver.
- Close the MongoClient `client`

After running all tests

- Close the MongoClient `globalClient`
- Drop database `database_name`
- Drop database `database2_name`
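
A condensed sketch of the per-test portion of this procedure (assuming pymongo; `run_operation` and the APM listener
are hypothetical helpers, and spec option keys may need camelCase-to-snake_case translation):

```python
from pymongo import MongoClient
from pymongo.errors import PyMongoError

def run_spec_test(spec, test, global_client, uri):
    # Reset server state with the global client.
    db1, db2 = spec["database_name"], spec["database2_name"]
    global_client.drop_database(db1)
    global_client.drop_database(db2)
    global_client[db1].create_collection(spec["collection_name"])
    global_client[db2].create_collection(spec["collection2_name"])
    if "failPoint" in test:
        global_client.admin.command(test["failPoint"])

    client = MongoClient(uri)  # register the APM listener here (omitted)
    targets = {
        "collection": client[db1][spec["collection_name"]],
        "database": client[db1],
        "client": client,
    }
    changes, error = [], None
    try:
        stream = targets[test["target"]].watch(
            test.get("changeStreamPipeline", []),
            **test.get("changeStreamOptions", {}),
        )
        for op in test.get("operations", []):
            coll = global_client[op["database"]][op["collection"]]
            # run_operation maps spec names (e.g. "insertOne") onto driver
            # methods (e.g. insert_one); the mapping is omitted here.
            run_operation(coll, op["name"], op.get("arguments", {}))
        if "error" in test["result"]:
            stream.try_next()  # a single iteration may surface the error
        expected = test["result"].get("success", [])
        while len(changes) < len(expected):
            changes.append(stream.next())
        stream.close()
    except PyMongoError as exc:
        error = exc
    # Compare `error`/`changes` against test["result"], and the captured
    # command-started events against test.get("expectations"), using MATCHES.
    client.close()
```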

### Iterating the Change Stream

Although synchronous drivers must provide a
[non-blocking mode of iteration](../change-streams.md#not-blocking-on-iteration), asynchronous drivers may not have such
a mechanism. Those drivers with only a blocking mode of iteration should be careful not to iterate the change stream
unnecessarily, as doing so could cause the test runner to block indefinitely. For this reason, the test runner procedure
above advises drivers to take a conservative approach to iteration.

If the test expects an error and one was not thrown by either creating the change stream or executing the test's
operations, iterating the change stream once allows for an error to be thrown by a `getMore` command. If the test does
not expect any error, the change stream should be iterated only until it returns as many result documents as are
expected by the test.
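
In pymongo, for instance, the non-blocking `try_next` supports both cases (a sketch; `stream` and `expected_changes`
are assumed from the surrounding test):

```python
from pymongo.errors import PyMongoError

# Expecting an error: a single non-blocking iteration lets a failing
# getMore surface without risking an indefinite block.
try:
    stream.try_next()  # returns None instead of blocking on an empty batch
except PyMongoError as exc:
    error = exc

# Expecting results: stop as soon as the expected number has arrived.
changes = [stream.next() for _ in range(len(expected_changes))]
```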

### Testing on Sharded Clusters

When writing data on sharded clusters, majority-committed data does not always show up in the response of the first
`getMore` command after the data is written. This is because in sharded clusters, no data from shard A may be returned
until all other shards report an entry that sorts after the change in shard A.

To account for this, drivers MUST NOT rely on change stream documents being returned in particular batches. For
example, if expecting two documents in a change stream, these may not be part of the same `getMore` response, or even
of two consecutive `getMore` responses. Drivers MUST allow for a `getMore` to produce empty batches when testing on a
sharded cluster. By default, this can take up to 10 seconds, but it can be controlled by enabling the
`writePeriodicNoops` server parameter and configuring the `periodicNoopIntervalSecs` parameter. Choosing lower values
allows for running change stream tests with smaller timeouts.
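
One way to honor this is a deadline-based collection loop (a sketch, assuming pymongo's non-blocking `try_next`):

```python
import time

def collect_changes(stream, count, timeout_secs=30):
    """Gather `count` changes, tolerating getMore calls that return empty batches."""
    changes, deadline = [], time.monotonic() + timeout_secs
    while len(changes) < count and time.monotonic() < deadline:
        change = stream.try_next()  # None whenever a batch came back empty
        if change is not None:
            changes.append(change)
    return changes
```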

## Prose Tests

The following tests have not yet been automated, but MUST still be tested. All tests SHOULD be run on both replica sets
and sharded clusters unless otherwise specified:

1. `ChangeStream` must continuously track the last seen `resumeToken`
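
   For example, a minimal sketch of this check (assuming pymongo; the collection `coll` is illustrative):

   ```python
   with coll.watch() as stream:
       coll.insert_one({"x": 1})
       stream.next()
       first_token = stream.resume_token
       coll.insert_one({"x": 2})
       stream.next()
       # The cached resume token must advance with each returned event.
       assert stream.resume_token != first_token
   ```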

2. `ChangeStream` will throw an exception if the server response is missing the resume token (if wire version is \< 8,
   this is a driver-side error; for 8+, this is a server-side error)

3. After receiving a `resumeToken`, `ChangeStream` will automatically resume one time on a resumable error with the
   initial pipeline and options, except for the addition/update of a `resumeToken`.

4. `ChangeStream` will not attempt to resume on any error encountered while executing an `aggregate` command. Note that
   retryable reads may retry `aggregate` commands. Drivers should be careful to distinguish retries from resume
   attempts. Alternatively, drivers may specify `retryReads=false` or avoid using a
   [retryable error](../../retryable-reads/retryable-reads.md#retryable-error) for this test.

5. **Removed**

6. `ChangeStream` will perform server selection before attempting to resume, using the initial `readPreference`

7. Ensure that a cursor returned from an aggregate command with a cursor id and an initial empty batch is not closed on
   the driver side.

8. The `killCursors` command sent during the "Resume Process" must not be allowed to throw an exception.

9. `$changeStream` stage for `ChangeStream` against a server `>=4.0` and `<4.0.7` that has not received any results yet
   MUST include a `startAtOperationTime` option when resuming a change stream.

10. **Removed**

11. For a `ChangeStream` under these conditions:

    - Running against a server `>=4.0.7`.
    - The batch is empty or has been iterated to the last document.

    Expected result:

    - `getResumeToken` must return the `postBatchResumeToken` from the current command response.
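
    A hedged sketch of this check (assuming pymongo; the listener below is a hypothetical helper that records the
    `postBatchResumeToken` from command replies):

    ```python
    from pymongo import MongoClient, monitoring

    class PBRTListener(monitoring.CommandListener):
        """Record the latest postBatchResumeToken seen in command replies."""
        def __init__(self):
            self.pbrt = None
        def started(self, event): pass
        def failed(self, event): pass
        def succeeded(self, event):
            if event.command_name in ("aggregate", "getMore"):
                cursor = event.reply.get("cursor", {})
                self.pbrt = cursor.get("postBatchResumeToken", self.pbrt)

    listener = PBRTListener()
    client = MongoClient(event_listeners=[listener])
    stream = client["db"]["coll"].watch()
    assert stream.try_next() is None             # quiet collection: empty batch
    assert stream.resume_token == listener.pbrt  # matches the server's PBRT
    ```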

12. For a `ChangeStream` under these conditions:

    - Running against a server `<4.0.7`.
    - The batch is empty or has been iterated to the last document.

    Expected result:

    - `getResumeToken` must return the `_id` of the last document returned if one exists.
    - `getResumeToken` must return `resumeAfter` from the initial aggregate if the option was specified.
    - If `resumeAfter` was not specified, the `getResumeToken` result must be empty.

13. For a `ChangeStream` under these conditions:

    - The batch is not empty.
    - The batch has been iterated up to but not including the last element.

    Expected result:

    - `getResumeToken` must return the `_id` of the previous document returned.

14. For a `ChangeStream` under these conditions:

    - The batch is not empty.
    - The batch hasn't been iterated at all.
    - Only the initial `aggregate` command has been executed.

    Expected result:

    - `getResumeToken` must return `startAfter` from the initial aggregate if the option was specified.
    - `getResumeToken` must return `resumeAfter` from the initial aggregate if the option was specified.
    - If neither the `startAfter` nor `resumeAfter` options were specified, the `getResumeToken` result must be empty.

    Note that this test cannot be run against sharded topologies because in that case the initial `aggregate` command
    only establishes cursors on the shards and always returns an empty `firstBatch`.

15. **Removed**

16. **Removed**

17. `$changeStream` stage for `ChangeStream` started with `startAfter` against a server `>=4.1.1` that has not received
    any results yet MUST include a `startAfter` option and MUST NOT include a `resumeAfter` option when resuming a
    change stream.

18. `$changeStream` stage for `ChangeStream` started with `startAfter` against a server `>=4.1.1` that has received at
    least one result MUST include a `resumeAfter` option and MUST NOT include a `startAfter` option when resuming a
    change stream.
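
    For tests 17 and 18, one hedged approach (assuming pymongo, a command listener capturing `aggregate` commands, and
    a previously saved resume token `saved_token`; the `failCommand` fail point is assumed available on the server):

    ```python
    # Force exactly one resumable error (CursorNotFound) on the next getMore.
    client.admin.command({
        "configureFailPoint": "failCommand",
        "mode": {"times": 1},
        "data": {"failCommands": ["getMore"], "errorCode": 43},
    })
    stream = coll.watch(start_after=saved_token)
    stream.try_next()  # drives the failing getMore and the automatic resume
    # Inspect the captured aggregate commands: before any result is received,
    # the resuming $changeStream must carry startAfter (test 17); after at
    # least one result, it must carry resumeAfter instead (test 18).
    ```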

19. Validate that large `ChangeStream` events are split when using `$changeStreamSplitLargeEvent`:

    1. Run only against servers `>=6.0.9 && <6.1` or `>=7.0`.
    2. Create a new collection `C` with `changeStreamPreAndPostImages` enabled.
    3. Insert into `C` a document at least 10mb in size, e.g. `{ "value": "q"*10*1024*1024 }`
    4. Create a change stream `S` by calling `watch` on `C` with pipeline
       `[{ "$changeStreamSplitLargeEvent": {} }]` and `fullDocumentBeforeChange=required`.
    5. Call `updateOne` on `C` with an empty `query` and an update setting the field to a new large value, e.g.
       `{ "$set": { "value": "z"*10*1024*1024 } }`.
    6. Collect two events from `S`.
    7. Assert that the events collected have `splitEvent` fields `{ "fragment": 1, "of": 2 }` and
       `{ "fragment": 2, "of": 2 }`, in that order.
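
    A condensed sketch of these steps (assuming pymongo; `db` is a handle to the test database):

    ```python
    db.create_collection("C", changeStreamPreAndPostImages={"enabled": True})
    coll = db["C"]
    coll.insert_one({"value": "q" * 10 * 1024 * 1024})
    with coll.watch(
        [{"$changeStreamSplitLargeEvent": {}}],
        full_document_before_change="required",
    ) as stream:
        coll.update_one({}, {"$set": {"value": "z" * 10 * 1024 * 1024}})
        first, second = stream.next(), stream.next()
        assert first["splitEvent"] == {"fragment": 1, "of": 2}
        assert second["splitEvent"] == {"fragment": 2, "of": 2}
    ```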