
Commit 4ccc45d

Author: clams-bot
adding metadata of swt-detection.v7.1
1 parent 0e0414f, commit 4ccc45d

File tree: 5 files changed (+383 -2 lines)

New file (+180 lines):
---
layout: posts
classes: wide
title: "Scenes-with-text Detection (v7.1)"
date: 2024-11-25T18:27:38+00:00
---

## About this version

- Submitter: [keighrim](https://github.com/keighrim)
- Submission Time: 2024-11-25T18:27:38+00:00
- Prebuilt Container Image: [ghcr.io/clamsproject/app-swt-detection:v7.1](https://github.com/clamsproject/app-swt-detection/pkgs/container/app-swt-detection/v7.1)
- Release Notes

> Release with newly trained models:
> - training data is expanded with [new annotations](https://github.com/clamsproject/aapb-annotations/pull/98).
> - label `U` is added; the total number of "raw" labels is now 18.
> - in addition to `convnext_lg` and `convnext_tiny`, `convnext_small`-based models are added. The default is now the `convnext_small` model.
## About this app (See raw [metadata.json](metadata.json))

**Detects scenes with text, like slates, chyrons and credits. This app can run in three modes, depending on `useClassifier`, `useStitcher` parameters. When `useClassifier=True`, it runs in the "TimePoint mode" and generates TimePoint annotations. When `useStitcher=True`, it runs in the "TimeFrame mode" and generates TimeFrame annotations based on existing TimePoint annotations -- if no TimePoint is found, it produces an error. By default, it runs in the 'both' mode and first generates TimePoint annotations and then TimeFrame annotations on them.**

- App ID: [http://apps.clams.ai/swt-detection/v7.1](http://apps.clams.ai/swt-detection/v7.1)
- App License: Apache 2.0
- Source Repository: [https://github.com/clamsproject/app-swt-detection](https://github.com/clamsproject/app-swt-detection) ([source tree of the submitted version](https://github.com/clamsproject/app-swt-detection/tree/v7.1))

#### Inputs

(**Note**: "*" as a property value means that the property is required but can be any value.)

- [http://mmif.clams.ai/vocabulary/VideoDocument/v1](http://mmif.clams.ai/vocabulary/VideoDocument/v1) (required)
  (of any properties)
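The three run modes described above reduce to a small decision rule. The following is a minimal sketch of that documented behavior only; `output_types` and its arguments are hypothetical stand-ins, not part of the app's API:

```python
def output_types(use_classifier=True, use_stitcher=True, has_existing_timepoints=False):
    """Illustrative sketch of the documented mode logic (not the app's code)."""
    produced = []
    if use_classifier:
        # "TimePoint mode": classify sampled frames into TimePoint annotations.
        produced.append("TimePoint")
    if use_stitcher:
        # "TimeFrame mode": stitching needs TimePoints, either freshly
        # classified in this run or already present in the input MMIF.
        if not use_classifier and not has_existing_timepoints:
            raise ValueError("TimeFrame mode requires existing TimePoint annotations")
        produced.append("TimeFrame")
    return produced

# Default 'both' mode: TimePoints are generated first, then TimeFrames on them.
print(output_types())  # → ['TimePoint', 'TimeFrame']
```

With `useStitcher=false` only TimePoints come out; with `useClassifier=false` the run fails unless the input already carries TimePoint annotations.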
#### Configurable Parameters

(**Note**: _Multivalued_ means the parameter can have one or more values.)

- `useClassifier`: optional, defaults to `true`

  - Type: boolean
  - Multivalued: False
  - Choices: `false`, **_`true`_**

  > Use the image classifier model to generate TimePoint annotations.

- `tpModelName`: optional, defaults to `convnext_small`

  - Type: string
  - Multivalued: False
  - Choices: `convnext_lg`, **_`convnext_small`_**, `convnext_tiny`

  > Model name to use for classification, only applies when `useClassifier=true`.

- `tpUsePosModel`: optional, defaults to `true`

  - Type: boolean
  - Multivalued: False
  - Choices: `false`, **_`true`_**

  > Use the model trained with positional features, only applies when `useClassifier=true`.

- `tpStartAt`: optional, defaults to `0`

  - Type: integer
  - Multivalued: False

  > Number of milliseconds into the video to start processing, only applies when `useClassifier=true`.

- `tpStopAt`: optional, defaults to `9223372036854775807`

  - Type: integer
  - Multivalued: False

  > Number of milliseconds into the video to stop processing, only applies when `useClassifier=true`.

- `tpSampleRate`: optional, defaults to `1000`

  - Type: integer
  - Multivalued: False

  > Milliseconds between sampled frames, only applies when `useClassifier=true`.
- `useStitcher`: optional, defaults to `true`

  - Type: boolean
  - Multivalued: False
  - Choices: `false`, **_`true`_**

  > Use the stitcher after classifying the TimePoints.

- `tfMinTPScore`: optional, defaults to `0.5`

  - Type: number
  - Multivalued: False

  > Minimum score for a TimePoint to be included in a TimeFrame. A lower value will include more TimePoints in the TimeFrame (increasing recall in exchange for precision). Only applies when `useStitcher=true`.

- `tfMinTFScore`: optional, defaults to `0.9`

  - Type: number
  - Multivalued: False

  > Minimum score for a TimeFrame. A lower value will include more TimeFrames in the output (increasing recall in exchange for precision). Only applies when `useStitcher=true`.

- `tfMinTFDuration`: optional, defaults to `5000`

  - Type: integer
  - Multivalued: False

  > Minimum duration of a TimeFrame in milliseconds, only applies when `useStitcher=true`.

- `tfAllowOverlap`: optional, defaults to `false`

  - Type: boolean
  - Multivalued: False
  - Choices: **_`false`_**, `true`

  > Allow overlapping time frames, only applies when `useStitcher=true`.

- `tfDynamicSceneLabels`: optional, defaults to `['credit', 'credits']`

  - Type: string
  - Multivalued: True

  > Labels that are considered dynamic scenes. For dynamic scenes, TimeFrame annotations contain multiple representative points to follow any changes in the scene. Only applies when `useStitcher=true`.

- `tfLabelMap`: optional, defaults to `[]`

  - Type: map
  - Multivalued: True

  > (See also `tfLabelMapPreset`, set `tfLabelMapPreset=nopreset` to make sure that a preset does not override `tfLabelMap` when using this) Mapping of a label in the input TimePoint annotations to a new label of the stitched TimeFrame annotations. Must be formatted as IN_LABEL:OUT_LABEL (with a colon). To pass multiple mappings, use this parameter multiple times. When two or more TP labels are mapped to one TF label, it essentially works as a "binning" operation. If no mapping is used, all the input labels are passed through, meaning no change in either the TP or TF labelset. However, when at least one label is mapped, all the other "unset" labels are mapped to the negative label (`-`) and if `-` does not exist in the TF labelset, it is added automatically. Only applies when `useStitcher=true`.
- `tfLabelMapPreset`: optional, defaults to `relaxed`

  - Type: string
  - Multivalued: False
  - Choices: `noprebin`, `nomap`, `strict`, `simpler`, `simple`, **_`relaxed`_**, `binary-bars`, `binary-slate`, `binary-chyron-strict`, `binary-chyron-relaxed`, `binary-credits`

  > (See also `tfLabelMap`) Preset alias of a label mapping. If not `nopreset`, this parameter will override the `tfLabelMap` parameter. Available presets are:<br/>- `noprebin`: []<br/>- `nomap`: []<br/>- `strict`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`I`:`Chyron-person`', '`N`:`Chyron-person`', '`C`:`Credits`', '`R`:`Credits`', '`M`:`Main`', '`O`:`Opening`', '`W`:`Opening`', '`Y`:`Chyron-other`', '`U`:`Chyron-other`', '`K`:`Chyron-other`', '`L`:`Other-text`', '`G`:`Other-text`', '`F`:`Other-text`', '`E`:`Other-text`', '`T`:`Other-text`']<br/>- `simpler`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`I`:`Chyron`', '`N`:`Chyron`', '`C`:`Credits`', '`R`:`Credits`']<br/>- `simple`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`I`:`Chyron-person`', '`N`:`Chyron-person`', '`C`:`Credits`', '`R`:`Credits`', '`M`:`Other-text`', '`O`:`Other-text`', '`W`:`Other-text`', '`Y`:`Other-text`', '`U`:`Other-text`', '`K`:`Other-text`', '`L`:`Other-text`', '`G`:`Other-text`', '`F`:`Other-text`', '`E`:`Other-text`', '`T`:`Other-text`']<br/>- `relaxed`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`Y`:`Chyron`', '`U`:`Chyron`', '`K`:`Chyron`', '`I`:`Chyron`', '`N`:`Chyron`', '`C`:`Credits`', '`R`:`Credits`', '`M`:`Other-text`', '`O`:`Other-text`', '`W`:`Other-text`', '`L`:`Other-text`', '`G`:`Other-text`', '`F`:`Other-text`', '`E`:`Other-text`', '`T`:`Other-text`']<br/>- `binary-bars`: ['`B`:`Bars`']<br/>- `binary-slate`: ['`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`']<br/>- `binary-chyron-strict`: ['`I`:`Chyron-person`', '`N`:`Chyron-person`']<br/>- `binary-chyron-relaxed`: ['`Y`:`Chyron`', '`U`:`Chyron`', '`K`:`Chyron`', '`I`:`Chyron`', '`N`:`Chyron`']<br/>- `binary-credits`: ['`C`:`Credits`', '`R`:`Credits`']<br/><br/> Only applies when `useStitcher=true`.

- `pretty`: optional, defaults to `false`

  - Type: boolean
  - Multivalued: False
  - Choices: **_`false`_**, `true`

  > The JSON body of the HTTP response will be re-formatted with 2-space indentation.

- `runningTime`: optional, defaults to `false`

  - Type: boolean
  - Multivalued: False
  - Choices: **_`false`_**, `true`

  > The running time of the app will be recorded in the view metadata.

- `hwFetch`: optional, defaults to `false`

  - Type: boolean
  - Multivalued: False
  - Choices: **_`false`_**, `true`

  > The hardware information (architecture, GPU and vRAM) will be recorded in the view metadata.
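To make the interaction of `tpStartAt`, `tpStopAt`, and `tpSampleRate` concrete, here is a toy sketch of which timestamps get sampled. `sampled_timestamps` is a hypothetical helper for illustration, not the app's code:

```python
def sampled_timestamps(duration_ms, start_at=0,
                       stop_at=9223372036854775807, sample_rate=1000):
    """Timestamps (ms) that would be sampled for classification.

    Frames are taken every `sample_rate` ms, from `start_at` up to the
    earlier of `stop_at` and the end of the video.
    """
    end = min(stop_at, duration_ms)
    return list(range(start_at, end, sample_rate))

# A 5-second video at the default 1000 ms rate yields 5 sample points:
print(sampled_timestamps(5000))  # → [0, 1000, 2000, 3000, 4000]
```

The very large `tpStopAt` default simply means "process to the end of the video" unless a smaller bound is given.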
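The three `tfMin*` thresholds can be illustrated with a toy stitcher that mirrors the descriptions above: TimePoints below `tfMinTPScore` break a candidate frame, and a frame is kept only if its score (here taken as the mean TimePoint score, an assumption) clears `tfMinTFScore` and its span clears `tfMinTFDuration`. This is an illustration of the thresholding, not the app's actual stitching algorithm:

```python
def stitch(timepoints, min_tp_score=0.5, min_tf_score=0.9, min_tf_duration=5000):
    """Toy stitcher over (timestamp_ms, score) pairs, in timestamp order."""
    frames, current = [], []
    for ts, score in timepoints:
        if score >= min_tp_score:
            current.append((ts, score))      # TimePoint joins the open frame
        elif current:
            frames.append(current)           # low score closes the frame
            current = []
    if current:
        frames.append(current)
    kept = []
    for frame in frames:
        mean = sum(s for _, s in frame) / len(frame)
        duration = frame[-1][0] - frame[0][0]
        if mean >= min_tf_score and duration >= min_tf_duration:
            kept.append((frame[0][0], frame[-1][0]))
    return kept

# Seven strong TimePoints (0..6000 ms) followed by a weak one form one frame:
tps = [(i * 1000, 0.95) for i in range(7)] + [(7000, 0.1)]
print(stitch(tps))  # → [(0, 6000)]
```

Lowering `tfMinTPScore` or `tfMinTFScore` admits more material (recall over precision), while raising `tfMinTFDuration` drops short frames entirely.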
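The binning and negative-label behavior described for `tfLabelMap` can be sketched as follows. This is an illustrative reimplementation of the documented semantics, not the app's source:

```python
def remap_labels(tp_labels, label_map):
    """Sketch of the documented tfLabelMap semantics.

    With an empty map, labels pass through unchanged; with a non-empty map,
    any unmapped ("unset") label falls into the negative label '-'.
    """
    if not label_map:
        return list(tp_labels)
    return [label_map.get(label, "-") for label in tp_labels]

# Binning: both I and N map to Chyron; the unmapped B becomes '-'.
print(remap_labels(["I", "N", "B"], {"I": "Chyron", "N": "Chyron"}))
# → ['Chyron', 'Chyron', '-']
```

Passing `I:Chyron` and `N:Chyron` on the command line twice (the parameter is multivalued) corresponds to the dict used above.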
#### Outputs

(**Note**: "*" as a property value means that the property is required but can be any value.)

(**Note**: Not all output annotations are always generated.)

- [http://mmif.clams.ai/vocabulary/TimeFrame/v5](http://mmif.clams.ai/vocabulary/TimeFrame/v5)
  - _timeUnit_ = "milliseconds"

- [http://mmif.clams.ai/vocabulary/TimePoint/v4](http://mmif.clams.ai/vocabulary/TimePoint/v4)
  - _timeUnit_ = "milliseconds"
  - _labelset_ = a list of ["B", "S", "I", "C", "R", "M", "O", "W", "N", "Y", "U", "K", "L", "G", "F", "E", "T", "P"]
New file (+191 lines):
{
  "name": "Scenes-with-text Detection",
  "description": "Detects scenes with text, like slates, chyrons and credits. This app can run in three modes, depending on `useClassifier`, `useStitcher` parameters. When `useClassifier=True`, it runs in the \"TimePoint mode\" and generates TimePoint annotations. When `useStitcher=True`, it runs in the \"TimeFrame mode\" and generates TimeFrame annotations based on existing TimePoint annotations -- if no TimePoint is found, it produces an error. By default, it runs in the 'both' mode and first generates TimePoint annotations and then TimeFrame annotations on them.",
  "app_version": "v7.1",
  "mmif_version": "1.0.5",
  "app_license": "Apache 2.0",
  "identifier": "http://apps.clams.ai/swt-detection/v7.1",
  "url": "https://github.com/clamsproject/app-swt-detection",
  "input": [
    {
      "@type": "http://mmif.clams.ai/vocabulary/VideoDocument/v1",
      "required": true
    }
  ],
  "output": [
    {
      "@type": "http://mmif.clams.ai/vocabulary/TimeFrame/v5",
      "properties": { "timeUnit": "milliseconds" }
    },
    {
      "@type": "http://mmif.clams.ai/vocabulary/TimePoint/v4",
      "properties": {
        "timeUnit": "milliseconds",
        "labelset": ["B", "S", "I", "C", "R", "M", "O", "W", "N", "Y", "U", "K", "L", "G", "F", "E", "T", "P"]
      }
    }
  ],
  "parameters": [
    {
      "name": "useClassifier",
      "description": "Use the image classifier model to generate TimePoint annotations.",
      "type": "boolean",
      "default": true,
      "multivalued": false
    },
    {
      "name": "tpModelName",
      "description": "Model name to use for classification, only applies when `useClassifier=true`.",
      "type": "string",
      "choices": ["convnext_lg", "convnext_small", "convnext_tiny"],
      "default": "convnext_small",
      "multivalued": false
    },
    {
      "name": "tpUsePosModel",
      "description": "Use the model trained with positional features, only applies when `useClassifier=true`.",
      "type": "boolean",
      "default": true,
      "multivalued": false
    },
    {
      "name": "tpStartAt",
      "description": "Number of milliseconds into the video to start processing, only applies when `useClassifier=true`.",
      "type": "integer",
      "default": 0,
      "multivalued": false
    },
    {
      "name": "tpStopAt",
      "description": "Number of milliseconds into the video to stop processing, only applies when `useClassifier=true`.",
      "type": "integer",
      "default": 9223372036854775807,
      "multivalued": false
    },
    {
      "name": "tpSampleRate",
      "description": "Milliseconds between sampled frames, only applies when `useClassifier=true`.",
      "type": "integer",
      "default": 1000,
      "multivalued": false
    },
    {
      "name": "useStitcher",
      "description": "Use the stitcher after classifying the TimePoints.",
      "type": "boolean",
      "default": true,
      "multivalued": false
    },
    {
      "name": "tfMinTPScore",
      "description": "Minimum score for a TimePoint to be included in a TimeFrame. A lower value will include more TimePoints in the TimeFrame (increasing recall in exchange for precision). Only applies when `useStitcher=true`.",
      "type": "number",
      "default": 0.5,
      "multivalued": false
    },
    {
      "name": "tfMinTFScore",
      "description": "Minimum score for a TimeFrame. A lower value will include more TimeFrames in the output (increasing recall in exchange for precision). Only applies when `useStitcher=true`",
      "type": "number",
      "default": 0.9,
      "multivalued": false
    },
    {
      "name": "tfMinTFDuration",
      "description": "Minimum duration of a TimeFrame in milliseconds, only applies when `useStitcher=true`.",
      "type": "integer",
      "default": 5000,
      "multivalued": false
    },
    {
      "name": "tfAllowOverlap",
      "description": "Allow overlapping time frames, only applies when `useStitcher=true`",
      "type": "boolean",
      "default": false,
      "multivalued": false
    },
    {
      "name": "tfDynamicSceneLabels",
      "description": "Labels that are considered dynamic scenes. For dynamic scenes, TimeFrame annotations contain multiple representative points to follow any changes in the scene. Only applies when `useStitcher=true`",
      "type": "string",
      "default": ["credit", "credits"],
      "multivalued": true
    },
    {
      "name": "tfLabelMap",
      "description": "(See also `tfLabelMapPreset`, set `tfLabelMapPreset=nopreset` to make sure that a preset does not override `tfLabelMap` when using this) Mapping of a label in the input TimePoint annotations to a new label of the stitched TimeFrame annotations. Must be formatted as IN_LABEL:OUT_LABEL (with a colon). To pass multiple mappings, use this parameter multiple times. When two+ TP labels are mapped to a TF label, it essentially works as a \"binning\" operation. If no mapping is used, all the input labels are passed-through, meaning no change in both TP & TF labelsets. However, when at least one label is mapped, all the other \"unset\" labels are mapped to the negative label (`-`) and if `-` does not exist in the TF labelset, it is added automatically. Only applies when `useStitcher=true`.",
      "type": "map",
      "default": [],
      "multivalued": true
    },
    {
      "name": "tfLabelMapPreset",
      "description": "(See also `tfLabelMap`) Preset alias of a label mapping. If not `nopreset`, this parameter will override the `tfLabelMap` parameter. Available presets are:\n- `noprebin`: []\n- `nomap`: []\n- `strict`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`I`:`Chyron-person`', '`N`:`Chyron-person`', '`C`:`Credits`', '`R`:`Credits`', '`M`:`Main`', '`O`:`Opening`', '`W`:`Opening`', '`Y`:`Chyron-other`', '`U`:`Chyron-other`', '`K`:`Chyron-other`', '`L`:`Other-text`', '`G`:`Other-text`', '`F`:`Other-text`', '`E`:`Other-text`', '`T`:`Other-text`']\n- `simpler`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`I`:`Chyron`', '`N`:`Chyron`', '`C`:`Credits`', '`R`:`Credits`']\n- `simple`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`I`:`Chyron-person`', '`N`:`Chyron-person`', '`C`:`Credits`', '`R`:`Credits`', '`M`:`Other-text`', '`O`:`Other-text`', '`W`:`Other-text`', '`Y`:`Other-text`', '`U`:`Other-text`', '`K`:`Other-text`', '`L`:`Other-text`', '`G`:`Other-text`', '`F`:`Other-text`', '`E`:`Other-text`', '`T`:`Other-text`']\n- `relaxed`: ['`B`:`Bars`', '`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`', '`Y`:`Chyron`', '`U`:`Chyron`', '`K`:`Chyron`', '`I`:`Chyron`', '`N`:`Chyron`', '`C`:`Credits`', '`R`:`Credits`', '`M`:`Other-text`', '`O`:`Other-text`', '`W`:`Other-text`', '`L`:`Other-text`', '`G`:`Other-text`', '`F`:`Other-text`', '`E`:`Other-text`', '`T`:`Other-text`']\n- `binary-bars`: ['`B`:`Bars`']\n- `binary-slate`: ['`S`:`Slate`', '`S:H`:`Slate`', '`S:C`:`Slate`', '`S:D`:`Slate`', '`S:B`:`Slate`', '`S:G`:`Slate`']\n- `binary-chyron-strict`: ['`I`:`Chyron-person`', '`N`:`Chyron-person`']\n- `binary-chyron-relaxed`: ['`Y`:`Chyron`', '`U`:`Chyron`', '`K`:`Chyron`', '`I`:`Chyron`', '`N`:`Chyron`']\n- `binary-credits`: ['`C`:`Credits`', '`R`:`Credits`']\n\n Only applies when `useStitcher=true`.",
      "type": "string",
      "choices": ["noprebin", "nomap", "strict", "simpler", "simple", "relaxed", "binary-bars", "binary-slate", "binary-chyron-strict", "binary-chyron-relaxed", "binary-credits"],
      "default": "relaxed",
      "multivalued": false
    },
    {
      "name": "pretty",
      "description": "The JSON body of the HTTP response will be re-formatted with 2-space indentation",
      "type": "boolean",
      "default": false,
      "multivalued": false
    },
    {
      "name": "runningTime",
      "description": "The running time of the app will be recorded in the view metadata",
      "type": "boolean",
      "default": false,
      "multivalued": false
    },
    {
      "name": "hwFetch",
      "description": "The hardware information (architecture, GPU and vRAM) will be recorded in the view metadata",
      "type": "boolean",
      "default": false,
      "multivalued": false
    }
  ]
}
New file (+6 lines):
{
  "time": "2024-11-25T18:27:38+00:00",
  "submitter": "keighrim",
  "image": "ghcr.io/clamsproject/app-swt-detection:v7.1",
  "releasenotes": "Release with newly trained models:\n\n- training data is expanded with [new annotations](https://github.com/clamsproject/aapb-annotations/pull/98).\n- label `U` is added, total number of \"raw\" labels is now 18.\n- in addition to `convnext_lg` and `convnext_tiny`, `convnext_small`-based models are added. The default is now the `convnext_small` model.\n\n"
}

docs/_data/app-index.json (+5 -1)

@@ -1,8 +1,12 @@
 {
   "http://apps.clams.ai/swt-detection": {
     "description": "Detects scenes with text, like slates, chyrons and credits.",
-    "latest_update": "2024-11-04T22:00:05+00:00",
+    "latest_update": "2024-11-25T18:27:38+00:00",
     "versions": [
+      [
+        "v7.1",
+        "keighrim"
+      ],
       [
         "v7.0",
         "keighrim"

docs/_data/apps.json (+1 -1): large diffs are not rendered by default.
