Skip to content

Commit 0bae2e0

Browse files
authored
Merge pull request #154 from clamsproject/register/0-spacy-wrapper.v1.2
App Submitted - spacy-wrapper.v1.2
2 parents 392074f + 5f0953d commit 0bae2e0

File tree

5 files changed

+167
-15
lines changed

5 files changed

+167
-15
lines changed
+85
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,85 @@
1+
---
2+
layout: posts
3+
classes: wide
4+
title: "CLAMS wrapper for spaCy NLP (v1.2)"
5+
date: 2024-06-11T12:30:19+00:00
6+
---
7+
## About this version
8+
9+
- Submitter: [marcverhagen](https://github.com/marcverhagen)
10+
- Submission Time: 2024-06-11T12:30:19+00:00
11+
- Prebuilt Container Image: [ghcr.io/clamsproject/app-spacy-wrapper:v1.2](https://github.com/clamsproject/app-spacy-wrapper/pkgs/container/app-spacy-wrapper/v1.2)
12+
- Release Notes
13+
14+
> Bumping Python SDK version, bug fixes and documentation updates
15+
> - Updated to clams-python 1.2.2
16+
> - Fixed token length (issue #30)
17+
> - Fixed problems with the pretokenized parameter (issue #32)
18+
> - Various documentation fixes.
19+
20+
## About this app (See raw [metadata.json](metadata.json))
21+
22+
**Apply spaCy NLP to all text documents in a MMIF file.**
23+
24+
- App ID: [http://apps.clams.ai/spacy-wrapper/v1.2](http://apps.clams.ai/spacy-wrapper/v1.2)
25+
- App License: Apache 2.0
26+
- Source Repository: [https://github.com/clamsproject/app-spacy-wrapper](https://github.com/clamsproject/app-spacy-wrapper) ([source tree of the submitted version](https://github.com/clamsproject/app-spacy-wrapper/tree/v1.2))
27+
- Analyzer Version: 3.6
28+
- Analyzer License: MIT
29+
30+
31+
#### Inputs
32+
(**Note**: "*" as a property value means that the property is required but can be any value.)
33+
34+
- [http://mmif.clams.ai/vocabulary/TextDocument/v1](http://mmif.clams.ai/vocabulary/TextDocument/v1) (required)
35+
(of any properties)
36+
37+
- [http://vocab.lappsgrid.org/Token](http://vocab.lappsgrid.org/Token)
38+
(of any properties)
39+
40+
41+
42+
#### Configurable Parameters
43+
(**Note**: _Multivalued_ means the parameter can have one or more values.)
44+
45+
- `pretokenized`: optional, defaults to `false`
46+
47+
- Type: boolean
48+
- Multivalued: False
49+
- Choices: **_`false`_**, `true`
50+
51+
52+
> Boolean parameter to set the app to use existing tokenization, if available, for text documents for NLP processing. Useful to process ASR documents, for example.
53+
- `pretty`: optional, defaults to `false`
54+
55+
- Type: boolean
56+
- Multivalued: False
57+
- Choices: **_`false`_**, `true`
58+
59+
60+
> The JSON body of the HTTP response will be re-formatted with 2-space indentation
61+
62+
63+
#### Outputs
64+
(**Note**: "*" as a property value means that the property is required but can be any value.)
65+
66+
(**Note**: Not all output annotations are always generated.)
67+
68+
- [http://vocab.lappsgrid.org/Token](http://vocab.lappsgrid.org/Token)
69+
(of any properties)
70+
71+
- [http://vocab.lappsgrid.org/Token#pos](http://vocab.lappsgrid.org/Token#pos)
72+
(of any properties)
73+
74+
- [http://vocab.lappsgrid.org/Token#lemma](http://vocab.lappsgrid.org/Token#lemma)
75+
(of any properties)
76+
77+
- [http://vocab.lappsgrid.org/NounChunk](http://vocab.lappsgrid.org/NounChunk)
78+
(of any properties)
79+
80+
- [http://vocab.lappsgrid.org/Sentence](http://vocab.lappsgrid.org/Sentence)
81+
(of any properties)
82+
83+
- [http://vocab.lappsgrid.org/NamedEntity](http://vocab.lappsgrid.org/NamedEntity)
84+
(of any properties)
85+
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,57 @@
1+
{
2+
"name": "CLAMS wrapper for spaCy NLP",
3+
"description": "Apply spaCy NLP to all text documents in a MMIF file.",
4+
"app_version": "v1.2",
5+
"mmif_version": "1.0.4",
6+
"analyzer_version": "3.6",
7+
"app_license": "Apache 2.0",
8+
"analyzer_license": "MIT",
9+
"identifier": "http://apps.clams.ai/spacy-wrapper/v1.2",
10+
"url": "https://github.com/clamsproject/app-spacy-wrapper",
11+
"input": [
12+
{
13+
"@type": "http://mmif.clams.ai/vocabulary/TextDocument/v1",
14+
"required": true
15+
},
16+
{
17+
"@type": "http://vocab.lappsgrid.org/Token",
18+
"required": false
19+
}
20+
],
21+
"output": [
22+
{
23+
"@type": "http://vocab.lappsgrid.org/Token"
24+
},
25+
{
26+
"@type": "http://vocab.lappsgrid.org/Token#pos"
27+
},
28+
{
29+
"@type": "http://vocab.lappsgrid.org/Token#lemma"
30+
},
31+
{
32+
"@type": "http://vocab.lappsgrid.org/NounChunk"
33+
},
34+
{
35+
"@type": "http://vocab.lappsgrid.org/Sentence"
36+
},
37+
{
38+
"@type": "http://vocab.lappsgrid.org/NamedEntity"
39+
}
40+
],
41+
"parameters": [
42+
{
43+
"name": "pretokenized",
44+
"description": "Boolean parameter to set the app to use existing tokenization, if available, for text documents for NLP processing. Useful to process ASR documents, for example.",
45+
"type": "boolean",
46+
"default": false,
47+
"multivalued": false
48+
},
49+
{
50+
"name": "pretty",
51+
"description": "The JSON body of the HTTP response will be re-formatted with 2-space indentation",
52+
"type": "boolean",
53+
"default": false,
54+
"multivalued": false
55+
}
56+
]
57+
}
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
{
2+
"time": "2024-06-11T12:30:19+00:00",
3+
"submitter": "marcverhagen",
4+
"image": "ghcr.io/clamsproject/app-spacy-wrapper:v1.2",
5+
"releasenotes": "Bumping Python SDK version, bug fixes and documentation updates\n\n- Updated to clams-python 1.2.2\n- Fixed token length (issue #30)\n- Fixed problems with the pretokenized parameter (issue #32)\n- Various documentation fixes.\n\n"
6+
}

docs/_data/app-index.json

+18-14
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,22 @@
11
{
2+
"http://apps.clams.ai/spacy-wrapper": {
3+
"description": "Apply spaCy NLP to all text documents in a MMIF file.",
4+
"latest_update": "2024-06-11T12:30:19+00:00",
5+
"versions": [
6+
[
7+
"v1.2",
8+
"marcverhagen"
9+
],
10+
[
11+
"v1.1",
12+
"keighrim"
13+
],
14+
[
15+
"v1",
16+
"keighrim"
17+
]
18+
]
19+
},
220
"http://apps.clams.ai/llava-captioner": {
321
"description": "Applies llava to video frames.",
422
"latest_update": "2024-05-23T19:38:32+00:00",
@@ -315,20 +333,6 @@
315333
]
316334
]
317335
},
318-
"http://apps.clams.ai/spacy-wrapper": {
319-
"description": "Apply spaCy NLP to all text documents in a MMIF file.",
320-
"latest_update": "2023-07-24T17:39:56+00:00",
321-
"versions": [
322-
[
323-
"v1.1",
324-
"keighrim"
325-
],
326-
[
327-
"v1",
328-
"keighrim"
329-
]
330-
]
331-
},
332336
"http://apps.clams.ai/brandeis-acs-wrapper": {
333337
"description": "Brandeis Acoustic Classification & Segmentation (ACS) is a audio segmentation tool developed at Brandeis Lab for Linguistics and Computation. The original software can be found at https://github.com/brandeis-llc/acoustic-classification-segmentation .",
334338
"latest_update": "2023-07-24T05:42:05+00:00",

docs/_data/apps.json

+1-1
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)