cleanrepo/README.md
Lines changed: 13 additions & 43 deletions
@@ -49,65 +49,35 @@ This command-line tool helps you clean up a DocFx-based content repo. It can:
 CleanRepo.exe --orphaned-images
 ```

-## Text to image examples
+## Image to text examples

-The text-to-image functionality supported in the `--catalog-images-with-text` and `--filter-images-for-text` options is provided by the [Tesseract](https://www.nuget.org/packages/tesseract/) NuGet package.
+The text-to-image functionality supported in the `--catalog-images-with-text` and `--filter-images-for-text` options is provided by the [Tesseract](https://www.nuget.org/packages/tesseract/) NuGet package.

 ### Get the Tesseract models

-You must determine which Tesseract models you want to use and install them on your system. Tesseract models are generated per operating system. Tesseract models come in a variety of sizes. You will also need to download the language data files for tesseract 4.0.0 or above from [tesseract-tessdata](https://github.com/tesseract-ocr/tessdata/). Use the `--ocr-model-directory` value to set the path.
+You must determine which Tesseract models you want to use and install them on your system. Tesseract models are generated per operating system. Tesseract models come in a variety of sizes. You also need to download the language data files for Tesseract 4.0.0 or later from [tesseract-tessdata](https://github.com/tesseract-ocr/tessdata/). Use the `--ocr-model-directory` value to set the path.

 ### Catalog images with text

-To catalog the images with text:
+To catalog all the images in a specified directory along with the text shown in each image:
"Value": "Function App\n\n\u00AE Overview\n\n\n\n| View Application Insights data G)\n\n\n\n\n\n\n\n\n\nActivity log Link to an Application Insights resource\n8. Access control (IAM)\n\u00A9 tes \u00A9 temepiseaieiin yt eb ise ea\n\n@ Diagnose and solve problems\n\n\u00A9 Microsoft Defender for Cloud @ totum Apptzation ihe of check that Applicaton nights OK ard the insramentaion key are removed rm your apliaton,\n\n\u0026 events (preview)\n\nFunctions O) \u00E9sarteg etiam caer toe Gorman Vier Tc home\nApplication Insights. You have the option to disable non-essential data collection, Learn more\n(A) Functions\n\u00A9 App keys\nChange your resource\nB App files\n\n\n\nDeployment\n\n= Deployment slots\n@ Deployment Center\nSettings\n\nHl Configuration\n\n\u0026\u0026 Authentication\n\n\u00AE Application insights\n\n\n"
-},
-]
-}
-```
+The output file is prefixed with `FilteredOcrImageFiles-`.
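
The "Get the Tesseract models" text in this change says to download the language data files and point `--ocr-model-directory` at them. As a minimal sketch, assuming that directory is expected to hold Tesseract's `<lang>.traineddata` files directly, the following Python helper lists which languages a directory provides. The function name `find_traineddata` is hypothetical and is not part of CleanRepo.

```python
from pathlib import Path


def find_traineddata(model_dir):
    """Return the sorted language codes of the .traineddata files in model_dir.

    Hypothetical helper for illustration; CleanRepo itself does not ship it.
    """
    directory = Path(model_dir)
    if not directory.is_dir():
        raise NotADirectoryError(f"Not a directory: {model_dir}")
    # Tesseract language data files are named <lang>.traineddata,
    # for example eng.traineddata for English.
    return sorted(p.stem for p in directory.glob("*.traineddata"))
```

An empty result would suggest that the path does not yet contain any downloaded language data, which may be worth checking before running the OCR options.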