Zeno v2 #166

Draft
wants to merge 303 commits into
base: main
303 commits
42bd8b2
add: embed resp redirection in URL struct
CorentinB Nov 20, 2024
c6a814e
hq: add finisher
equals215 Nov 20, 2024
47cbc24
amend last commit: forgot to cancel sub-routines context
equals215 Nov 20, 2024
0780536
hq: producer implemented and other fixes
equals215 Nov 20, 2024
bd60c3b
write normalizeURL function
willmhowes Nov 20, 2024
6e84df7
hq: reset the once when stopped
equals215 Nov 20, 2024
d48a301
finisher: add conditions ; global: update main
equals215 Nov 20, 2024
a9e3419
remove: seencheck on fresh item
CorentinB Nov 20, 2024
f25c2d9
main: handle second quit signal
equals215 Nov 20, 2024
227c38b
fix: order of archive tasks
CorentinB Nov 20, 2024
cc0d24e
hq: fixed the graceful stop
equals215 Nov 20, 2024
d1df4ae
fix: preprocessor
CorentinB Nov 20, 2024
2e0cd15
fix: postprocessor
CorentinB Nov 20, 2024
6aa4a80
fix: redirection
CorentinB Nov 20, 2024
6514f2c
hq: add debug logging and dereference the producer/finisher batch bef…
equals215 Nov 20, 2024
b071380
add: HTTP client(s) timeout setting
CorentinB Nov 20, 2024
7bed9fe
hq: clean package
equals215 Nov 20, 2024
678cfd3
global: normalized logging
equals215 Nov 20, 2024
96df4ce
fix: handle failed items
CorentinB Nov 20, 2024
d3920de
hq: automatically reset seeds ; global: remove seed error chan
equals215 Nov 20, 2024
8fdbf2c
global: enhanced concurrency and management of routines ; reactor: ad…
equals215 Nov 20, 2024
ba0049b
stats: increase stats where needed
equals215 Nov 20, 2024
1eb26e4
archiver: wait & close WARC writers
CorentinB Nov 21, 2024
805c337
turn String into a wrapper function for URLToString
willmhowes Nov 21, 2024
0c8d655
simple variable renaming
willmhowes Nov 21, 2024
ad24996
improve URL scheme handling, update TestNormalizeURL
willmhowes Nov 21, 2024
da22cf7
add: get url command
CorentinB Nov 22, 2024
64f1732
chore: upgrade warc lib
CorentinB Nov 22, 2024
825b39e
add: assets extraction
CorentinB Nov 22, 2024
cc6ec21
fix: seencheck
CorentinB Nov 22, 2024
461c914
fix: use URL type for seenchecking
CorentinB Nov 22, 2024
4dd2e5a
fix: normalizeURL
CorentinB Nov 22, 2024
6003df3
fix: assets handling
CorentinB Nov 22, 2024
ca79f9b
fix: remove URLs without path from assets
CorentinB Nov 22, 2024
5784280
chore: handle base URLs for child URLs properly
CorentinB Nov 22, 2024
8b86a26
chore: handle assets extraction layers
CorentinB Nov 22, 2024
398b237
fix: body consumption
CorentinB Nov 23, 2024
e6513cd
add: panic on URL request creation failure
CorentinB Nov 24, 2024
4fd7e2c
fix: only start seencheck DB if not using HQ
CorentinB Nov 24, 2024
a7c8286
fix: seencheck loop
CorentinB Nov 24, 2024
d1e6947
fix: seencheck failure handling
CorentinB Nov 24, 2024
bad6574
add: simple URLs deduplication before seenchecking
CorentinB Nov 24, 2024
5f40572
hq.finisher: send finished URLs along with ID
equals215 Nov 25, 2024
13d9c0a
add: protection against nil assets
CorentinB Nov 25, 2024
84a9309
add: remove WARCs temp dir on close
CorentinB Nov 25, 2024
569af63
fix: use proper directory for WARCs
CorentinB Nov 25, 2024
0e83a57
fix: properly remove URL from Children
CorentinB Nov 25, 2024
538f8c7
chore: rename Childs to Children
CorentinB Nov 25, 2024
e71e4b6
chore: rename Childs to Children
CorentinB Nov 25, 2024
6ef67b4
fix: do not process failed items
CorentinB Nov 25, 2024
29a1cfa
fix: item state through pipeline
CorentinB Nov 25, 2024
edb3e65
add: --disable-assets-capture
CorentinB Nov 26, 2024
e31bee7
add: spew dump
CorentinB Nov 26, 2024
e6219c4
fix: handle when item's URL can't be normalized
CorentinB Nov 26, 2024
4a70556
fix: don't report URL to HQ finisher when item has failed pre-processing
CorentinB Nov 26, 2024
259f655
fix: change hq-batch-size default value to 500
CorentinB Nov 26, 2024
9a1b8fd
fix: condition to skip URL reporting in HQ finisher
CorentinB Nov 26, 2024
c3de6e9
fix: remove temporary spew
CorentinB Nov 27, 2024
0f5a42b
Add pipeline control mechanism to instantiate, pause/resume and stop …
equals215 Nov 27, 2024
6d1ecd9
Add log config support (#171)
equals215 Nov 27, 2024
d8b684b
hq: fix consumer workers weren't canceled at stop and sending on a cl…
equals215 Nov 27, 2024
507421b
reactor: add frozen info message
equals215 Nov 27, 2024
783035a
hq: fix finisher workers weren't canceled at stop and sending on a cl…
equals215 Nov 27, 2024
279dc06
log: use default path if no path is given and create dir structure fo…
equals215 Nov 27, 2024
a0c6a85
hq: enhancement of the routines and channels synchronisation, all rou…
equals215 Nov 27, 2024
59bad08
log: add default config if config is nil (for tests)
equals215 Nov 28, 2024
8857967
gitignore adds
equals215 Nov 28, 2024
a13b67d
hq.consumer: add a check for duplicate items received from HQ
equals215 Nov 28, 2024
7730d01
reactor: add a check to see if inserted item is already present in th…
equals215 Nov 28, 2024
06a0967
hq.consumer: refactor a13b67d into a separate and unit tested function
equals215 Nov 28, 2024
5f48474
hq.consumerFetcher: fix a potential non-threadsafe pointer
equals215 Nov 28, 2024
805dc81
hq.consumer: forge a new item directly in consumerSender()
equals215 Nov 28, 2024
6119b20
models,archiver: fixed 2 race conditions with atomics
equals215 Nov 28, 2024
992cb4c
chore: upgrade gocrawlhq lib
CorentinB Nov 28, 2024
a65d342
hq.consumer: send values instead of pointers in urlBuffer
equals215 Nov 29, 2024
ca5e535
models.item: first pass of making item a complete tree structure with…
equals215 Nov 29, 2024
f2b4f14
models.item: compute max depth instead of storing it
equals215 Nov 29, 2024
887a5f5
models.item: add setters (untested)
equals215 Nov 29, 2024
2eeb6e8
models.item: add last functions needed (items, childs) and nature of …
equals215 Nov 29, 2024
6df0eed
models.item: made children operations thread-safe with a RWMutex
equals215 Nov 30, 2024
386720d
reactor: implement new Item type
equals215 Nov 30, 2024
c22eb5d
reactor: added a test condition to ensure insertedSeeds == consumedSeeds
equals215 Nov 30, 2024
a5cf8ee
hq.consumer: implement new Item type
equals215 Dec 1, 2024
db9a547
hq.finisher: implement new Item type
equals215 Dec 1, 2024
742f404
hq.producer: implement new Item type
equals215 Dec 1, 2024
be2c6cf
hq: cleaned package warnings (mostly comments)
equals215 Dec 1, 2024
7c9debb
controler: implement new Item type
equals215 Dec 1, 2024
6130730
controler: cleaned package warnings (comments)
equals215 Dec 1, 2024
fd417a7
models.item: add hasRedirection() and HasChildren() with tests
equals215 Dec 1, 2024
063fe12
models.item: test combined use of GetNodesAtLevel and GetMaxDepth
equals215 Dec 2, 2024
95ceff4
finisher: implement new Item type and adapt logic to it
equals215 Dec 2, 2024
54fa847
models.item: remove IsChild, renamed IsAChild to IsChild and edit IsS…
equals215 Dec 2, 2024
1e35f1a
models.item: add RemoveChild method
equals215 Dec 2, 2024
62d1770
models.item: add tree deduplication
equals215 Dec 4, 2024
1b90c4d
models.item: parse URL strings and use URL.String() in deduplication …
equals215 Dec 4, 2024
43805a4
models.item: add traverse function to enable complex scenarios where …
equals215 Dec 5, 2024
3bcf115
hq&seencheck: rewrote the seencheck logic to apply it to item tree ; …
equals215 Dec 5, 2024
0cd79d7
chore: upgrade warc & crawlhq lib
CorentinB Dec 9, 2024
351bc3a
preprocessor: add a panic if preprocessor receives a Failed or Comple…
equals215 Dec 9, 2024
5eb9e18
models.item: change the child and parent status in AddChild and add C…
equals215 Dec 9, 2024
d7b3477
models.item: AddChild check status collisions before adding the child…
equals215 Dec 9, 2024
817bb6c
models.item: add base field with getter and setter
equals215 Dec 9, 2024
6d7e75c
models.item: remove ItemPostProcessed as the status is superseded by …
equals215 Dec 9, 2024
75f7c42
fix: adapt archiver for tree-based items
CorentinB Dec 9, 2024
f49d70a
fix: adapt postprocessor for tree-based items
CorentinB Dec 9, 2024
41345e0
global: lots of bugs fixed...
equals215 Dec 13, 2024
2a6535d
hq&seencheck: dont seencheck the seed
equals215 Dec 16, 2024
7c8bcc1
stages: best effort to finish their work before stopping ; controler:…
equals215 Dec 17, 2024
e75658a
hq.consumer: discard items that cannot be parsed ; hq.finisher: popul…
equals215 Dec 17, 2024
50f33d9
hq.consumer: log the URL that get discarded
equals215 Dec 17, 2024
9a47211
hq: when stopping, reset the seeds still in the reactor state table i…
equals215 Dec 17, 2024
feb99e0
preprocessor: dont panic when request couldn't be made, instead log t…
equals215 Dec 18, 2024
96e48f3
hq.consumer: debug log new items inserted to reactor
equals215 Dec 18, 2024
aa03507
better panic handling and logging for items already present in reactor
equals215 Dec 18, 2024
1f4ea3b
hq.consumer: ensure that slices are nil-ed and add a check for the sa…
equals215 Dec 18, 2024
61e3fef
hq.consumer: make the urlBuffer a chan of pointers
equals215 Dec 18, 2024
4a7a155
hq.consumer: ensure all URLs are unique after receiving the batch in …
equals215 Dec 18, 2024
c023b7a
hq.consumer: ensure contiguous batches have unique URLs when received…
equals215 Dec 20, 2024
a18ebf4
hq.consumer: deep copy the URL before sending it to consumerSender
equals215 Dec 20, 2024
a1585ab
first draft of terminal ui
equals215 Dec 24, 2024
3e29426
ui: better display of the stats and log display now has a buffer
equals215 Dec 25, 2024
903ec5c
ui: redesign and graceful shutdown && global: fix some stopping concu…
equals215 Dec 25, 2024
2996dee
ui: make the log buffer == the number of displayable lines
equals215 Dec 25, 2024
75a439a
config: renamed --live-stats to --tui
equals215 Dec 26, 2024
75b7be0
ui&controler: better pause with state management
equals215 Dec 26, 2024
54d6985
ui: change the info bar && stats: correct a counter get call
equals215 Dec 27, 2024
fcc2487
ui: split different parts of the UI in files
equals215 Dec 27, 2024
a29aae2
stats: add bucket type
equals215 Dec 27, 2024
af89df7
stats: add mean type
equals215 Dec 27, 2024
8b1d78c
ui: add HTTP xxx responses rates to stats
equals215 Dec 27, 2024
d2a9518
global: add pyroscope support for profiling
equals215 Jan 6, 2025
74b006c
control: add warc writing queue watcher
equals215 Jan 6, 2025
1744c27
stats&archiver: add warc writing queue size stats
equals215 Jan 7, 2025
b120e0c
ui&logs: add a ringbuffer to store logs for the UI to consume asynchr…
equals215 Jan 8, 2025
bf81b45
pyroscope: use os.Hostname instead of using env variable
equals215 Jan 8, 2025
18268c1
pyroscope: add project in service name
equals215 Jan 9, 2025
4ca61ef
pause&tui: add a message field to the pause package so that TUI can d…
equals215 Jan 9, 2025
ac28f93
add: S3 extractor
CorentinB Jan 7, 2025
b9a53a7
wip: outlinks extraction
CorentinB Jan 7, 2025
cb303b1
add: XML extraction (assets + sitemap)
CorentinB Jan 10, 2025
8e2ee8f
add: link header extraction
CorentinB Jan 10, 2025
abb57b6
add: ina.fr site-specific code
CorentinB Jan 10, 2025
da62081
add: facebook.com post.php URL generation
CorentinB Jan 10, 2025
8e142c1
add: tiktok.com preprocessing + config UA header
CorentinB Jan 10, 2025
5e8a1f8
fix: <a> tags being treated as assets instead of outlinks
CorentinB Jan 10, 2025
883d04c
add: URL exclusion
CorentinB Jan 10, 2025
3e9bad1
fix: ensure outlinks & assets hops are set to "parent hop" + 1
CorentinB Jan 12, 2025
801760b
add: show item depth in archiver
CorentinB Jan 12, 2025
4f889c7
add: show if an item is a seed in archiver's logs
CorentinB Jan 12, 2025
c9e4c3d
chore: adding some "clarité" to the archive() function variable names
CorentinB Jan 12, 2025
3313b66
chore: log hops in archive()
CorentinB Jan 12, 2025
f542dab
fix: max hops stop logic in postprocess()
CorentinB Jan 12, 2025
059e191
fix: possible infinite loop on Facebook site-specific post-processing
CorentinB Jan 12, 2025
d900837
fix: avoid graph.facebook.com being triggered by Facebook site-specifi…
CorentinB Jan 12, 2025
0d08a4a
fix: Facebook site-specific post-processing creating item as seed ins…
CorentinB Jan 12, 2025
25dfee2
chore: log item ID in case of preprocessor panic
CorentinB Jan 12, 2025
3b358eb
fix: Facebook site-specific post-processing not updating item's statu…
CorentinB Jan 12, 2025
0fc2127
remove: Facebook site-specific code until its fixed
CorentinB Jan 12, 2025
d8cf2d1
fix: max hops stop logic in postprocess()
CorentinB Jan 12, 2025
73d56e5
fix: max hops stop logic in postprocess()
CorentinB Jan 12, 2025
e0541a9
fix: close resp.Body after reading it in archive()
CorentinB Jan 12, 2025
ec4b543
stats: correct the getTotal() implementation for stats.rate
equals215 Jan 12, 2025
17673f6
optimize: memory usage of response body when being post-processed
CorentinB Jan 12, 2025
1006a6a
fix: remove useless defer to close resp.Body
CorentinB Jan 12, 2025
7766cff
fix: handle case where detected mimetype's parent is nil
CorentinB Jan 12, 2025
03cf021
optimize: memory usage when detecting mimetypes
CorentinB Jan 12, 2025
8b09857
fix: exclude SVG from XML assets extraction
CorentinB Jan 12, 2025
a5d47e6
fix: properly read data when processing body
CorentinB Jan 12, 2025
ae8d16d
chore: cleanup unused functions
CorentinB Jan 12, 2025
9f8ff88
chore: optimize body reading
CorentinB Jan 12, 2025
60235ce
fix: use body ready for creating the goquery document
CorentinB Jan 12, 2025
96ac83b
fix: process resp body in archive()
CorentinB Jan 12, 2025
d6979ae
add: spooled temp file for body post-processing
CorentinB Jan 13, 2025
974c5cf
fix: optimize goquery document usage/creation
CorentinB Jan 13, 2025
f4702ae
fix: avoid duplicate RewindBody in HTML extraction
CorentinB Jan 13, 2025
0ccce0a
add: trigger RewindBody just after document creation
CorentinB Jan 13, 2025
78009f5
fix: mimetype detection logic
CorentinB Jan 13, 2025
ed0213e
fix: avoid create the goquery document twice if assets + outlinks are…
CorentinB Jan 13, 2025
6e54375
item.dedupe: better handling of potential SIGSEGV errors
equals215 Jan 13, 2025
39ded8a
fix: panic on AddChild error in postprocessItem
CorentinB Jan 13, 2025
f948b16
fix: typo on nil check of asset
CorentinB Jan 13, 2025
01e26f2
archiver: restrict the scope for body processing
CorentinB Jan 14, 2025
74ac320
add: apply exclusions to seeds too
CorentinB Jan 14, 2025
07a7988
fix: send raw URLs to HQ
CorentinB Jan 14, 2025
ba89bfd
fix: return from preprocess when a seed is excluded
CorentinB Jan 14, 2025
650ce87
chore: adjust logging level for some log messages
CorentinB Jan 14, 2025
05fd2b1
chore: add logging for panic on preprocessor receiving non-fresh items
CorentinB Jan 14, 2025
d91d870
fix: remove Excluded state, use Completed
CorentinB Jan 14, 2025
8afc0bc
fix: panic message to return status of non-fresh item received in pre…
CorentinB Jan 14, 2025
c19d980
chore: cleanup exclusion logic
CorentinB Jan 14, 2025
2945ebf
add: limit on assets extraction depth based on tree instead of hops
CorentinB Jan 15, 2025
a63a2cd
fix: extractor tests
CorentinB Jan 15, 2025
e752ac5
add: --exclusion-file to specify a local or remote file containing a …
CorentinB Jan 15, 2025
7d99125
Fixing diverse v2 crashes (#179)
equals215 Jan 26, 2025
7483e3c
fix: hops logic in extractors
CorentinB Jan 26, 2025
d83a4f6
fix: typo
CorentinB Jan 26, 2025
fec60c0
chore: bump warc lib to v.0.8.65
CorentinB Jan 27, 2025
8b55ea8
Domains crawl refactor (#188)
equals215 Jan 28, 2025
bbef271
add: check if assets are truly files when extracting XML & JSON, if n…
CorentinB Jan 28, 2025
37dcc62
chore: bump warc lib to v.0.8.66
CorentinB Jan 30, 2025
96d0204
chore: bump warc lib to v.0.8.67
CorentinB Jan 30, 2025
91071ea
add: HTTP read deadline & bump warc lib to v0.8.68
CorentinB Jan 31, 2025
bd12395
add: streaming postprocessing for JSON
CorentinB Jan 31, 2025
7c601d3
add: streaming postprocessing for S3
CorentinB Jan 31, 2025
59be6fa
add: TestIsS3
CorentinB Jan 31, 2025
632fafc
add: TestS3 + S3 extraction refactoring
CorentinB Jan 31, 2025
d8c8ba5
add: streaming postprocessing for XML
CorentinB Jan 31, 2025
82cd8ad
fix: avoid 2-layer assets extraction when HTML is wrongly discovered …
CorentinB Jan 31, 2025
ddb2e89
add: use Zeno's User-Agent and custom HTTP client when requesting exc…
CorentinB Feb 2, 2025
43c8d63
Revert "Stream based extractors" (#194)
equals215 Feb 2, 2025
3402946
controler: moved disk and warc wq watchers to a new package and creat…
equals215 Feb 3, 2025
778043a
watchers.Disk: corrected tests
equals215 Feb 3, 2025
0a4cf6a
Add reddit.com support (#185)
NGTmeaty Feb 5, 2025
7aafcf2
controler.watchers: splitted the disk check functions further and add…
equals215 Feb 5, 2025
8658edf
Fix: reddit merge failed and got query escape code merged out
NGTmeaty Feb 5, 2025
00210e6
archiver: log remaining WARC writing while finishing
equals215 Feb 7, 2025
f5bc772
preprocessor: explicit call to Free() on ada-url to ensure no memory …
equals215 Feb 7, 2025
db5f1b4
Add PanicWithDump() function and enhance the dump generation (#205)
equals215 Feb 7, 2025
eae2c41
Adapt item model to new HQ IDs and bump `gocrawlhq` (#206)
equals215 Feb 7, 2025
e8956b2
Replace custom utils.StringInSlice with standard slices.Contains
vbanos Feb 7, 2025
6dba224
Merge pull request #208 from vbanos/use-slices-contains
CorentinB Feb 7, 2025
60667bd
Merge pull request #207 from internetarchive/v2/log-remaining-warc-wr…
CorentinB Feb 7, 2025
6dbab89
Stream based extractors (#195)
equals215 Feb 7, 2025
a085101
Simplify hopsToPath and pathToHops
vbanos Feb 7, 2025
d054e4a
Optimise extractBaseTag and add unit test
vbanos Feb 9, 2025
515c707
HTMLOutlinks unit test
vbanos Feb 9, 2025
6eda293
Fix unit test runtime error
vbanos Feb 9, 2025
775b7a1
Merge pull request #215 from vbanos/pathToHops
CorentinB Feb 10, 2025
ab870a6
Merge pull request #216 from vbanos/impr-base
CorentinB Feb 10, 2025
0795c82
Merge pull request #217 from vbanos/html-outlinks-test
CorentinB Feb 10, 2025
4b7adee
Refactor HTMLAssets and add unit tests
vbanos Feb 10, 2025
2e217db
Merge pull request #218 from vbanos/html-tests2
CorentinB Feb 10, 2025
0b7dee6
Improve isStatusCodeRedirect
vbanos Feb 11, 2025
6c4db80
Merge pull request #220 from vbanos/fix-red
CorentinB Feb 12, 2025
37df6c0
Rework concurrency for pipeline stages (#209)
equals215 Feb 12, 2025
ad3ed1e
Drop item.seed attribute (#222)
vbanos Feb 12, 2025
f836e2e
cmd: reduce pyroscope upload rate and set it to default (15s) (#224)
equals215 Feb 12, 2025
abf12f5
reactor: fixed test concurrency issues that could make the tests fail…
equals215 Feb 13, 2025
d247d94
cmd&config: add a no-log-file config to disable logging to files
equals215 Feb 13, 2025
64a9140
log: first rewrite of the package with samber/slog-multi to abstract …
equals215 Feb 13, 2025
1651396
implemented the ringbuffer a Writer and re-integrated it to the loggi…
equals215 Feb 14, 2025
b82fc2d
cmd&config: renamed logging es flags and precised that it's not imple…
equals215 Feb 14, 2025
00b728d
Merge pull request #225 from internetarchive/v2/rework-log
CorentinB Feb 15, 2025
8c20817
log.ringbuffer: amend PR with missing tests changes
equals215 Feb 16, 2025
0fced19
Remove unused const in the URL model (#228)
vbanos Feb 16, 2025
e8f4ed0
Optimize models `URLToString` (#226) thx to @yzqzss
equals215 Feb 17, 2025
6026c7d
fix: ensure directories exist before checking for disk space
NGTmeaty Feb 17, 2025
e3f9a45
chore: bump warc lib to v.0.8.70
CorentinB Feb 18, 2025
08548af
modify: set max-concurrent-assets default to 1
CorentinB Feb 18, 2025
38e264c
Bump warc lib to v0.8.71 and other libs and go to 1.24 (#232)
equals215 Feb 24, 2025
8 changes: 6 additions & 2 deletions .gitignore
@@ -2,8 +2,12 @@ jobs/*
 jobs/
 Zeno
 *.txt
-*.sh
+*.sh*
 zeno.log
 .vscode/
 *.py
-.DS_Store
+.DS_Store
+ZENO-*
+output.log
+.old/
+*.warc.*
95 changes: 95 additions & 0 deletions .old/go.mod
@@ -0,0 +1,95 @@
module github.com/internetarchive/Zeno

go 1.22.4

require (
github.com/CorentinB/warc v0.8.53
github.com/PuerkitoBio/goquery v1.9.3
github.com/asaskevich/govalidator v0.0.0-20230301143203-a9d515a09cc2
github.com/clbanning/mxj/v2 v2.7.0
github.com/dustin/go-humanize v1.0.1
github.com/elastic/go-elasticsearch/v8 v8.15.0
github.com/google/uuid v1.6.0
github.com/gosuri/uilive v0.0.4
github.com/gosuri/uitable v0.0.4
github.com/grafov/m3u8 v0.12.0
github.com/internetarchive/gocrawlhq v1.2.20
github.com/paulbellamy/ratecounter v0.2.0
github.com/philippgille/gokv/leveldb v0.7.0
github.com/prometheus/client_golang v1.20.4
github.com/remeh/sizedwaitgroup v1.0.0
github.com/sirupsen/logrus v1.9.3
github.com/spf13/cobra v1.8.1
github.com/spf13/pflag v1.0.5
github.com/spf13/viper v1.19.0
github.com/telanflow/cookiejar v0.0.0-20190719062046-114449e86aa5
go.uber.org/goleak v1.3.0
golang.org/x/net v0.29.0
google.golang.org/protobuf v1.34.2
mvdan.cc/xurls/v2 v2.5.0
)

require (
github.com/andybalholm/brotli v1.1.0 // indirect
github.com/andybalholm/cascadia v1.3.2 // indirect
github.com/aws/aws-sdk-go v1.55.5 // indirect
github.com/beorn7/perks v1.0.1 // indirect
github.com/cespare/xxhash/v2 v2.3.0 // indirect
github.com/cloudflare/circl v1.4.0 // indirect
github.com/davecgh/go-spew v1.1.2-0.20180830191138-d8f796af33cc // indirect
github.com/elastic/elastic-transport-go/v8 v8.6.0 // indirect
github.com/fatih/color v1.17.0 // indirect
github.com/fsnotify/fsnotify v1.7.0 // indirect
github.com/go-logr/logr v1.4.2 // indirect
github.com/go-logr/stdr v1.2.2 // indirect
github.com/gobwas/httphead v0.1.0 // indirect
github.com/gobwas/pool v0.2.1 // indirect
github.com/gobwas/ws v1.4.0 // indirect
github.com/golang/snappy v0.0.4 // indirect
github.com/gomodule/redigo v1.9.2 // indirect
github.com/google/go-cmp v0.6.0 // indirect
github.com/hashicorp/hcl v1.0.0 // indirect
github.com/inconshreveable/mousetrap v1.1.0 // indirect
github.com/jmespath/go-jmespath v0.4.0 // indirect
github.com/json-iterator/go v1.1.12 // indirect
github.com/klauspost/compress v1.17.10 // indirect
github.com/magiconair/properties v1.8.7 // indirect
github.com/mattn/go-colorable v0.1.13 // indirect
github.com/mattn/go-isatty v0.0.20 // indirect
github.com/mattn/go-runewidth v0.0.16 // indirect
github.com/miekg/dns v1.1.62 // indirect
github.com/mitchellh/mapstructure v1.5.0 // indirect
github.com/modern-go/concurrent v0.0.0-20180306012644-bacd9c7ef1dd // indirect
github.com/modern-go/reflect2 v1.0.2 // indirect
github.com/munnerz/goautoneg v0.0.0-20191010083416-a7dc8b61c822 // indirect
github.com/onsi/gomega v1.34.2 // indirect
github.com/pelletier/go-toml/v2 v2.2.3 // indirect
github.com/philippgille/gokv/encoding v0.7.0 // indirect
github.com/philippgille/gokv/util v0.7.0 // indirect
github.com/prometheus/client_model v0.6.1 // indirect
github.com/prometheus/common v0.59.1 // indirect
github.com/prometheus/procfs v0.15.1 // indirect
github.com/refraction-networking/utls v1.6.7 // indirect
github.com/rivo/uniseg v0.4.7 // indirect
github.com/sagikazarmark/locafero v0.6.0 // indirect
github.com/sagikazarmark/slog-shim v0.1.0 // indirect
github.com/sourcegraph/conc v0.3.0 // indirect
github.com/spf13/afero v1.11.0 // indirect
github.com/spf13/cast v1.7.0 // indirect
github.com/subosito/gotenv v1.6.0 // indirect
github.com/syndtr/goleveldb v1.0.0 // indirect
github.com/ulikunitz/xz v0.5.12 // indirect
go.opentelemetry.io/otel v1.30.0 // indirect
go.opentelemetry.io/otel/metric v1.30.0 // indirect
go.opentelemetry.io/otel/trace v1.30.0 // indirect
go.uber.org/multierr v1.11.0 // indirect
golang.org/x/crypto v0.27.0 // indirect
golang.org/x/exp v0.0.0-20240909161429-701f63a606c0 // indirect
golang.org/x/mod v0.21.0 // indirect
golang.org/x/sync v0.8.0 // indirect
golang.org/x/sys v0.26.0 // indirect
golang.org/x/text v0.18.0 // indirect
golang.org/x/tools v0.25.0 // indirect
gopkg.in/ini.v1 v1.67.0 // indirect
gopkg.in/yaml.v3 v3.0.1 // indirect
)
291 changes: 291 additions & 0 deletions .old/go.sum

Large diffs are not rendered by default.

File renamed without changes.