    Repositories list

    • Official Elasticsearch Docker image
      Python
      Apache License 2.0
      240002 Updated Jul 16, 2023
    • portia

      Public
      Visual scraping for Scrapy
      Python
      Other
      1.4k007 Updated May 23, 2023
    • crawler4j

      Public
      Open Source Web Crawler for Java
      Java
      1.9k003 Updated Apr 16, 2023
    • doccano

      Public
      Open source text annotation tool for machine learning practitioners.
      Python
      MIT License
      1.7k007 Updated Dec 8, 2022
    • glusterfs

      Public
      Gluster Filesystem: Build your distributed storage in minutes
      C
      GNU General Public License v2.0
      1.1k000 Updated Jan 24, 2021
    • Web app for Scrapyd cluster management, Scrapy log analysis & visualization, auto packaging, timer tasks, email notice, and mobile UI.
      Python
      GNU General Public License v3.0
      575000 Updated Jun 22, 2019
    • aquarium

      Public
      Aquarium is a cookiecutter template for generating Splash + HAProxy Docker nodes that you can run with Docker Compose
      Python
      MIT License
      0200 Updated Jun 20, 2018
    • Testing Lua scripts from within Scrapy using Splash
      Python
      1000 Updated Jun 20, 2018
    • Lists syntactic patterns of HTTP user-agents used by robots/crawlers/spiders
      Python
      MIT License
      262000 Updated Feb 4, 2018
    • The ELK stack powered by Docker and Compose.
      Python
      MIT License
      6.9k100 Updated Dec 27, 2017
    • esbulk

      Public
      Fast JSON bulk indexing utility for Elasticsearch.
      Go
      Other
      41000 Updated Sep 26, 2017
    • Docker Compose configuration for running a Sentry server.
      Python
      MIT License
      6000 Updated May 25, 2017
    • Docker Apache Airflow
      Shell
      2.1k000 Updated Apr 10, 2017
    • gothumbor

      Public
      Golang client for Thumbor Image Service
      Go
      MIT License
      11000 Updated Apr 10, 2017
    • Dockerfile for creating a container that contains pyspark 1.6.0, mongodb-hadoop 1.5.1, and Jupyter Notebook
      Apache License 2.0
      3000 Updated Mar 16, 2017
    • scrapely

      Public
      A pure-python HTML screen-scraping library
      HTML
      272000 Updated Mar 8, 2017
    • Shell
      111000 Updated Sep 23, 2016
    • Ansible Role - Logstash
      217000 Updated Jul 13, 2016
    • This project is a fork of sivel/ansible-newrelic.
      Apache License 2.0
      3200 Updated Jun 26, 2016
    • Thumbor AWS extensions
      Python
      112000 Updated May 12, 2016
    • Ansible role to configure MongoDB
      Python
      GNU General Public License v2.0
      296000 Updated Apr 19, 2016
    • pysolr

      Public
      Pysolr 3.2.0. The official source.
      Python
      Other
      341000 Updated Nov 23, 2015
    • db-backup

      Public archive
      Shell
      0000 Updated Aug 28, 2014