88 Open Source Extractor Software Projects
Free and open source extractor code projects including engines, APIs, generators, and tools.
Uniextract2 2045 ⭐
Universal Extractor 2 is a tool to extract files from any type of archive or installer.
News Please 1225 ⭐
news-please - an integrated web crawler and information extractor for news that just works
Peazip 1358 ⭐
Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.
Assetsextractor 526 ⭐
Open Semantic Etl 194 ⭐
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Urlextract 174 ⭐
URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.
Hrconvert2 207 ⭐
A self-hosted, drag-and-drop, & nosql file conversion server that supports 62x file formats.
Monkeylearn 94 ⭐
:no_entry: ARCHIVED :no_entry: :monkey: R package for text analysis with Monkeylearn :monkey:
Babel Plugin I18next Extract 109 ⭐
Babel plugin that statically extracts i18next and react-i18next translation keys.
Gettext Extractor 78 ⭐
Gamearchives 65 ⭐
A C# library for reading several video game archive formats, and a sample file explorer.
Sourcescraper 58 ⭐
Simple library which helps you to retrieve the source of various video streaming sites.
Openbackupextractor 104 ⭐
A free program for extracting data (like voicemails) from iPhone and iPad backups.
Youtube Jextractor 92 ⭐
Android based library that allows you to download or play audio and video from Youtube, in other words - youtube-dl for android
Recursiveextractor 91 ⭐
RecursiveExtractor is a .NET Standard 2.0 archive extraction Library, Progressive Web App and Command Line Tool which can process 7zip, ar, bzip2, deb, gzip, iso, rar, tar, vhd, vhdx, vmdk, wim, xzip, and zip archives and any nested combination of the supported formats.
Datasciencecampus Mobius 48 ⭐
Scripts to extract data from the COVID-19 Google Community Mobility Reports
Mylukin Textractor 49 ⭐
一个高效的从HTML中提取正文的类库。An efficient class library for extracting text from HTML.
Ctr Tools 69 ⭐
Crash Team Racing (PS1) tools - a C# framework and a set of tools to parse files found in the original kart racing game by Naughty Dog.
Bentools Etl 57 ⭐
PHP ETL (Extract / Transform / Load) library with SOLID principles + almost no dependency.
Runescape Cache Tools 57 ⭐
A .NET library and command-line interface to interact with RuneScape's cache.
Gr Eventstream 36 ⭐
gr-eventstream is a set of GNU Radio blocks for creating precisely timed events and either inserting them into, or extracting them from normal data-streams precisely. It allows for the definition of high speed time-synchronous c++ burst event handlers, as well as bridging to standard GNU Radio Async PDU messages with precise timing easily.
Url Feature Extractor 41 ⭐
Extracting features from URLs to build a data set for machine learning. The purpose is to find a machine learning model to predict phishing URLs, which are targeted to the Brazilian population.
Exceldna Unpack 33 ⭐
Command-line utility to extract the contents of Excel-DNA add-ins packed with ExcelDnaPack
Electron Video Downloader 21 ⭐
A minimal Electron application to download videos, eg from youtube, and associated captions (optional). Uses youtube-dl under the hood.
H2pc_tagextraction 12 ⭐
A application made to extract assets from cache files of H2v using BlamLib by KornnerStudios.
Seo Audits Toolkit 163 ⭐
SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc ...
Any23 80 ⭐
Apache Anything To Triples (Any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents.
Burp Sensitive Param Extractor 60 ⭐
burpsuite extension for check and extract sensitive request parameter
Resource Manager 34 ⭐
Utility for viewing, creating and extracting files from Age of Empires III .BAR archive
Minitools Bin_extractor 27 ⭐
A simple script for quickly mining sensitive information in binary files.
Gmailattachmentsextractor 23 ⭐
Downloads attachments from Gmail emails, then creates copy of emails but without extracted attachments.
Cobaltstrike Tools 15 ⭐
Tools for playing w/ CobaltStrike config - extractin, detection, processing, etc...
Alien Isolation Audio Extractor 14 ⭐
A simple tool to export and name sound files within Alien: Isolation.
Rdr2_screenshot_converter 11 ⭐
Convert and save photomode screenshots from Red Dead Redemption 2 to JPEG format.
Batch Pdf Image Extractor 12 ⭐
Extract images from PDF documents. Works on multiple and single PDF files