Skip to content

A powerful npm package for parsing text from PowerPoint, PDF, and Word documents. This tool seamlessly extracts text, making it easier to analyze, process, and integrate with your applications.

Notifications You must be signed in to change notification settings

Geekyash10/Multiparser-Package

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MultiParser

A powerful npm package for parsing text from PowerPoint, PDF, and Word documents. This tool seamlessly extracts text, making it easier to analyze, process, and integrate with your applications.

Features

  • Parse text from PPT, PDF, and DOCX files
  • Easy-to-use API
  • High performance and accuracy
  • Supports multiple file formats
  • Lightweight and fast

Installation

Install the package via npm:

npm install @xoxoharsh/multiparser

Usage

Here's how to use the package in your project:

  • For parsing whole file:
import Parser from '@xoxoharsh/multiparser';

const parser = new Parser(filePath);

parser.extractAll().then((text) =>{
    console.log(text);
  }).catch((error) => {
    console.error("Error extracting text:", error);
  });
  • For parsing a particular page:
import Parser from '@xoxoharsh/multiparser';

const parser = new Parser(filePath);

parser
  .extractPage(pageNo)
  .then((text) => {
    console.log("Page 3 text:", text);
  })
  .catch((error) => {
    console.error("Error extracting text:", error);
  });

 // Currently this feature is not available for word documents

Contributing

We welcome contributions!

About

A powerful npm package for parsing text from PowerPoint, PDF, and Word documents. This tool seamlessly extracts text, making it easier to analyze, process, and integrate with your applications.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published