MultiParser

A powerful npm package for parsing text from PowerPoint, PDF, and Word documents. This tool seamlessly extracts text, making it easier to analyze, process, and integrate with your applications.

Features

Parse text from PPT, PDF, and DOCX files
Easy-to-use API
High performance and accuracy
Supports multiple file formats
Lightweight and fast

Installation

Install the package via npm:

npm install @xoxoharsh/multiparser

Usage

Here's how to use the package in your project:

For parsing whole file:

import Parser from '@xoxoharsh/multiparser';

const parser = new Parser(filePath);

parser.extractAll().then((text) =>{
    console.log(text);
  }).catch((error) => {
    console.error("Error extracting text:", error);
  });

For parsing a particular page:

import Parser from '@xoxoharsh/multiparser';

const parser = new Parser(filePath);

parser
  .extractPage(pageNo)
  .then((text) => {
    console.log("Page 3 text:", text);
  })
  .catch((error) => {
    console.error("Error extracting text:", error);
  });

 // Currently this feature is not available for word documents

Contributing

We welcome contributions!

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
scripts		scripts
tests		tests
.gitignore		.gitignore
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MultiParser

Features

Installation

Usage

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Geekyash10/Multiparser-Package

Folders and files

Latest commit

History

Repository files navigation

MultiParser

Features

Installation

Usage

Contributing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages