Safety Report

Paper Parser

Name: Paper Parser
Rating: 5 (2 reviews)
Author: mzlzyca

Parse academic papers and research documents from PDF using MinerU. Extracts structured content including title, abstract, sections, figures, tables, formula...

201 Downloads

0 Installs

2 Stars

6 Versions

Search & Retrieval (4,480), PDF & Documents (3,686), Web Scraping (2,251), Math & Science (1,012)

Security Analysis

high confidence

Clean

The skill's requirements, runtime instructions, and install options are consistent with a PDF-to-structured-paper parser built on MinerU and do not request unrelated access.

Apr 3, 2026 (1 file)

Purpose & Capability: ok

The name/description (paper parsing via MinerU) matches the declared binary (mineru-open-api), the declared primary env var (MINERU_TOKEN), and the documented CLI commands.
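
As a minimal sketch of what these declared requirements mean in practice, the Python snippet below checks that the binary is on PATH and that the token variable is set before the skill is used. Only the binary name and the environment variable come from this report; the helper name `check_requirements` is hypothetical and is not part of the skill or of MinerU.

```python
import os
import shutil

REQUIRED_BINARY = "mineru-open-api"  # binary name as declared in the report
REQUIRED_ENV_VAR = "MINERU_TOKEN"    # primary env var as declared in the report


def check_requirements(environ=None, which=shutil.which):
    """Return a list of problems; an empty list means both requirements are met.

    Hypothetical helper for illustration, not part of the skill or of MinerU.
    """
    environ = os.environ if environ is None else environ
    problems = []
    # shutil.which returns None when the executable is not found on PATH.
    if which(REQUIRED_BINARY) is None:
        problems.append(f"binary not found on PATH: {REQUIRED_BINARY}")
    # Treat an empty string the same as an unset variable.
    if not environ.get(REQUIRED_ENV_VAR):
        problems.append(f"environment variable not set: {REQUIRED_ENV_VAR}")
    return problems
```

Calling `check_requirements()` with no arguments inspects the current process environment; the parameters exist only so the check can be exercised without touching the real system.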

Instruction Scope: ok

SKILL.md only instructs use of the mineru-open-api CLI (extract/auth/crawl) on PDFs and URLs; it does not direct the agent to read unrelated files, secrets, or system config.

Install Mechanism: ok

Install options are standard: an npm package and a Go install from a GitHub repo. The spec contains no downloads from untrusted URLs and no archive-extraction steps.

Credentials: ok

Only one credential (MINERU_TOKEN) is required and is appropriate for an authenticated API/CLI. No unrelated secrets, config paths, or broad credential requests are present.

Persistence & Privilege: ok

The skill is not forced-always and does not modify other skills; its runtime behavior is limited to invoking the mineru-open-api CLI. Autonomous invocation is allowed, which is the platform default.

Guidance

This skill appears coherent for parsing academic PDFs with MinerU. Before installing, verify that the npm package name and the GitHub repo (https://github.com/opendatalab/MinerU-Ecosystem) match the vendor site (https://mineru.net), inspect the repo or package code if possible, and limit the scope of MINERU_TOKEN by creating a dedicated token. Install the CLI in a controlled environment, and avoid supplying unrelated or high-privilege credentials. If you handle sensitive documents, test on non-sensitive PDFs first to confirm outputs and behavior.
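
One way to follow the advice about not supplying unrelated credentials is to run the CLI with an allow-listed environment. The sketch below only builds that environment; the helper name `minimal_env` and the allow-list contents are assumptions for illustration, and the actual CLI arguments are not shown because this report does not document them.

```python
import os

# Assumption: PATH and HOME are enough for the CLI to start on a typical
# system; adjust the allow-list for your platform.
ALLOWED_VARS = ("PATH", "HOME", "MINERU_TOKEN")


def minimal_env(environ=None, allowed=ALLOWED_VARS):
    """Copy only allow-listed variables, dropping every other variable.

    Hypothetical helper for illustration, not part of the skill or of MinerU.
    """
    environ = os.environ if environ is None else environ
    return {name: environ[name] for name in allowed if name in environ}
```

The returned dictionary can be passed as the `env` argument of `subprocess.run` when launching the CLI, so that other secrets in the parent shell are not visible to the child process.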

Latest Release

v0.4.0

SEO: expand description for better ClawHub vector search discovery

More by @mzlzyca

PPTX Parse

0 stars

PPTX to Text

0 stars

Pptx Ocr

0 stars

Pptx Extract

0 stars

PPTX Analysis

0 stars

Ppt Extract

0 stars

Published by @mzlzyca on ClawHub