Safety Report

🎤 Transcribe audio files using Qwen ASR. 千问STT

Name: 🎤 Transcribe audio files using Qwen ASR. 千问STT
Rating: 5 (2 reviews)
Author: al-one

Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.

1,116 Downloads

5 Installs

2 Stars

1 Version

Security Analysis

Clean (0.08 risk), high confidence

The skill's code and instructions match its stated purpose (upload audio to the Qwen ASR demo and return a transcript); it requests no credentials, but it will upload user audio to a third-party demo service (privacy consideration).

Mar 9, 2026 · 2 files · 2 concerns

Purpose & Capability: ok

Name, description, SKILL.md and scripts/main.py all describe transcribing audio via the Qwen ASR demo (BASE_URL = https://qwen-qwen3-asr-demo.ms.show). The script's network calls and use of gradio_client are consistent with that purpose. Requiring the 'uv' runner matches the provided usage examples (uv run ...).

Instruction Scope: note

Runtime instructions tell the agent to run scripts/main.py with uv and/or pipe audio over stdin. The script uploads the audio to the Qwen ASR demo (/gradio_api/upload) and then calls the service via gradio_client to get transcripts. This is within the stated purpose, but it does mean user audio is transmitted to an external third-party endpoint — the SKILL.md mentions the demo host but does not explicitly warn about uploading potentially sensitive audio.
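The flow described above can be sketched roughly as follows. This is an illustration, not the contents of scripts/main.py: the host and upload route come from the report, while the helper names upload_url and transcribe, the api_name argument, and the single-file argument list are assumptions. The sketch also lets gradio_client's handle_file take care of the upload instead of posting to the upload route directly, which may differ from what the script does.

```python
from urllib.parse import urljoin

BASE_URL = "https://qwen-qwen3-asr-demo.ms.show"  # demo host named in the report
UPLOAD_PATH = "/gradio_api/upload"                # upload route named in the report


def upload_url(base_url: str = BASE_URL) -> str:
    """Build the absolute URL that receives the audio file."""
    return urljoin(base_url.rstrip("/") + "/", UPLOAD_PATH.lstrip("/"))


def transcribe(audio_path: str, api_name: str):
    """Send a local audio file to the demo and return the endpoint's result.

    ASSUMPTION: api_name and the one-argument call are placeholders. The real
    endpoint name and parameters are not stated in the report.
    """
    # Imported lazily so upload_url() works without gradio_client installed.
    from gradio_client import Client, handle_file

    client = Client(BASE_URL)
    # handle_file marks the path as a file to upload as part of the request.
    return client.predict(handle_file(audio_path), api_name=api_name)
```

Calling transcribe sends the file off the machine, which is the privacy consideration the note refers to. The endpoint names a Gradio app actually exposes can be listed with the client's view_api() method before choosing an api_name.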

Install Mechanism: note

Install entries reference installing 'uv' (brew) and pip alternatives for uv/aiohttp/gradio_client. These are reasonable and expected for running the Python script. Minor inconsistency: registry metadata lists a brew requirement for 'uv', while SKILL.md metadata also includes pip-based install options; both are plausible but the mismatch is worth noticing.

Credentials: ok

The skill declares no required environment variables, credentials, or config paths and the code does not read secrets or unrelated environment variables. No credentials are requested or needed for the demo service, which is proportional to the functionality.
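A reader who wants to check the "does not read environment variables" claim independently could run a small static scan over the script. The sketch below is one way to do that with the standard ast module. The function name find_env_reads is hypothetical, and the scan only catches direct os.environ and os.getenv references, not aliased imports such as "from os import environ".

```python
import ast


def find_env_reads(source: str) -> list[int]:
    """Return sorted line numbers that reference os.environ or os.getenv."""
    hits = set()
    for node in ast.walk(ast.parse(source)):
        # Matches os.environ[...], os.environ.get(...), and os.getenv(...).
        if (
            isinstance(node, ast.Attribute)
            and node.attr in {"environ", "getenv"}
            and isinstance(node.value, ast.Name)
            and node.value.id == "os"
        ):
            hits.add(node.lineno)
    return sorted(hits)
```

An empty result for the contents of scripts/main.py would be consistent with the finding above, though it is not proof that no secrets are read by other means (for example, from files).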

Persistence & Privilege: ok

The 'always' flag is false, and the skill does not modify other skills or system-wide configuration. It does not request permanent presence or elevated privileges.

Guidance

This skill appears to do what it says: it uploads audio to the Qwen ASR demo (https://qwen-qwen3-asr-demo.ms.show) and returns the transcript. Before installing, consider privacy and compliance: any audio you transcribe will be sent to that third-party demo service (no API key required). If the audio is sensitive, do not use this skill or run it only against a trusted/private ASR endpoint. Also note the skill expects the 'uv' runner and Python dependencies (aiohttp, gradio_client); you may prefer to run the included script locally in an isolated environment to inspect behavior and avoid installing system-wide packages.
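The advice about sensitive audio could be enforced with a small gate in front of any upload. The following is a minimal sketch under stated assumptions: the function name may_upload, the sensitive flag, and the TRUSTED_HOSTS set (with its placeholder host) are all hypothetical and not part of the skill.

```python
from urllib.parse import urlparse

# ASSUMPTION: placeholder for a private ASR endpoint the user controls.
TRUSTED_HOSTS = frozenset({"asr.internal.example"})


def may_upload(endpoint: str, sensitive: bool, trusted_hosts=TRUSTED_HOSTS) -> bool:
    """Allow non-sensitive audio anywhere; sensitive audio only to trusted hosts."""
    if not sensitive:
        return True
    host = urlparse(endpoint).hostname or ""
    return host in trusted_hosts
```

With this gate, audio marked sensitive would be refused for the public demo host unless that host were explicitly added to the trusted set.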

Latest Release

v1.0.0

Initial release of qwen-asr.
- Transcribe audio files (wav/mp3/ogg, etc.) to text using Qwen ASR.
- No configuration or API key needed.
- Supports multiple languages.
- Simple CLI usage and integration instructions provided.
- Depends on uv, aiohttp, argparse, and gradio_client.

Published by @al-one on ClawHub