      al-one

      Safety Report

      🎤 Transcribe audio files using Qwen ASR. 千问STT

      @al-one

      Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.

      1,116 Downloads
      5 Installs
      2 Stars
      1 Version

      Security Analysis

      high confidence
      Clean · 0.08 risk

      The skill's code and instructions match its stated purpose (upload audio to the Qwen ASR demo and return a transcript); it requests no credentials, but it will upload user audio to a third-party demo service (privacy consideration).

      Mar 9, 2026 · 2 files · 2 concerns
      Purpose & Capability: ok

      Name, description, SKILL.md and scripts/main.py all describe transcribing audio via the Qwen ASR demo (BASE_URL = https://qwen-qwen3-asr-demo.ms.show). The script's network calls and use of gradio_client are consistent with that purpose. Requiring the 'uv' runner matches the provided usage examples (uv run ...).

      Instruction Scope: note

      Runtime instructions tell the agent to run scripts/main.py with uv, optionally piping audio over stdin. The script uploads the audio to the Qwen ASR demo (/gradio_api/upload) and then calls the service via gradio_client to get the transcript. This is within the stated purpose, but it means user audio is transmitted to an external third-party endpoint. The SKILL.md mentions the demo host but does not explicitly warn about uploading potentially sensitive audio.
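
The upload step described above can be sketched roughly as follows. This is a minimal illustration, not the contents of scripts/main.py: the helper names (read_audio, build_upload_request), the multipart field name "files", and the use of the standard library instead of aiohttp are all assumptions. Only BASE_URL and the /gradio_api/upload path come from the report. The sketch builds the request without sending it, so the outgoing payload can be inspected first.

```python
import sys
import uuid
import urllib.request
from pathlib import Path

BASE_URL = "https://qwen-qwen3-asr-demo.ms.show"  # host named in the report
UPLOAD_PATH = "/gradio_api/upload"                # upload route named in the report


def read_audio(source: str) -> bytes:
    """Return audio bytes from a file path, or from stdin when source is '-'."""
    if source == "-":
        return sys.stdin.buffer.read()
    return Path(source).read_bytes()


def build_upload_request(audio: bytes, filename: str = "audio.wav",
                         base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a multipart/form-data POST carrying the audio."""
    boundary = uuid.uuid4().hex
    # Assumed form field name "files"; the report does not state it.
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="files"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return urllib.request.Request(
        base_url.rstrip("/") + UPLOAD_PATH,
        data=head + audio + tail,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )
```

Passing the returned object to urllib.request.urlopen would perform the actual upload. The follow-up transcription call through gradio_client is not sketched here, because the report does not name the endpoint the script invokes.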

      Install Mechanism: note

      Install entries reference installing 'uv' (brew) and pip alternatives for uv/aiohttp/gradio_client. These are reasonable and expected for running the Python script. Minor inconsistency: registry metadata lists a brew requirement for 'uv', while SKILL.md metadata also includes pip-based install options; both are plausible but the mismatch is worth noticing.

      Credentials: ok

      The skill declares no required environment variables, credentials, or config paths and the code does not read secrets or unrelated environment variables. No credentials are requested or needed for the demo service, which is proportional to the functionality.

      Persistence & Privilege: ok

      The 'always' flag is false, and the skill does not modify other skills or system-wide configuration. It does not request permanent presence or elevated privileges.

      Guidance

      This skill appears to do what it says: it uploads audio to the Qwen ASR demo (https://qwen-qwen3-asr-demo.ms.show) and returns the transcript. Before installing, consider privacy and compliance: any audio you transcribe will be sent to that third-party demo service (no API key required). If the audio is sensitive, do not use this skill or run it only against a trusted/private ASR endpoint. Also note the skill expects the 'uv' runner and Python dependencies (aiohttp, gradio_client); you may prefer to run the included script locally in an isolated environment to inspect behavior and avoid installing system-wide packages.
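
One way to act on this guidance is to check the target host before any audio leaves the machine. The following is a minimal sketch under stated assumptions: the TRUSTED_ASR_HOSTS allowlist, its example host, and the is_trusted_endpoint helper are hypothetical and are not part of the skill.

```python
from urllib.parse import urlparse

# Hypothetical allowlist: replace with the hosts of ASR endpoints you control.
TRUSTED_ASR_HOSTS = {"asr.internal.example"}


def is_trusted_endpoint(url: str, trusted=TRUSTED_ASR_HOSTS) -> bool:
    """Return True only for HTTPS URLs whose host is on the allowlist."""
    parts = urlparse(url)
    return parts.scheme == "https" and parts.hostname in trusted
```

A wrapper around the script could call this check on its configured base URL and refuse to upload when it returns False, which keeps sensitive audio away from the public demo host by default.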

      Latest Release

      v1.0.0

      Initial release of qwen-asr.
      - Transcribe audio files (wav/mp3/ogg, etc.) to text using Qwen ASR.
      - No configuration or API key needed.
      - Supports multiple languages.
      - Simple CLI usage and integration instructions provided.
      - Depends on uv, aiohttp, argparse, and gradio_client.

      Published by @al-one on ClawHub
