Give your agent a voice. Use when the user wants the agent to speak, read aloud, or have voice responses.
Security Analysis
high confidenceThe skill implements an on-device TTS/visualizer and the files, commands, and configuration it requests are consistent with that purpose — nothing in the package looks designed to do something else.
Name/description (local TTS and visualizer) align with the included files and runtime steps. The package reasonably needs Python, espeak-ng, venvs, optional Swift compilation, and model files; those are present and used for Kokoro/MLX TTS and the macOS visualizer.
The SKILL.md and scripts direct the agent to run setup.py, start/stop a local daemon, run speak.py, and optionally compile/run a Swift visualizer. These instructions are within the stated TTS scope, but the setup step will install packages, create venvs, download models, and — on macOS — patch a third‑party module in the venv to fix library loading. Those actions are functional for the feature but worth the user's awareness.
There is no registry 'install' spec, but setup.py will create virtual environments and invoke pip to install mlx-audio/kokoro and dependencies, and on macOS will call swiftc to compile the visualizer and may run Homebrew. This involves network downloads from PyPI and model sources (expected for TTS), which is moderate risk but proportionate to the feature.
The skill requests no environment variables or external API keys. Configuration and credentials are stored under ~/.her-voice. The declared binaries (python3, espeak-ng) match the functionality and no unrelated credentials or system paths are requested.
The daemon is optional (not always:true). It creates confined files in ~/.her-voice, a UNIX socket and PID file with restrictive permissions (0600), and does not modify other skills or system-wide agent settings. Running a background daemon is expected for low-latency TTS.
Guidance
What to consider before installing: - This skill installs Python virtual environments and pip packages (mlx-audio, kokoro, numpy, etc.) and may download a TTS model (large). Ensure you have disk space and are comfortable allowing those network downloads. - On macOS it will attempt to compile a Swift visualizer and may run Homebrew to install espeak-ng; follow prompts and grant only the actions you trust. - The setup may patch a third‑party module inside the created venv to fix library loading (macOS only). The change targets the venv, not system packages, but you may want to inspect the patch before allowing it. - The optional daemon listens on a UNIX socket under ~/.her-voice with restrictive permissions (0600). Any local process with access to your user account could connect; the socket is not network-exposed. - No API keys or external endpoints are hardcoded; however pip/model downloads require internet access. If you need to be cautious, run setup in an isolated environment (VM/container) or review the scripts first. - If you are not comfortable with code that installs packages, writes to your home directory, compiles binaries, or modifies venv-installed modules, do not install or audit the code before running.
Latest Release
v1.0.2
**Improved daemon security, config clarity, and doc hints.** - TTS daemon files (PID, socket) are now stored in `~/.her-voice/` with owner-only permissions for better security. - Added documentation tip: agent now automatically resolves `SKILL_DIR` when running commands. - Improved configuration instructions for clarity and accuracy. - Updated changelog and documentation to match these behavioral updates. - No breaking changes; functionality remains the same.
More by @matusvojtek
Published by @matusvojtek on ClawHub