Image Processing
Resize, crop, convert, and optimize images. Your agent handles thumbnails, screenshots, and batch processing.
Curated Skills
Summarize
@summarize
Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).
Nano Banana Pro
@steipete
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Larry
@OllieWazza
Automate TikTok slideshow marketing for any app or product. Researches competitors, generates AI images, adds text overlays, posts via Postiz, tracks analyti...
Markdown Converter
@markdown
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
Playwright MCP
@Spiceman161
Browser automation via Playwright MCP server. Navigate websites, click elements, fill forms, extract data, take screenshots, and perform full browser automation workflows.
Browser Use
@ShawnPana
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with w...
Xiaohongshu (小红书) Automation
@xiaohongshu
Automate Xiaohongshu (RedNote) content operations using a Python client for the xiaohongshu-mcp server. Use for: (1) Publishing image, text, and video content, (2) Searching for notes and trends, (3) Analyzing post details and comments, (4) Managing user profiles and content feeds. Triggers: xiaohongshu automation, rednote content, publish to xiaohongshu, xiaohongshu search, social media management.
Markdown.new Skill
@markdown
Convert public web pages into clean Markdown with markdown.new for AI workflows. Use when tasks require URL-to-Markdown conversion for summarization, RAG ing...
Upload Videos🎥, Photos📸 & Text🖊️ to TikTok, Instagram, YouTube, X, LinkedIn, Facebook, Threads, Pinterest, Reddit & Bluesky via Upload-Post API
@upload
Upload content to social media platforms via Upload-Post API. Use when posting videos, photos, text, or documents to TikTok, Instagram, YouTube, LinkedIn, Facebook, X (Twitter), Threads, Pinterest, Reddit, or Bluesky. Supports scheduling, analytics, FFmpeg processing, and upload history.
Computer Use
@Ram-Raghav-S
Full desktop computer use for headless Linux servers. Xvfb + XFCE virtual desktop with xdotool automation. 17 actions (click, type, scroll, screenshot, drag,...
Technical Analyst
@Veeramanikandanr48
This skill should be used when analyzing weekly price charts for stocks, stock indices, cryptocurrencies, or forex pairs. Use this skill when the user provides chart images and requests technical analysis, trend identification, support/resistance levels, scenario planning, or probability assessments based purely on chart data without consideration of news or fundamental factors.
Agent Browser
@tekkenKK
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Windows Control
@windows
Full Windows desktop control. Mouse, keyboard, screenshots - interact with any Windows application like a human.
Browser Automation
@peytoncasper
Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.
Youtube Factory
@youtube
Generate complete YouTube videos from a single prompt - script, voiceover, stock footage, captions, thumbnail. Self-contained, no external modules. 100% free...
Nanonets OCR
@shhdwi
Document extraction API by Nanonets. Convert PDFs and images to markdown, JSON, or CSV with confidence scoring. Use when you need to OCR documents, extract invoice fields, parse receipts, or convert tables to structured data.
Openai Image Gen
@steipete
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Web Search
@web
This skill should be used when users need to search the web for information, find current content, look up news articles, search for images, or find videos. It uses DuckDuckGo's search API to return results in clean, formatted output (text, markdown, or JSON). Use for research, fact-checking, finding recent information, or gathering web resources.
xAI Grok Search
@grok
Search the web and X (Twitter) using xAI's Grok API with real-time access, citations, and image understanding
Docker Essentials
@Arnarsson
Essential Docker commands and workflows for container management, image operations, and debugging.
Table Image
@dannyshmueli
Generate clean table images from data. Perfect for Discord/Telegram where ASCII tables look broken. Supports dark/light mode, custom styling, and auto-sizing...
Bluesky
@bluesky
Complete Bluesky CLI: post, reply, like, repost, follow, block, mute, search, threads, images. Everything you need to engage on Bluesky from the terminal.
Searxng
@abk234
Privacy-respecting metasearch using your local SearXNG instance. Search the web, images, news, and more without external API dependencies.
Search X
@mvanhorn
Real-time X/Twitter search powered by Grok-4. Find tweets, trends, and discussions with citations. Grok-4.20 also returns image results alongside tweet citat...
Antigravity Image Generator
@IPedrax
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.
AI media generation API - Flux2pro, Veo3.1, Suno Ai
@vap
AI image, video, and music generation + editing via VAP API. Flux, Veo 3.1, Suno V5.
Sector Analyst
@sector
This skill should be used when analyzing sector and industry performance charts to assess market positioning and rotation patterns. Use this skill when the user provides performance chart images (1-week or 1-month timeframes) for sectors or industries and requests market cycle assessment, sector rotation analysis, or strategic positioning recommendations based on performance data. All analysis and output are conducted in English.
Seedance Video Generation
@seedance
Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) generate videos from images (first frame, first+last frame, reference images), or (3) query/manage video generation tasks. Supports Seedance 1.5 Pro (with audio), 1.0 Pro, 1.0 Pro Fast, and 1.0 Lite models.
PaddleOCR Text Recognition
@Bobholamovic
Use this skill when users need to extract text from images, PDFs, or documents. Supports URLs and local files. Returns structured JSON containing recognized...
tube-cog
@nitishgargiitd
YouTube content creation powered by CellCog. Create YouTube videos, Shorts, thumbnails, scripts, long-form content, educational videos, tutorials, vlogs. AI-powered YouTube creator tools.
AI media generation- Flux2pro,Google Veo3.1, Suno Ai..
@vap
AI image, video, and music generation + editing via VAP API. Flux, Veo 3.1, Suno V5.
Agent Browser
@tekkenKK
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
Official video generation. Image to video / Text to video / Reference to video / Text to image / Reference to image / Video edit / Image edit
@calvinzhao
Generate videos or images from text, images, or references, create and edit material elements, submit and query asynchronous video generation tasks via bundl...
AI picture book generate
@ai
Generate static or dynamic picture book videos using Baidu AI
Nano Banana Pro
@DyCathecorde
Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro).
ADB Connection
@StaticAI
Control Android devices via ADB with support for UI layout analysis (uiautomator) and visual feedback (screencap). Use when you need to interact with Android apps, perform UI automation, take screenshots, or run complex ADB command sequences.
Playwright (Automation + MCP + Scraper)
@ivangdavila
Browser automation and web scraping with Playwright. Forms, screenshots, data extraction. Works standalone or via MCP. Testing included.
Polyvision
@mysticriverx
Analyze Polymarket prediction market wallets — get copy trading scores (1-10), P&L, win rate, risk metrics (Sharpe ratio, Sortino ratio, max drawdown), red f...
Excalidraw Diagram Generator
@excalidraw
Generate hand-drawn style diagrams, flowcharts, and architecture diagrams as PNG images from Excalidraw JSON
Gemini Image Gen
@gemini
Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero depe...
Related Use Cases
Ready to build?
Deploy a managed AI agent with these skills in 60 seconds.