Image Processing

Resize, crop, convert, and optimize images. Your agent handles thumbnails, screenshots, and batch processing.

5367 skills·Security verified

Curated Skills

Summarize

@summarize

Summarize URLs or files with the summarize CLI (web, PDFs, images, audio, YouTube).

415·94,990·Clean

Nano Banana Pro

@steipete

Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.

164·38,041·Suspicious

Meitu Skills

@meituskills

Comprehensive Meitu AI toolkit for image and video editing. Features include AI poster design, precise background cutout, virtual try-on, e-commerce product...

119·1,352·Clean

Larry

@OllieWazza

Automate TikTok slideshow marketing for any app or product. Researches competitors, generates AI images, adds text overlays, posts via Postiz, tracks analyti...

106·7,633·Suspicious

Markdown convert

@compdf-youna

Process, convert, edit, and extract data from PDF files using the ComPDF Cloud API. Supports format conversion (Word, Excel, Image), page manipulation (merge...

99·182·Clean

PDF Editor

@compdf-youna

PDF Editor edits and organizes PDF pages with merge, insert, reorder, exchange, and crop operations, built on ComPDF page management capabilities for fast PD...

96·234·Clean

PDF Converter

@compdf-youna

PDF conversion toolkit featuring AI layout analysis and OCR. Converts PDFs to Word, Markdown, JSON, PPT, CSV, HTML, and XML for seamless LLM data processing.

93·169·Clean

Markdown Converter

@markdown

Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.

92·15,691·Clean

PDF Extract

@compdf-youna

Extract PDF extracts structured data from PDFs and images, including tables, OCR text, images, and stamps, built on ComPDF data extraction and AI document ex...

90·207·Clean

PDF to Word Converter

@compdf-youna

PDF to Word converts PDF to editable Word/DOCX with AI-powered layout analysis and table recognition, built on ComPDF Conversion SDK to better preserve table...

88·235·Clean

Playwright MCP

@Spiceman161

Browser automation via Playwright MCP server. Navigate websites, click elements, fill forms, extract data, take screenshots, and perform full browser automation workflows.

54·16,493·Clean

Browser Use

@ShawnPana

Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with w...

48·20,511·Suspicious

Markdown.new Skill

@markdown

Convert public web pages into clean Markdown with markdown.new for AI workflows. Use when tasks require URL-to-Markdown conversion for summarization, RAG ing...

37·16,531·Clean

Xiaohongshu (小红书) Automation

@xiaohongshu

Automate Xiaohongshu (RedNote) content operations using a Python client for the xiaohongshu-mcp server. Use for: (1) Publishing image, text, and video content, (2) Searching for notes and trends, (3) Analyzing post details and comments, (4) Managing user profiles and content feeds. Triggers: xiaohongshu automation, rednote content, publish to xiaohongshu, xiaohongshu search, social media management.

37·11,651·Clean

Upload Videos🎥, Photos📸 & Text🖊️ to TikTok, Instagram, YouTube, X, LinkedIn, Facebook, Threads, Pinterest, Reddit & Bluesky via Upload-Post API

@upload

Upload content to social media platforms via Upload-Post API. Use when posting videos, photos, text, or documents to TikTok, Instagram, YouTube, LinkedIn, Facebook, X (Twitter), Threads, Pinterest, Reddit, or Bluesky. Supports scheduling, analytics, FFmpeg processing, and upload history.

31·6,246·Clean

ATXP

@emilioacc

Access ATXP paid API tools for web search, AI image generation, music creation, video generation, X/Twitter search, email, and agent account management. Use...

29·46,610·Suspicious

Computer Use

@Ram-Raghav-S

Full desktop computer use for headless Linux servers. Xvfb + XFCE virtual desktop with xdotool automation. 17 actions (click, type, scroll, screenshot, drag,...

26·9,768·Clean

Technical Analyst

@Veeramanikandanr48

This skill should be used when analyzing weekly price charts for stocks, stock indices, cryptocurrencies, or forex pairs. Use this skill when the user provides chart images and requests technical analysis, trend identification, support/resistance levels, scenario planning, or probability assessments based purely on chart data without consideration of news or fundamental factors.

25·6,340·Clean

Agent Browser

@tekkenKK

Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.

23·4,070·Clean

Windows Control

@windows

Full Windows desktop control. Mouse, keyboard, screenshots - interact with any Windows application like a human.

22·5,306·Clean

Browser Automation

@peytoncasper

Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.

21·16,687·Suspicious

Nanonets OCR

@shhdwi

Document extraction API by Nanonets. Convert PDFs and images to markdown, JSON, or CSV with confidence scoring. Use when you need to OCR documents, extract invoice fields, parse receipts, or convert tables to structured data.

20·2,572·Clean

Openai Image Gen

@steipete

Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.

20·13,620·Suspicious

Youtube Factory

@youtube

Generate complete YouTube videos from a single prompt - script, voiceover, stock footage, captions, thumbnail. Self-contained, no external modules. 100% free...

20·3,112·Clean

Skywork Design

@gxcun17

Skywork Design (skywork) - Generate or edit images via the Skywork Image API. Use for image creation, poster design, logo design, visual asset generation, or...

19·1,418·Clean

Web Search

@web

This skill should be used when users need to search the web for information, find current content, look up news articles, search for images, or find videos. It uses DuckDuckGo's search API to return results in clean, formatted output (text, markdown, or JSON). Use for research, fact-checking, finding recent information, or gathering web resources.

18·15,445·Clean

Minimax-Multimodal-Toolkit

@minimax-ai-dev

Use mmx to generate text, images, video, speech, and music via the MiniMax AI platform. Use when the user wants to create media content, chat with MiniMax mo...

17·3,477·Suspicious

Docker Essentials

@Arnarsson

Essential Docker commands and workflows for container management, image operations, and debugging.

17·16,873·Clean

xAI Grok Search

@grok

Search the web and X (Twitter) using xAI's Grok API with real-time access, citations, and image understanding

17·2,376·Clean

OCR - Local (No API Key)

@shaw555

Extract text from images using Tesseract.js OCR (100% local, no API key required). Supports Chinese (simplified/traditional) and English.

15·12,848·Clean

Table Image

@dannyshmueli

Generate clean table images from data. Perfect for Discord/Telegram where ASCII tables look broken. Supports dark/light mode, custom styling, and auto-sizing...

15·2,938·Clean

Bluesky

@bluesky

Complete Bluesky CLI: post, reply, like, repost, follow, block, mute, search, threads, images. Everything you need to engage on Bluesky from the terminal.

14·4,782·Clean

Searxng

@abk234

Privacy-respecting metasearch using your local SearXNG instance. Search the web, images, news, and more without external API dependencies.

14·9,193·Clean

Search X

@mvanhorn

Real-time X/Twitter search powered by Grok-4. Find tweets, trends, and discussions with citations. Grok-4.20 also returns image results alongside tweet citat...

13·3,549·Clean

Antigravity Image Generator

@IPedrax

Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.

13·8,688·Clean

Sector Analyst

@sector

This skill should be used when analyzing sector and industry performance charts to assess market positioning and rotation patterns. Use this skill when the user provides performance chart images (1-week or 1-month timeframes) for sectors or industries and requests market cycle assessment, sector rotation analysis, or strategic positioning recommendations based on performance data. All analysis and output are conducted in English.

11·1,827·Clean

AI media generation API - Flux2pro, Veo3.1, Suno Ai

@vap

AI image, video, and music generation + editing via VAP API. Flux, Veo 3.1, Suno V5.

11·3,572·Clean

Seedance Video Generation

@seedance

Generate AI videos using ByteDance Seedance. Use when the user wants to: (1) generate videos from text prompts, (2) generate videos from images (first frame, first+last frame, reference images), or (3) query/manage video generation tasks. Supports Seedance 1.5 Pro (with audio), 1.0 Pro, 1.0 Pro Fast, and 1.0 Lite models.

10·2,344·Clean

Agent Browser

@tekkenKK

9·2,904·Clean

tube-cog

@nitishgargiitd

YouTube content creation powered by CellCog. Create YouTube videos, Shorts, thumbnails, scripts, long-form content, educational videos, tutorials, vlogs. AI-powered YouTube creator tools.

9·1,415·Clean
