World-class data science skill for statistical modeling, experimentation, causal inference, and advanced analytics. Expertise in Python (NumPy, Pandas, Scikit-learn), R, SQL, statistical methods, A/B testing, time series, and business intelligence. Includes experiment design, feature engineering, model evaluation, and stakeholder communication. Use when designing experiments, building predictive models, performing causal analysis, or driving data-driven decisions.
Security Analysis
high confidenceThis skill is internally consistent with its stated senior data-scientist purpose: the included docs and simple Python scripts are scaffolding for experiment/feature/model workflows and do not request credentials, install remote code, or contain obvious exfiltration or dangerous behavior.
Name/description (senior data scientist) matches the contents: documentation about experiment design, feature engineering and evaluation plus three small Python helper scripts. There are no unrelated environment variables, credentials, or platform-specific requirements declared that would be unexpected for a data-science skill.
SKILL.md instructs running the included scripts (e.g., python scripts/*.py) which is appropriate. It also references other common tooling and commands (docker, kubectl, helm, training/evaluate scripts) that are not bundled here — the skill does not declare those binaries as required. The shipped scripts are simple scaffolds that only read command-line args and return JSON results; they do not access network endpoints, environment secrets, or sensitive host paths. Recommend validating any additional referenced commands before executing them in your environment.
No install spec is provided (instruction-only plus small local scripts). Nothing is downloaded or written to disk by an installer and there are no remote URLs or extract operations to review.
The skill requests no environment variables, credentials, or config paths. The lack of required secrets is proportional to the included code (which only uses CLI args).
always is false and the skill does not request persistent/intrusive privileges or modify other skills/config. Normal agent invocation is allowed (default) but the skill itself has no self-configuring install or persistent autorun behavior.
Guidance
This package appears coherent and benign, but follow these precautions before running: (1) Review and run the Python scripts in a safe/test environment first — they accept input/output paths and currently only return simple JSON scaffolding. (2) SKILL.md mentions docker/kubectl/helm and other scripts (train.py, evaluate.py) that are not included — don't execute commands you don't recognize or that manipulate your cluster without verification. (3) If you intend to run the skill on real data, ensure you handle PII appropriately and run under least-privileged credentials. (4) If you need the full production workflows, obtain the missing tooling/scripts from a trusted source and inspect them before use.
Latest Release
v1.0.0
Initial release of the senior-data-scientist skill. - Provides expertise in statistical modeling, experimentation, causal inference, and advanced analytics. - Covers advanced production AI/ML system design, scalable architecture, model deployment, MLOps, and more. - Includes detailed reference documentation for statistical methods, experiment design, and feature engineering. - Features comprehensive tech stack, performance targets, and security/compliance practices. - Lists common commands and workflows for development, training, deployment, and monitoring. - Outlines senior-level responsibilities including leadership, strategy, and production excellence.
More by @alirezarezvani
Published by @alirezarezvani on ClawHub