Welcome to steppy¶
What is Steppy?¶
Steppy is a lightweight, open-source, Python 3 library for fast and reproducible experimentation. It lets data scientist focus on data science, not on software development issues. Steppy’s minimal interface does not impose constraints, however, enables clean machine learning pipeline design.
What problem steppy solves?¶
In the course of the project, data scientist faces multiple problems.
Difficulties with reproducibility and lack of the ability to prepare
experiments quickly are two particular examples. Steppy address both
problems by introducing two simple abstractions:
Tranformer. We consider it minimal interface for building machine
Step is a wrapper over the transformer and handles multiple aspects
of the execution of the pipeline, such as saving intermediate results
(if needed), checkpoiting the model during training and much more.
Tranformer in turn, is purely computational, data scientist-defined
piece that takes an input data and produces some output data. Typical
Transformers are neural netowrk, machine learning algorithms and pre- or
Start using steppy¶
Please send us your ideas on how to improve steppy library! We are looking for your comments here: Feature requests.
At this point steppy is early-stage library heavily tested on multiple machine learning challenges (data-science-bowl, toxic-comment-classification-challenge, mapping-challenge) and educational projects (minerva-advanced-data-scientific-training).
We are developing steppy towards practical tool for data scientists who can run their experiments easily and change their pipelines with just few manipulations in the code.
You are welcome to contribute to the Steppy library. Please check CONTRIBUTING for more information.