This white paper gives a high level outline for a new decentralized approach to managing data science across machines. The new approach eliminates the chaos typical of other decentralized approaches.
Specifically, the paper frames the challenges of scaling data science on hybrid infrastructures and introduces a turnkey framework that provides:
- Easy deployment on premises or in the cloud
- Reduced IT effort for provisioning & managing infrastructure and environments
- One click transfer of customized & reproducible Python or R work between machines & locations
- Automation and streamlining for versioning, containerization and best practices.