In this presentation, we introduce Cloudknot, a software library that simplifies cloud-based distributed computing by programmatically executing user-defined functions (UDFs) in AWS Batch. It takes as input a Python function, packages it as a container, creates all the necessary AWS constituent resources to submit jobs, monitors their execution and gathers the results, all from within the Python environment. Cloudknot overcomes limitations of previous similar libraries, such as pywren, that runs UDFs on AWS Lambda, because most data science workloads exceed the AWS Lambda limits on execution time, RAM, and local storage.
See the full SciPy 2018 playlist at • SciPy 2018: Scientific Computing with Pyth...
コメント