Aladdin

Application Information

💡 What is Aladdin?

Aladdin (Alaya AI addin) is an operations addin developed by DataCanvas on a Serverless architecture, designed for users who manage VKS on the ALaya NeW platform. With this addin, users can connect to VKS remotely for development, debugging, and model training. Aladdin lets users match the computing power configuration to each training or optimization task, which improves resource utilization and development efficiency.

🌟 Advantages of Aladdin

🍓 Development Experience Free from Operation and Maintenance

Aladdin connects directly to VKS, integrating the local development environment with cloud computing resources. By hiding the complexity of the underlying Operation and Maintenance (O&M), it lets developers focus on model iteration and algorithmic innovation.

🍓 Zero-Idle-Resource Architecture

Based on the Serverless intelligent scaling mechanism, computing resources scale dynamically at the millisecond level, and instances are released as soon as a task completes. The service is billed by actual compute duration, which eliminates idle costs.

🍓 Cost-Effective Computing Resource Utilization

Aladdin allocates heterogeneous GPU/NPU computing resources on demand, and large model training tasks can resume from checkpoints after an interruption. This helps every unit of computing power contribute directly to business objectives.

🚀 Getting Started
📖 Key Features Overview

Workshop

A Workshop is the user's workspace; its data is stored on shared storage. When creating a Workshop, users can customize the image, compute resources, and environment variables, and mount storage volumes as needed. Within each Workshop, users can develop, debug, and train models in the remote VSCode environment and use VKS resources to debug and execute programs.

Development with VKS Resources

Within a Workshop, you can use VKS resources to execute programs: Python programs support GPU Run and GPU Debug, and Shell scripts support Run Shell. The image and resources can be configured for each execution, and the resources are released automatically when the execution succeeds or fails, so each training or optimization run uses only the compute configuration it needs.
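Because the resources behind each execution are released as soon as it ends, a training script is easier to rerun if it saves its progress to storage that outlives the run. The following is a minimal sketch of that checkpoint-and-resume pattern, not Aladdin's own mechanism: it uses only the Python standard library, and the function names, the JSON state layout, and the checkpoint path are all hypothetical. In a Workshop the path would presumably point at a mounted shared storage volume.

```python
import json
import os
import tempfile
from pathlib import Path
from typing import Optional


def save_checkpoint(path: Path, state: dict) -> None:
    """Write the state via a temp file so an interrupted run never leaves a partial checkpoint."""
    path.parent.mkdir(parents=True, exist_ok=True)
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
    os.replace(tmp_name, path)  # replaces the old checkpoint in one step


def load_checkpoint(path: Path) -> dict:
    """Return the saved state, or a fresh state if no checkpoint exists yet."""
    if path.exists():
        with path.open() as f:
            return json.load(f)
    return {"step": 0, "loss": None}


def train(path: Path, total_steps: int, stop_after: Optional[int] = None) -> dict:
    """Toy loop that resumes from the last checkpoint and saves after every step.

    stop_after simulates an interruption after that many steps in the current run.
    """
    state = load_checkpoint(path)
    done_this_run = 0
    while state["step"] < total_steps:
        if stop_after is not None and done_this_run >= stop_after:
            break
        state["step"] += 1
        state["loss"] = 1.0 / state["step"]  # placeholder for a real training step
        save_checkpoint(path, state)
        done_this_run += 1
    return state


if __name__ == "__main__":
    # Hypothetical location; a temporary directory stands in for shared storage here.
    ckpt = Path(tempfile.mkdtemp()) / "checkpoint.json"
    print(train(ckpt, total_steps=10, stop_after=4))  # first run is interrupted
    print(train(ckpt, total_steps=10))                # second run picks up where it stopped
```

Saving after every step keeps the example short; a real training job would usually checkpoint at a coarser interval and store model and optimizer state rather than a small JSON record.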
Session Management

GPU Run, Run Shell, and GPU Debug each generate a session. Click the GPU Debug plugin icon in the Workshop to view the session list, from which you can open each session's logs and Terminal.

Image Saving

If you have installed development environments or dependency packages in a Workshop, you can save the Workshop as an image. Right-click the target Workshop in the Workshop list, select Save Environment, and enter an image name to complete the save. Once the image is saved successfully, you can optionally replace the Workshop's image with the newly saved one.

IDE Version Description