Worker Management
This workflow provides a high-level overview of managing workers, including running, monitoring, and stopping them.
Fundamentals
Worker-Model Relationship
Each worker operates for a single model, created and managed by the AI Provider.
One Worker, One Model: Workers cannot process multiple models, but a single model can be deployed on multiple workers simultaneously.
Credit-Based Payment
AI Providers pay for workers’ runtime using credits purchased in advance.
The worker continuously deducts credits while in operation.
Requirements to Start a Worker
The model must be active.
The provider’s account must have sufficient credits to sustain the worker.
Query Processing and Worker Runtime When an end user submits a query using a valid access token, the request is routed to a worker running the AI model associated with that token.
Workers execute encrypted queries.
AI Providers are billed based on the runtime of active workers, not the number of queries processed.
Performance Considerations: High query volumes on a single worker may impact response times - scale worker sessions accordingly for optimal performance.
Scaling Workers
Providers can start additional workers for the same model if performance drops due to high query volume.
There is no limit to the number of workers that can run for a model simultaneously.
Monitoring Worker Performance
Providers can monitor worker performance by viewing:
Average Query Time: The average processing time for a query on the model.
Current Query Time: The real-time processing time for queries on the model.
Stopping Workers
Workers can be stopped manually at any time.
If credits are depleted, workers are automatically stopped.
Worker Management Workflow
Future Functionality
Performance Notifications AI Providers will receive alerts when worker performance drops below a defined threshold.
Worker Usage Reports A detailed report of worker usage will help providers analyze and optimize their resource utilization.
Scheduler for Worker Management AI Providers will be able to define schedules to start and stop workers automatically, eliminating the need for manual operations. This functionality will help align worker usage with predictable query loads and save credits during low-demand periods.
Quick Links to How-To Pages
For detailed steps, refer to the following guides:
Last updated
Was this helpful?