Worker Management

This page gives a high-level overview of managing workers: starting, monitoring, and stopping them.

Fundamentals

Worker-Model Relationship
- Each worker operates for a single model, created and managed by the AI Provider.
- One Worker, One Model: Workers cannot process multiple models, but a single model can be deployed on multiple workers simultaneously.
Credit-Based Payment
- AI Providers pay for workers’ runtime using credits purchased in advance.
- Credits are deducted continuously while a worker is running.
Requirements to Start a Worker
- The model must be active.
- The provider’s account must have sufficient credits to sustain the worker.
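The two start requirements above can be expressed as a simple pre-flight check. This is a minimal sketch, not the platform's API: the `Model` class, the `start_blockers` function, and the `minimum_credits` threshold are all hypothetical names chosen for illustration, since the source does not state how "sufficient credits" is measured.

```python
from dataclasses import dataclass


@dataclass
class Model:
    """Hypothetical stand-in for a provider's model record."""
    name: str
    active: bool


def start_blockers(model: Model, credit_balance: float, minimum_credits: float) -> list[str]:
    """Return the reasons a worker cannot be started (empty list means OK).

    `minimum_credits` is an assumed threshold for "sufficient credits".
    """
    reasons = []
    if not model.active:
        reasons.append("model is not active")
    if credit_balance < minimum_credits:
        reasons.append("insufficient credits")
    return reasons
```

Returning a list of reasons rather than a boolean lets a caller report every unmet requirement at once.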
Query Processing and Worker Runtime
When an end user submits a query using a valid access token, the request is routed to a worker running the AI model associated with that token.
- Workers execute encrypted queries.
- AI Providers are billed based on the runtime of active workers, not the number of queries processed.
- Performance Considerations: High query volumes on a single worker may slow response times. Start additional workers for the model to maintain performance.
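To illustrate runtime-based billing, the sketch below compares two worker sessions with the same runtime but very different query counts. The `WorkerSession` class, the `session_cost` function, and the `credits_per_hour` rate are assumptions made for this example; the actual credit rate and billing granularity are not specified here.

```python
from dataclasses import dataclass


@dataclass
class WorkerSession:
    """Hypothetical record of one worker's run."""
    runtime_hours: float
    queries_processed: int


def session_cost(session: WorkerSession, credits_per_hour: float) -> float:
    """Cost depends on runtime only; queries_processed is deliberately unused."""
    return session.runtime_hours * credits_per_hour


# Same runtime, different load: both sessions cost the same.
idle = WorkerSession(runtime_hours=2.0, queries_processed=0)
busy = WorkerSession(runtime_hours=2.0, queries_processed=5000)
```

Under this model an idle worker costs as much as a busy one, which is why stopping unused workers saves credits.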
Scaling Workers
- Providers can start additional workers for the same model if performance drops due to high query volume.
- There is no limit to the number of workers that can run for a model simultaneously.
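A rough way to decide how many workers to run is to compare the observed query time against a target. The sketch below assumes, purely for illustration, that query time falls in proportion to the number of workers; real behavior depends on the model and load. The function name `workers_needed` and the target parameter are hypothetical.

```python
import math


def workers_needed(running_workers: int,
                   current_query_time_s: float,
                   target_query_time_s: float) -> int:
    """Estimate the worker count needed to bring query time back to target.

    Assumes query time scales inversely with worker count (a simplification).
    Never recommends fewer workers than are already running.
    """
    if running_workers < 1 or target_query_time_s <= 0:
        raise ValueError("need at least one worker and a positive target")
    if current_query_time_s <= target_query_time_s:
        return running_workers
    return math.ceil(running_workers * current_query_time_s / target_query_time_s)
```

For example, two workers averaging 3 seconds per query against a 1 second target would suggest running six workers under this assumption.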
Monitoring Worker Performance
- Providers can monitor worker performance by viewing:
  - Average Query Time: The average processing time for a query on the model.
  - Current Query Time: The real-time processing time for queries on the model.
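The difference between the two metrics can be sketched as a lifetime mean versus a mean over the most recent queries. How the platform actually computes "current" query time is not documented here, so the sliding window below is an assumption, and `QueryTimeTracker` is a hypothetical class.

```python
from collections import deque


class QueryTimeTracker:
    """Tracks a lifetime average and a recent-window average of query times."""

    def __init__(self, window: int = 20) -> None:
        self._recent = deque(maxlen=window)  # keeps only the last `window` samples
        self._total = 0.0
        self._count = 0

    def record(self, seconds: float) -> None:
        self._recent.append(seconds)
        self._total += seconds
        self._count += 1

    @property
    def average_query_time(self) -> float:
        """Mean over every recorded query (0.0 if none recorded)."""
        return self._total / self._count if self._count else 0.0

    @property
    def current_query_time(self) -> float:
        """Mean over the most recent window (assumed meaning of 'current')."""
        return sum(self._recent) / len(self._recent) if self._recent else 0.0
```

A current value well above the average suggests a recent slowdown, which is the signal for adding workers.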
Stopping Workers
- Workers can be stopped manually at any time.
- If credits are depleted, workers are automatically stopped.
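The automatic stop on credit depletion can be pictured as a loop that deducts credits per billing interval and halts once the balance cannot cover the next interval. This is a simulation under stated assumptions, not the platform's logic: the interval ("tick"), the per-worker rate, and the function name `run_until_depleted` are all hypothetical.

```python
def run_until_depleted(balance: float,
                       workers: int,
                       credits_per_worker_tick: float,
                       max_ticks: int) -> tuple[int, float]:
    """Simulate continuous deduction; return (ticks_run, remaining_balance).

    Workers stop as soon as the balance cannot pay for the next tick.
    """
    ticks_run = 0
    while ticks_run < max_ticks:
        cost = workers * credits_per_worker_tick
        if cost > balance:
            break  # credits depleted: workers are stopped automatically
        balance -= cost
        ticks_run += 1
    return ticks_run, balance
```

With a balance of 10 credits, two workers, and 1.5 credits per worker per tick, the simulation runs three ticks and stops with 1 credit left.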

Worker Management Workflow

Start Worker

Before starting a worker, verify that your account has sufficient credits and check that your model is properly activated in the system.

Monitor Performance

Regularly check query performance metrics (average and current query time). Add workers if needed to maintain optimal performance.

Stop Worker

Stop workers when they are no longer needed or when credit consumption needs to be managed.

Future Functionality

Performance Notifications
AI Providers will receive alerts when worker performance drops below a defined threshold.
Worker Usage Reports
A detailed report of worker usage will help providers analyze and optimize their resource utilization.
Scheduler for Worker Management
AI Providers will be able to define schedules to start and stop workers automatically, eliminating the need for manual operations. This functionality will help align worker usage with predictable query loads and save credits during low-demand periods.
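Since the scheduler is not yet available, nothing below reflects a real interface. As a purely illustrative sketch of the idea, a schedule could be modeled as daily time windows during which workers should run. The `Window` class and `should_run` function are hypothetical.

```python
from dataclasses import dataclass
from datetime import time


@dataclass(frozen=True)
class Window:
    """A daily time window; `stop` is exclusive."""
    start: time
    stop: time

    def contains(self, t: time) -> bool:
        if self.start <= self.stop:
            return self.start <= t < self.stop
        # Window wraps past midnight, e.g. 22:00 to 06:00.
        return t >= self.start or t < self.stop


def should_run(windows: list[Window], now: time) -> bool:
    """True if workers should be running at `now` under any window."""
    return any(w.contains(now) for w in windows)
```

A schedule of `[Window(time(9), time(17))]` would keep workers running during business hours and leave them stopped overnight, saving credits in low-demand periods.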

Quick Links to How-To Pages

For detailed steps, refer to the following guides:


