OpenAI Launches Foundry: A Platform for Running Machine Learning Models

February 23, 2023

OpenAI has announced the launch of Foundry, a new developer platform for running its newer machine learning models like GPT-3.5 on dedicated capacity. Foundry is designed for cutting-edge customers running larger workloads. According to screenshots of documentation published on Twitter by users with early access, Foundry allows inference at scale with full control over the model configuration and performance profile. The platform will offer service-level commitments for instance uptime and on-calendar engineering support.

Features of Foundry

Foundry offers the following features:

Static Allocation of Compute Capacity: Foundry delivers a “static allocation” of compute capacity dedicated to a single customer, hosted on Azure, OpenAI’s preferred public cloud platform.

Version Control: Foundry provides some level of version control, letting customers decide whether or not to upgrade to newer model releases, as well as “more robust” fine-tuning for OpenAI’s latest models.

Monitoring Capabilities: Users will be able to monitor specific instances with the same tools and dashboards that OpenAI uses to build and optimize models.

Rentals Based on Dedicated Compute Units: Instances are rented in dedicated compute units, with three-month or one-year commitments. Running an individual model instance will require a specific number of compute units.

Pricing: Instances won’t be cheap. Running a lightweight version of GPT-3.5 will cost $78,000 for a three-month commitment or $264,000 over a one-year commitment.
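To put the two quoted prices side by side, here is a rough back-of-the-envelope sketch in Python. It uses only the two figures reported above. The comparison against four consecutive three-month commitments assumes the quoted price would stay the same for each renewal, which the leaked documentation does not confirm. The variable and function names are illustrative.

```python
# Prices reported for a lightweight GPT-3.5 instance (USD).
THREE_MONTH_PRICE = 78_000   # three-month commitment
ONE_YEAR_PRICE = 264_000     # one-year commitment


def monthly_rate(total_price: int, months: int) -> float:
    """Return the effective cost per month for a commitment."""
    return total_price / months


quarterly_monthly = monthly_rate(THREE_MONTH_PRICE, 3)
annual_monthly = monthly_rate(ONE_YEAR_PRICE, 12)

# Assumption: a customer could renew the three-month commitment four times
# at the same price. The source does not state renewal terms.
four_quarters = 4 * THREE_MONTH_PRICE
savings = four_quarters - ONE_YEAR_PRICE
discount = savings / four_quarters

print(f"Three-month commitment: ${quarterly_monthly:,.0f} per month")
print(f"One-year commitment:    ${annual_monthly:,.0f} per month")
print(f"Saving over four quarters: ${savings:,} ({discount:.1%})")
```

Under that assumption, the one-year commitment works out to $22,000 per month versus $26,000 per month on the three-month plan, a saving of about 15 percent over a full year.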

Implications of Foundry

Foundry is aimed at customers running large machine learning workloads, giving them greater control, customization, and performance when running OpenAI's latest models. It also represents a significant move toward monetization for OpenAI, which has been under pressure to turn a profit after a multibillion-dollar investment from Microsoft. According to reports, the company expects to bring in $200 million in revenue in 2023. Compute costs have been a major hurdle for the company, and Foundry gives OpenAI a way to generate revenue by charging customers for dedicated compute units.

Possible GPT-4 Launch?

Eagle-eyed Twitter and Reddit users spotted that one of the text-generating models listed in the instance pricing chart has a 32k-token max context window. GPT-3.5, OpenAI’s latest text-generating model, has a 4k-token max context window, one-eighth the size, suggesting that this mysterious new model could be the long-awaited GPT-4, or a stepping stone toward it.

Conclusion

OpenAI's Foundry platform will let customers run its latest machine learning models on dedicated capacity with greater control and customization, while giving OpenAI a new source of revenue from dedicated compute units. With a possible GPT-4 launch on the horizon, OpenAI's technology is expected to keep playing a major role in the future of artificial intelligence.
