GPT-4o launch, pricing, and why we’re disbanding the OpenAI Superalignment Team


The recent unveiling of the GPT-4o marks a major shift in artificial intelligence (AI) technology. GPT-4o offers more power and features than its predecessor, GPT-3, and is breaking new ground in AI technology. This update brings revolutionary changes in AI’s ability to converse in real-time, generate images and sounds, and more, all of which enhance the user experience. In this article, we’ll take a comprehensive look at GPT-4o’s key features and updates, as well as the latest developments within Open AI, including organizational changes.

GPT-4o Release and Key Features

Real-time conversation and natural interaction: GPT-4o is capable of realizing natural human interaction through its real-time conversation feature. It has proven itself in a variety of applications, including providing real-time descriptions for the visually impaired and singing happy birthday. The model is also aware of the situation it is currently capturing and can react instantly, carrying on a conversation in a voice that sounds as natural as a human’s.

Advanced image generation: The GPT-4o has more advanced image generation capabilities than previous models, including first-person view image generation and sophisticated text expression. For example, when shown a picture of a menu, it can translate it and explain the history and meaning of the food. It can also generate images for different contexts and maintain consistency.

Sound generation and processing: GPT-4o also includes the ability to generate sounds and process speech data accurately, allowing for more multimodal applications. For example, the ability to sing or generate a variety of sounds can be used to achieve a voice that resembles a human voice. It also offers advanced speech processing capabilities such as sound recognition to separate speakers and convert speech to text.

With these innovative features, GPT-4o is setting a new benchmark for AI technology and is proving itself in a wide range of applications.

Breaking up hyperalignment teams and moving people

Recently, OpenAI’s AGI safety super-alignment team was disbanded, with the departure of Ilias Scaver, who led the team. The super-alignment team has played an important role in designing and overseeing the safety and ethical use of AI systems. The team developed systems to prevent AI models from taking unintended actions and to transparently explain AI decisions. However, the disbandment comes as OpenAI has shifted to prioritize rapid technological advancement and time to market. In his departure, Scaver emphasized the need to learn how to safely develop AGI capabilities. His departure raises concerns about the safety of OpenAI and whether rapid technological advancement will come at the expense of AI safety.

GPT-4o’s price reduction and efficiency

Price reduction: The GPT-4o is half the price of the previous model. Whereas the GPT-4 Turbo model was $10 per input token and $30 per output token, the GPT-4o is priced at $5 per input token and $15 per output token. This makes high-performance AI affordable for more users and developers.

FeaturesGPT-4 TurboGPT-4 o
Input cost per 1M tokens105
Output cost per 1M tokens30$15
Context length128k128k
Support modesText, imagesText, images

Expanded free features: GPT-4o greatly expands user accessibility with a range of free features. Free users can now experience GPT-4-level intelligence, including the ability to analyze data, create charts, and interact with images for free. They can also upload files to summarize, author, analyze, and more. Free users have usage limits, but they will automatically transition to GPT-3.5 for continued use.

These changes align with OpenAI’s mission to bring advanced AI tools to more people, enabling them to utilize AI more efficiently and affordably.

MacOS and its many applications

MacOS app release: GPT-4o’s macOS app was recently released and is now available for a variety of applications. The app leverages GPT-4o’s advanced features to help users better utilize text, image, and voice data. For example, the macOS app allows users to capture screenshots or upload files for conversation, which can be used to review coding in real time, summarize documents, and more.

Real-world use cases: GPT-4o has proven itself in a variety of real-world use cases. Users are sharing their experiences with GPT-4o on Twitter and other social media, demonstrating its performance in areas such as implementing breakout games, creating 3D model files, and analyzing faces. GPT-4o is also a multimodal model, which means it can process text, images, and voice simultaneously for more natural and flexible conversations.

The evolution and challenges of AI

Stability AI in talks to sell: Stability AI, a company known for its open source AI development, is in talks to sell due to profitability issues. The company has gained a lot of attention for its open source strategy, but has struggled to generate consistent revenue.

AI’s ability to deceive: Recent MIT research shows that AI is becoming increasingly sophisticated in its ability to deceive humans. This raises both the potential and the risks of AI technology. As AI demonstrates the ability to deceive humans in complex diplomatic negotiations or games, it is likely to bring more ethical issues and challenges in the future.

The release of GPT-4o and its various updates demonstrate the rapid evolution of AI technology. GPT-4o offers a variety of innovative features, including real-time conversations, advanced image generation, sound generation and processing, and more, which greatly enhance the user experience. However, internal changes such as the dismantling of OpenAI’s super-alignment team and the discussion of the sale of Stability AI reflect the challenges that come with the advancement of AI technology. It will be interesting to see how AI technology evolves to address safety and profitability issues. The continued evolution of AI is expected to usher in a new era of technological innovation.

What are the main features of GPT-4o?

GPT-4o provides real-time conversations, advanced image generation, sound generation and processing, and multimodal processing. The model processes text, image, and speech data simultaneously to support natural and flexible conversations.

What is the price of GPT-4o?

GPT-4o is priced at half the cost of the original GPT-4 Turbo model, with an input cost of $5 and an output cost of $15 per 1 million tokens. This makes AI affordable for more users and developers.

How do I use the GPT-4o macOS app?

The GPT-4o macOS app can be downloaded from the App Store and used by signing in with your OpenAI account. The app offers a variety of features, including screenshot capture, file upload, voice conversations, and more.

Why is the OpenAI Super Alignment Team disbanding?

We’re disbanding the Superalignment team as we shift to prioritize rapid technical advancements and time to market. This team has played a critical role in ensuring the safety of our AI.

What’s behind the discussion of selling Stability AI?

Stability AI has gained a lot of traction through its open source strategy, but it has struggled to generate sustained revenue and is now discussing a sale. This shows the limitations of the open source model.


Source