OpenAI inference cost reduction cut ChatGPT guest traffic from tens of thousands of Nvidia GPUs to just a couple hundred, ...
OpenAI inference cost reduction cut ChatGPT guest traffic from tens of thousands of Nvidia GPUs to just a couple hundred, using software optimization alone. Engineers achieved more than 50% savings ...
OpenAI is moving away from models that require heavy hand-holding and toward systems that can better infer the user’s goal, ...
According to a media report, OpenAI engineers have found optimizations that reduce the cost of operating existing AI models ...
A few days ago, on November 26th, right before Thanksgiving, OpenAI, the maker of ChatGPT, confirmed a recent security breach incident that started towards the beginning of November, which impacted ...
OpenAI unveils Codex Micro, its first hardware product designed for developers. Here's what it is designed to do.
OpenAI has introduced GPT-5.4 mini and GPT-5.4 nano, two small, cost-efficient versions of its flagship AI model. In a press release, OpenAI said that 5.4 nano and mini are “our most capable small ...
Altudo, a digital experience consultancy, today announced that it is an OpenAI Services Partner. As part of this collaboration, Altudo uses OpenAI models to help enterprises move from disconnected AI ...
OpenAI is shuttering Sora, its stand-alone AI video generation app and social network, and the availability for developers to access the Sora 2 video model family through its application programming ...
Companies using OpenAI and Anthropic AI tools were overbilled by an estimated $1.7 million, according to Vaudit’s audit of ...
OpenAI API costs can spiral when agents run wild. Here's how to set spend limits, enable hard caps, and avoid surprise AI ...