GLOSSARY: The Cost of Using LLMs

пятница, 20 октября 2023 г.

The Cost of Using LLMs

Can't read or see images? View this email in a browser

This is one of the most common thoughts CXOs have when using LLMs today. The answer, however, depends on several factors, including the number of tokens processed, the specific API used, and prices, among other things. AIM noted that GPT-4 is 50 times more expensive than Llama 2, specifically for summarisation of the Wikipedia text into half its size.

https://media2.giphy.com/media/v1.Y2lkPTc5MGI3NjExdXJkejc1eHN6NGZ2eThucnB3bGF1eGh1bGxsZHRnZGV5ZnhybTdlYiZlcD12MV9pbnRlcm5hbF9naWZfYnlfaWQmY3Q9Zw/3o6MbtelsDZdsbFB7i/giphy.gif

Presently, there are a plethora of options to choose from. While OpenAI’s models are highly rated, we have observed that open-source models like Llama 2, Falcon 180-B, and Mistral 7B are catching up to GPT-4 in terms of performance and are slowly gaining traction.

Many industry leaders are advocating for domain-specific LLMs. However, selecting an optimal LLM amidst this vast array of open-source and proprietary model demands becomes challenging, where they have to meticulously evaluate, alongside balancing between cost and performance.

Wikipedia, for instance, has six million articles, each around 750 words long. That equals 1,000 tokens because three-fourths of a word is equal to one token, so it translates to 6 billion tokens in all. When we reduce the size of Wikipedia by half, we will be left with 3 billion tokens as output.

Here’s a quick glimpse of the cost variations for summarisation among different models revealing significant differences in pricing structures.

https://stratus.campaign-image.in/images/83238000184355028_1_1697800052367_1697799641131

Similarly, the cost for RAG, fine-tuning and API calling also changes for different models. Read to find out more.

Read the LLM Economics – A Guide to Generative AI Implementation Cost report from AIM Research for some in-depth analysis and cost estimation to leverage LLMs for enterprise use cases.

Check out the LLM Calculator from MachineHack here.

The OpenAI-G42 Partnership

While AI companies are busy competing with each other, OpenAI and G42 have formed a first-of-its-kind strategic alliance to work together to build cutting-edge products for the Middle East market.

https://stratus.campaign-image.in/images/83238000184355028_9_1697800053480_image-7.jpeg

This new development comes in the backdrop of G42 launching an Arabic language AI model Jais, which contains 13 billion parameters and combines Arabic and English data. The model was built in collaboration with academicians and engineers, partly from the scarcity of bilingual language models.

Interestingly, Jais was built on supercomputers produced by Cerebras Systems. The partnership with OpenAI will help the company achieve its multilingual model goals.

Read the full story here.

Indian IT & Freshers

Amidst rampant tech layoffs at top firms in the world, Indian IT majors are showing signs of disparity in hiring freshers as they are mostly focusing on training and upskilling their existing employees, alongside scaling work automation and digital initiatives.

Here’s a quick glimpse: