Apple Privacy Model and Confidential Computing
In its latest announcement, Apple has started to introduce more AI features to its ecosystem. One of the most interesting aspects was the concept of Private Cloud Compute.
Essentially, the iPhone will use a small and efficient LLM to process all incoming requests. This model is not very powerful (comparable to modern 7B models), but it is fast and will handle every request securely, right on the device.
It becomes particularly interesting when the LLM-controlled system recognizes that it needs more computing power to process the request.
In this case, it has two options: escalate the request to Apple's Private Cloud Compute, or, with the user's explicit consent, hand it off to an external model such as ChatGPT.
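To make the flow concrete, here is a minimal sketch of that routing decision in Python. Every name in it (the classes, the can_handle heuristic, the endpoints) is a hypothetical illustration of the idea, not Apple's actual API.

```python
# Sketch of the routing idea: a small on-device model handles a request
# and escalates only when it judges it needs more compute.
# All class and function names below are hypothetical illustrations.

from dataclasses import dataclass


@dataclass
class Answer:
    text: str
    handled_locally: bool


class OnDeviceLLM:
    """Stands in for the small (~7B-class) model running on the phone."""

    def can_handle(self, request: str) -> bool:
        # A real system would let the model (or a classifier) decide;
        # request length is only a toy placeholder for that decision.
        return len(request) < 200

    def answer(self, request: str) -> str:
        return f"[local answer to: {request!r}]"


class PrivateCloudCompute:
    """Stands in for the attested, encrypted remote endpoint."""

    def answer(self, request: str) -> str:
        # In the real system the request is encrypted and the server is
        # cryptographically attested before any data is sent.
        return f"[secure cloud answer to: {request!r}]"


def handle(request: str, local: OnDeviceLLM, cloud: PrivateCloudCompute) -> Answer:
    if local.can_handle(request):
        return Answer(local.answer(request), handled_locally=True)
    return Answer(cloud.answer(request), handled_locally=False)


if __name__ == "__main__":
    local, cloud = OnDeviceLLM(), PrivateCloudCompute()
    print(handle("What's on my calendar today?", local, cloud))
    print(handle("Summarize this 40-page contract and compare it to " * 20, local, cloud))
```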
What is Private Cloud Compute?
It is a protected Apple data center that uses Apple's own chips to host powerful large language models. The setup gives strong guarantees that your personal requests will be handled securely and that nobody, not even Apple, will ever see the questions and answers.
This is done through a combination of special hardware, encryption, secured VM images and mutual attestation between the software and hardware. Ultimately, Apple does its best to make breaking this setup very hard and expensive, even for Apple itself or for governments.
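As a rough illustration of the attestation idea, here is a sketch of the client-side check that would happen before any data is sent. The report format, the trusted-hash list and the helper functions are all assumptions made up for this sketch; the real PCC protocol is considerably more involved.

```python
# Toy sketch: only talk to a node whose hardware signature and software
# image measurement both check out, then encrypt the request to a key
# that exists only inside that attested environment.

import hashlib
from dataclasses import dataclass

# Published measurements of software images the client is willing to
# talk to (placeholder values, not real hashes).
TRUSTED_IMAGE_HASHES = {"sha256:2f0c..."}


@dataclass
class AttestationReport:
    image_hash: str      # measurement of the booted software image
    public_key: bytes    # key bound to this specific attested node
    signature: bytes     # signed by the hardware root of trust


def hardware_signature_valid(report: AttestationReport) -> bool:
    # Stand-in for verifying the chip vendor's certificate chain.
    return bool(report.signature)


def verify_attestation(report: AttestationReport) -> bool:
    """Accept the node only if hardware and software both check out."""
    return hardware_signature_valid(report) and report.image_hash in TRUSTED_IMAGE_HASHES


def encrypt_for(public_key: bytes, plaintext: bytes) -> bytes:
    # Stand-in for real public-key encryption; a deployment would use a
    # vetted scheme, not a hash.
    return hashlib.sha256(public_key + plaintext).digest()


def send_request(report: AttestationReport, prompt: str) -> bytes:
    if not verify_attestation(report):
        raise RuntimeError("node failed attestation; refusing to send data")
    return encrypt_for(report.public_key, prompt.encode())


if __name__ == "__main__":
    report = AttestationReport(
        image_hash="sha256:2f0c...",
        public_key=b"node-public-key",
        signature=b"hardware-signed",
    )
    print(send_request(report, "what's in my inbox today?").hex())
```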
Apple is all about consumer electronics. Is there anything comparable for companies?
Yes, it does exist. It's called confidential computing. The concept has been around for some time (see the Confidential Computing Consortium), but has only recently been properly applied to GPUs: Nvidia introduced it in the Hopper architecture (H100 GPUs) and almost completely eliminated the performance penalty in the Blackwell architecture.
The concept is the same as Apple's PCC (a toy sketch follows this list):
- data is encrypted in transit and at rest
- data is decrypted only while the computation is running, inside protected memory
- hardware and software are designed to make it impossible (or at least very hard and expensive) to look at the data while it is decrypted.
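Here is a toy sketch of that life cycle in Python. Fernet stands in for the hardware memory encryption, and the key exchange with the attested environment is hand-waved away; none of this corresponds to Nvidia's or a cloud provider's actual API.

```python
# Toy sketch of the confidential-computing life cycle: ciphertext at rest
# and in transit, plaintext only inside the protected environment.

from cryptography.fernet import Fernet

# Session key established with the attested enclave (the secure key
# exchange itself is out of scope for this sketch).
session_key = Fernet(Fernet.generate_key())


def client_encrypt(prompt: str) -> bytes:
    # Data is encrypted before it is stored or sent: at rest and in
    # transit it is only ever ciphertext.
    return session_key.encrypt(prompt.encode())


def enclave_run(encrypted_prompt: bytes) -> bytes:
    # Inside the trusted execution environment: the prompt is decrypted
    # only for the duration of the computation.
    prompt = session_key.decrypt(encrypted_prompt).decode()
    answer = f"[model output for: {prompt}]"  # stand-in for GPU inference
    return session_key.encrypt(answer.encode())  # re-encrypted before it leaves


def client_decrypt(encrypted_answer: bytes) -> str:
    return session_key.decrypt(encrypted_answer).decode()


if __name__ == "__main__":
    ciphertext = client_encrypt("summarize this confidential contract")
    result = enclave_run(ciphertext)  # plaintext exists only inside this call
    print(client_decrypt(result))
```

The point the sketch tries to capture is that plaintext only ever exists inside enclave_run; everywhere else the request and the answer remain ciphertext.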
Major cloud providers are already testing VMs with confidential GPU computation (e.g. Microsoft Azure with H100s since 2023, Google Cloud with H100s since 2024).
This approach is interesting because it offers a third option to companies that need to build a secure LLM-driven system: