LLMs such as ChatGPT fail even at simple logic tasks

ChatGPT-4's response to a logic question posed on 8/14/2024 illustrates the inadequacy of LLMs.

Even the best AI language models fail dramatically when it comes to logical questions. This is the conclusion reached by researchers from the Jülich Supercomputing Centre (JSC), the School of Electrical and Electronic Engineering at the University of Bristol, and the LAION AI laboratory. In their paper, "Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models", the researchers attest to a "severe breakdown in functional and reasoning ability" in the state-of-the-art LLMs tested and suggest that although language models possess the basic ability to draw conclusions, they cannot reliably access it. They call on the scientific and technological community to urgently reassess the claimed capabilities of the current generation of LLMs. Furthermore, they call for the development of standardized benchmarks to uncover weaknesses in language models' reasoning abilities, as current tests have apparently failed to detect this serious flaw. (jr)
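For context, the task family at the heart of the paper, the "AIW problem", is a one-sentence question of the form "Alice has N brothers and she also has M sisters. How many sisters does Alice's brother have?"; the correct answer is M + 1, since the brother's sisters are all of Alice's sisters plus Alice herself. A minimal sketch of a test harness in the spirit of the standardized benchmarks the authors call for might look like the following; the `ask_model` hook is a hypothetical stand-in for whatever LLM client one uses, and the answer parsing is deliberately naive.

```python
import random
import re


def aiw_prompt(n_brothers: int, n_sisters: int) -> str:
    """Build one variant of the AIW question described in the paper."""
    return (f"Alice has {n_brothers} brothers and she also has "
            f"{n_sisters} sisters. How many sisters does Alice's brother have?")


def ground_truth(n_sisters: int) -> int:
    # Alice's brother has all of Alice's sisters, plus Alice herself.
    return n_sisters + 1


def ask_model(prompt: str) -> str:
    """Hypothetical hook: replace with a real LLM API call that
    sends `prompt` and returns the model's reply as plain text."""
    raise NotImplementedError("plug in an LLM client here")


def extract_answer(text: str) -> int | None:
    """Naive parser: take the last integer in the model's reply."""
    numbers = re.findall(r"\d+", text)
    return int(numbers[-1]) if numbers else None


def run_benchmark(trials: int = 20, seed: int = 0) -> float:
    """Return the fraction of randomized AIW variants answered correctly."""
    rng = random.Random(seed)
    correct = 0
    for _ in range(trials):
        n, m = rng.randint(1, 6), rng.randint(1, 6)
        reply = ask_model(aiw_prompt(n, m))
        if extract_answer(reply) == ground_truth(m):
            correct += 1
    return correct / trials
```

Randomizing N and M across trials matters here: the paper reports that model accuracy fluctuates strongly even under slight variations of the same problem, which is exactly the kind of instability a fixed, single-instance test would miss.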

Link to the original article

Link to the preprint of the research paper