AI research

ChatGPT beats doctors at answering online medical questions, study finds

Summary According to a recent study, ChatGPT surpasses the quality and empathy of physicians when responding to online queries. However, there are some caveats. A recent study published in JAMA Internal Medicine reveals that ChatGPT outperforms physicians in terms of quality and empathy when responding to online queries. The study evaluated ChatGPT’s performance compared to …

Can AI models learn why giraffes are giraffes?

Summary Multimodal AI models can caption images and answer questions about them – but their answers don’t always make sense. Can they learn from humans? Vision-Language Models (VLMs) combine transformer-based language models with computer vision to caption images, answer questions about them, or, conversely, judge how well a description matches an image. There are …

Elon Musk’s ‘TruthGPT’ is complicated, says OpenAI co-founder

Summary Hallucinations are one of the central problems of large language models. An OpenAI co-founder explains why a TruthGPT is so difficult. Elon Musk’s X.AI wants to build “TruthGPT”, an honest language model – referring not only to classic cases of hallucination where systems like ChatGPT generate false outputs but also to reports that ChatGPT …

Why ChatGPT and other language AIs don’t know what they’re saying

Newsletter Will large language models ever understand words the way we do? A psychologist and a cognitive scientist investigate. When we asked GPT-3, an extremely powerful and popular artificial intelligence language system, whether you’d be more likely to use a paper map or a stone to fan life into coals for a barbecue, it preferred the …

Reddit ends its role as a free AI training data goldmine

Summary Reddit plays a central role in training large language models. Now the social network is looking to monetize its data. OpenAI’s GPT-3.5 or GPT-4, Meta’s LLaMA, or Google’s Bard – large language models are trained on Internet text, and a significant portion of the training data comes from Reddit threads. The fact that this …

Microsoft edges closer to zero-shot voice cloning

Summary Microsoft presents NaturalSpeech 2, a text-to-speech model that is based on diffusion models and is capable of cloning any voice with just a short snippet of audio. Microsoft Research Asia and Microsoft Azure Speech developed NaturalSpeech 2 using a diffusion model that interacts with a neural audio codec, which compresses waveforms into vectors. The …

Nvidia shows text-to-video for Stable Diffusion

Summary Nvidia turns Stable Diffusion into a text-to-video model, generates high-resolution video, and shows how the model can be personalized. Nvidia’s generative AI model is based on diffusion models and adds a temporal dimension that enables temporally aligned image synthesis over multiple frames. The team trains a video model to generate several minutes of video of …

Meta’s DINOv2 is a foundation model for computer vision

Summary Meta’s DINOv2 is a foundation model for computer vision. The company shows its strengths and wants to combine DINOv2 with large language models. In May 2021, AI researchers at Meta presented DINO (Self-Distillation with no labels), an AI model trained with self-supervision for image tasks such as classification or segmentation. With DINOv2, Meta is now …

An old AI architecture shows off some new tricks

Summary GigaGAN shows that Generative Adversarial Networks are far from obsolete and could be a faster alternative to Stable Diffusion in the future. Current generative AI models for images are diffusion models trained on large datasets that generate images based on text descriptions. They have replaced GANs (Generative Adversarial Networks), which were widely used in …

Instruct-NeRF2NeRF lets you edit NeRFs via text prompt

Summary Instruct-NeRF2NeRF uses methods of generative AI models and can edit 3D scenes according to text input. Earlier this year, researchers at the University of California Berkeley demonstrated InstructPix2Pix, a method that allows users to edit images in Stable Diffusion using text instructions. The method makes it possible to replace objects in images or change …
