Toggle light / dark theme

Researcher finds way to get audio from still images and silent videos

With video calls becoming more common in the age of remote and hybrid workplaces, “mute yourself” and “I think you’re muted” have become part of our everyday vocabularies. But it turns out muting yourself might not be as safe as you think.

Kevin Fu, a professor of electrical and and at Northeastern University, has figured out a way to get audio from pictures and even muted videos. Using Side Eye, a assisted tool that Fu and his research team created, Fu can determine the gender of someone speaking in the room where a photo was taken—and even the exact words they spoke.

“Imagine someone is doing a TikTok video and they mute it and dub music,” Fu says. “Have you ever been curious about what they’re really saying? Was it ‘Watermelon watermelon’ or ‘Here’s my password?’ Was somebody speaking behind them? You can actually pick up what is being spoken off camera.”

ChatGPT can now see, hear, and speak

We are beginning to roll out new voice and image capabilities in ChatGPT. They offer a new, more intuitive type of interface by allowing you to have a voice conversation or show ChatGPT what you’re talking about.

Voice and image give you more ways to use ChatGPT in your life. Snap a picture of a landmark while traveling and have a live conversation about what’s interesting about it. When you’re home, snap pictures of your fridge and pantry to figure out what’s for dinner (and ask follow up questions for a step by step recipe). After dinner, help your child with a math problem by taking a photo, circling the problem set, and having it share hints with both of you.

We’re rolling out voice and images in ChatGPT to Plus and Enterprise users over the next two weeks. Voice is coming on iOS and Android (opt-in in your settings) and images will be available on all platforms.

Robot car talk: Introducing Wayve’s new AI model LINGO-1

Someone has made it. LONDON—(BUSINESS WIRE)—Self-driving technology company Wayve has launched LINGO-1, a first-of-its-kind vision-language-action model (VLAM) for self-driving that is set to revolutionise the learning and explainability of its AI Driver technology powering self-driving vehicles.

Prior to LINGO-1, end-to-end AI neural nets have been criticised as black boxes, providing limited insight into why and how they make decisions. Gaining deeper insight into the decision-making and reasoning capabilities of its AI models is critical to ensuring that Wayve can build a safe driving intelligence for self-driving. Wayve’s LINGO-1 opens up new capabilities to dramatically enhance the interpretability of Wayve’s AI Driver.

Home


Wayve launches a breakthrough AI model, LINGO-1, that uses natural language to improve AI explainability for self-driving vehicles.

Three steps healthcare organizations can take to use generative AI responsibly

The healthcare lead at research and consulting giant Accenture lays out how to get proprietary data ready, establish the right controls and harmonize people with the tech.

Many healthcare organizations are onboarding generative AI fast and furious. Generative is the kind of AI behind the super popular ChatGPT application.

While it may seem like a miracle technology to many, it is by no means perfected. In fact, it even can have hallucinations (known to us humans as mistakes).

Mercedes-Benz Folds Next-Gen Luxury EV Platform Into Production With NVIDIA Omniverse and Generative AI

Mercedes-Benz is using digital twins for production with help from NVIDIA Omniverse, a platform for developing Universal Scene Description (OpenUSD) applications to design, collaborate, plan and operate manufacturing and assembly facilities.

Mercedes-Benz’s new production techniques will bring its next-generation vehicle portfolio into its manufacturing facilities operating in Rastatt, Germany; Kecskemét, Hungary; and Beijing, China — and offer a blueprint for its more than 30 factories worldwide. This “Digital First” approach enhances efficiency, avoids defects and saves time, marking a step-change in the flexibility, resilience and intelligence of the Mercedes-Benz MO360 production system.

The digital twin in production helps ensure Mercedes-Benz assembly lines can be retooled, configured and optimized in physically accurate simulations first. The new assembly lines in the Kecskemét plant will enable production of vehicles based on the newly launched Mercedes Modular Architecture that are developed virtually using digital twins in Omniverse.

Artificial Intelligence in Brain Tumour Surgery—An Emerging Paradigm

Artificial intelligence (AI) is the branch of computer science that enables machines to learn, reason, and problem solve. In recent decades, AI has been developed with the aim of improving the management of patients with brain tumours. This review article explores the role AI currently plays in managing patients undergoing brain tumour surgery, and explores how AI may impact this field in the future.

Artificial intelligence (AI) platforms have the potential to cause a paradigm shift in brain tumour surgery. Brain tumour surgery augmented with AI can result in safer and more effective treatment. In this review article, we explore the current and future role of AI in patients undergoing brain tumour surgery, including aiding diagnosis, optimising the surgical plan, providing support during the operation, and better predicting the prognosis. Finally, we discuss barriers to the successful clinical implementation, the ethical concerns, and we provide our perspective on how the field could be advanced.

Keywords: artificial intelligence, AI, neurosurgery, brain tumour, machine learning, deep learning, surgery, oncology.

/* */