/u/jferments

kyutai just introduced Pocket TTS: a 100M-parameter text-to-speech model with high-quality voice cloning that runs on your laptop—no GPU required

/u/jferments January 14, 2026 January 14, 2026

Blog post with demo: Pocket TTS: A high quality TTS that gives your CPU a voice: https://kyutai.org/blog/2026-01-13-pocket-tts GitHub: https://github.com/kyutai-labs/pocket-tts Hugging Face Model Card: https://huggingface.co/kyutai/pocket-tts arXiv:250…

artificial

zai-org/GLM-Image · Hugging Face

/u/jferments January 14, 2026 January 14, 2026

Z.ai (creators of GLM) have released an open weight image generation model that is showing benchmark performance competitive with leading models like Nano Banana 2. "GLM-Image is an image generation model adopts a hybrid autoregressive + dif…

artificial

Signal creator Moxie Marlinspike wants to do for AI what he did for messaging

/u/jferments January 13, 2026 January 13, 2026

"Moxie Marlinspike—the pseudonym of an engineer who set a new standard for private messaging with the creation of the Signal Messenger—is now aiming to revolutionize AI chatbots in a similar way. His latest brainchild is Confer, an open sour…

artificial

Terrence Tao: "Erdos problem #728 was solved more or less autonomously by AI"

/u/jferments January 10, 2026 January 10, 2026

"Recently, the application of AI tools to Erdos problems passed a milestone: an Erdos problem (#728) was solved more or less autonomously by AI (after some feedback from an initial attempt), in the spirit of the problem (as reconstructed by the E…

artificial

AI detects stomach cancer risk from upper endoscopic images in remote communities

/u/jferments January 8, 2026 January 8, 2026

Researchers at National Taiwan University Hospital and the Department of Computer Science & Information Engineering at National Taiwan University developed an AI system made up of several models working together to read stomach images. Trained usi…

artificial

Qwen-Image-2512 released on Huggingface!

/u/jferments December 31, 2025 December 31, 2025

Compared to the base Qwen-Image model released in August, Qwen-Image-2512 features the following key improvements: Enhanced Huamn Realism Qwen-Image-2512 significantly reduces the “AI-generated” look and substantially enhances overall image rea…

artificial

Tencent HY-Motion 1.0 – a billion-parameter text-to-motion model

/u/jferments December 30, 2025 December 30, 2025

submitted by /u/jferments [link] [comments]

artificial

Paper: "Universally Converging Representations of Matter Across Scientific Foundation Models"

/u/jferments December 28, 2025 December 28, 2025

"Machine learning models of vastly different modalities and architectures are being trained to predict the behavior of molecules, materials, and proteins. However, it remains unclear whether they learn similar internal representations of matter. U…

artificial

Microsoft’s TRELLIS 2-4B, An Open-Source Image-to-3D Model

/u/jferments December 17, 2025 December 17, 2025

"An open-source 4B-parameter image-to-3D model producing up to 1536³ PBR textured assets, built on native 3D VAEs with 16× spatial compression, delivering efficient, scalable, high-fidelity asset generation." submitted by …

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: