<span class="vcard">/u/nickpsecurity</span>
/u/nickpsecurity

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

https://arxiv.org/abs/2604.05091 Abstract: "We present MegaTrain, a memory-centric system that efficiently trains 100B+ parameter large language models at full precision on a single GPU. Unlike traditional GPU-centric systems, MegaTrain stores par…

Logic-oriented fuzzy neural networks: A survey

https://www.sciencedirect.com/science/article/pii/S0957417424019870 Abstract: "Data analysis and their thorough interpretation have posed a substantial challenge in the era of big data due to increasingly complex data structures and their sheer vo…

Explainability and Interpretability of Multilingual Large Language Models: A Survey

https://aclanthology.org/2025.emnlp-main.1033.pdf Abstract: "Multilingual large language models (MLLMs) demonstrate state-of-the-art capabilities across diverse cross-lingual and multilingual tasks. Their complex internal mechanisms, however, ofte…

H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs

https://arxiv.org/abs/2512.01797 Abstract: "Large language models (LLMs) frequently generate hallucinations — plausible but factually incorrect outputs — undermining their reliability. While prior work has examined hallucinations from macroscopi…

A comprehensive survey of deep learning for time series forecasting: architectural diversity and open challenges

https://link.springer.com/article/10.1007/s10462-025-11223-9 Abstract: "Time series forecasting is a critical task that provides key information for decision-making across various fields, such as economic planning, supply chain management, and med…

A Survey of Bayesian Network Structure Learning

https://arxiv.org/abs/2109.11415 Abstract: "Bayesian Networks (BNs) have become increasingly popular over the last few decades as a tool for reasoning under uncertainty in fields as diverse as medicine, biology, epidemiology, economics and the soc…