machine learning machine learning deployment Pretraining with Hierarchical Memories: Separating Long-Tail and Common Knowledge – Apple Machine Learning Research Google Inc. January 9, 2026 January 9, 2026 Pretraining with Hierarchical Memories: Separating Long-Tail and Common Knowledge Apple Machine Learning Research