SAMTok turns any object’s mask in an image into just two special “words” (tokens), so language models can handle pixel-level masks the way they handle text.
HeartMuLa is a family of open-source music AI models that can understand and generate full songs with clear lyrics and strong musical structure.