Apple's MM1: A multimodal large language model capable of interpreting both images and text data
A team of computer scientists and engineers at Apple has developed a large language model (LLM) that the company claims can interpret both images and text data. The group has posted a paper to the arXiv preprint server describing ...