Tencent’s upgraded LLM for text-to-image generation released on open source platforms

Chinese video gaming and social media giant Tencent Holdings launched an upgraded version of its large language model (LLM) with an open-source text-to-image generation function available to enterprises and individuals.

The eight-month-old Hunyuan large language foundation model developed by Tencent underwent a major upgrade earlier this year, which enhanced its overall performance by 20 per cent compared with the previous version, according to a statement posted on Tuesday on the official WeChat account of Tencent Cloud, the company’s cloud-computing services arm.

Tencent said the latest text-to-image function employs the DiT model architecture, which is also used by OpenAI’s text-to-video tool Sora. The company added that its primary database is in Chinese, enabling the tool to effectively and accurately understand Chinese-language commands.

The complete source code of its text-to-image LLM has been released on US open-source platforms Hugging Face and GitHub “to benefit the industry as a whole and build an open source ecosystem for next-generation vision generation”, according to the statement.

That means both individuals and enterprises can access the program’s code and modify or share its design, fix broken links, or scale up its capabilities.

Since launching Hunyuan last September, Tencent has integrated its LLM into the company’s various business units, including Tencent Cloud, Tencent Games and super app WeChat. The company said the AI-powered tool has also been provided to over 20 media outlets and advertising firms to facilitate their work.

The launch of the upgraded version came a day after Microsoft-backed OpenAI unveiled its newest GPT model, GPT-4o, which is capable of natural human-computer interaction across text, image, video, and audio.

Open-source technologies have played an important role in facilitating China’s ability to improve its LLMs and catch up with OpenAI’s innovative generative AI tools.

Alibaba Group Holding, owner of the South China Morning Post, has also taken an aggressive move to give third-party developers access to its models after the e-commerce giant launched its self-developed Tongyi Qianwen, or Qwen, LLM last year.

Alibaba Cloud, the company’s cloud-computing unit, has provided access to 76 Qwen text generation models on ModelScope and Hugging Face.

These include the 72-billion-parameter and 1.8-billion-parameter versions of its LLM. The company has also made freely available another model that understands audio.

Both Tencent and Alibaba reported better-than-expected profits in the first quarter of 2024.

Shenzhen-based Tencent reported a 62% jump in profit to 41.9bil yuan (RM27.19bil or US$5.8bil) in the first quarter, fuelled by strong advertising revenue, marking its first quarterly profit growth since last June.

Alibaba reported a 10% increase in profit to 79.7bil yuan (RM51.73bil) in the financial year through to the end of March, marking its most profitable year since 2021. – South China Morning Post
