A comprehensive white paper from the GenAINet Initiative introduces Large Telecom Models (LTMs) as a novel framework for integrating AI into telecommunications infrastructure, providing a detailed roadmap for innovation while addressing critical challenges in scalability, hardware requirements, and regulatory compliance through insights from a diverse coalition of academic, industry and regulatory experts.
View blogResearchers from China Unicom developed CHiSafetyBench, a hierarchical safety benchmark for Chinese Large Language Models, which features a culturally relevant taxonomy, multi-turn conversational scenarios, and an LLM-based automatic evaluation method. Evaluations on mainstream Chinese LLMs showed varying safety capabilities across models and significant performance drops in multi-turn risky dialogues.
View blogThis paper from Unicom Data Intelligence and China Unicom conducted the first comprehensive safety evaluation of DeepSeek-R1 and DeepSeek-V3 models in Chinese contexts using CHiSafetyBench. The study revealed that DeepSeek models exhibit weaker performance in identifying risky content and refusing harmful queries compared to other Chinese LLMs, particularly in the "discrimination" category.
View blog