Close Menu
  • Home
  • Startups
  • Channeliam Fact Check
  • Middle East
    • Exploring UAE
    • Emerging saudi
  • Shepreneur
    • Top Women Startups
  • Remembering Ratan Tata
  • More
    • Jobs and Internships
    • Funding
    • Entrepreneur
    • Technology
      • Auto
      • Gadgets
    • Updates
    • MSME
    • Movies
    • Travel
    • Events
    • Featured
    • Editor’s Pick
    • Discover and Recover
CHANGE LANGUAGE
What's Hot

DXB to Close as Dubai Transitions to DWC

9 May 2025

 Mona Patel’s Fashion Moment at Met Gala

9 May 2025

Adani and DGPC Partner for Hydropower Projects in Bhutan

9 May 2025
Facebook X (Twitter) Instagram
  • About Us
  • I am Startup Studio
  • I am an Entrepreneur
  • She Power
  • I AM NOW AI
YouTube Facebook Instagram LinkedIn X (Twitter)
Channeliam / Channel I'M EnglishChanneliam / Channel I'M English
  • Home
  • Startups
  • Channeliam Fact Check
  • Middle East
    • Exploring UAE
    • Emerging saudi
  • Shepreneur
    • Top Women Startups
  • Remembering Ratan Tata
  • More
    • Jobs and Internships
    • Funding
    • Entrepreneur
    • Technology
      • Auto
      • Gadgets
    • Updates
    • MSME
    • Movies
    • Travel
    • Events
    • Featured
    • Editor’s Pick
    • Discover and Recover
Change Language
Channeliam / Channel I'M EnglishChanneliam / Channel I'M English
Change Language
Home » ChatGPT-Alike for SouthEast Asia
Technology

ChatGPT-Alike for SouthEast Asia

According to Teo, more than 13% of SEA-LION's data originates from Southeast Asian languages
News DeskBy News Desk10 February 2024No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Email Telegram WhatsApp
Share
Facebook Twitter LinkedIn Pinterest Email Telegram WhatsApp

Many people across Southeast Asia have been exploring the use of large language models like Meta’s Llama 2 and Mistral AI, but they often do so in their native languages such as Bahasa Indonesia or Thai. However, the resulting output in English is typically nonsensical, which puts them at a disadvantage, according to tech experts. This comes as generative artificial intelligence is rapidly changing various aspects of life, including education, work, and governance worldwide. To address this issue, a Singapore government-led initiative has introduced SEA-LION (Southeast Asian Languages in One Network), the region’s first large language model trained on data from 11 Southeast Asian languages including Vietnamese, Thai, and Bahasa Indonesia. This open-sourced model is designed to be more cost-effective and efficient for businesses, governments, and academia in the region, according to Leslie Teo from AI Singapore.

“Do we want to force every person in Southeast Asia to adapt to the machine, or do we want to make it more accessible so people in the region can make full use of the technology without having to be an English speaker?” he said.

“We are not trying to compete with the big LLMs; we are trying to complement them, so there can be better representation of us,” said Teo, senior director for AI products.

Despite there being over 7,000 languages spoken worldwide, large language models (LLMs) like OpenAI’s GPT-4 and Meta’s Llama 2, which are utilized to develop AI systems such as chatbots and other tools, have predominantly been created for and trained on the English language.

Governments and technology firms are actively working to address this disparity. For instance, India is creating datasets in local languages, the United Arab Emirates has developed an LLM powering generative AI tools in Arabic, and there are AI models in China, Japan, and Vietnam that operate in local languages.

According to Nuurrianti Jalli, an assistant professor at Oklahoma State University’s School of Communications, these models can play a crucial role in enabling local populations to more fairly participate in the global AI economy, which is currently largely dominated by big tech firms.

Researchers state that multilingual language models, which are trained on text from multiple languages simultaneously, have the capability to infer semantic and grammatical connections between high-resource languages with ample data and low-resource languages.

These models find applications across various fields, including translation, customer-service chatbots, and content moderation on social media platforms. They particularly aid in identifying hate speech in low-resource languages like Burmese or Amharic, where such detection has been challenging.

According to Teo, more than 13% of SEA-LION’s data originates from Southeast Asian languages, a higher proportion compared to other major multilingual language models. Additionally, over 9% of its data is sourced from Chinese text, while approximately 63% comes from English.

Given that multilingual language models often train on translated text and other data of varying quality, AI Singapore exercises caution in selecting the data used to train SEA-LION, as mentioned by Teo during an interview at the National University of Singapore.

automobiles banner business Channel I Am electric vehicles India latest updates startups technology TOP STORIES
Share. Facebook Twitter Pinterest LinkedIn Email Telegram WhatsApp
News Desk
  • Website

Related Posts

DXB to Close as Dubai Transitions to DWC

9 May 2025

 Mona Patel’s Fashion Moment at Met Gala

9 May 2025

Adani and DGPC Partner for Hydropower Projects in Bhutan

9 May 2025

PSL shifted to UAE over safety fears

9 May 2025
Add A Comment

Comments are closed.

Careers
Recent Posts
  • DXB to Close as Dubai Transitions to DWC
  •  Mona Patel’s Fashion Moment at Met Gala
  • Adani and DGPC Partner for Hydropower Projects in Bhutan
  • PSL shifted to UAE over safety fears
  •  Porter secures $200 million funding
Editors Picks

DXB to Close as Dubai Transitions to DWC

9 May 2025

 Mona Patel’s Fashion Moment at Met Gala

9 May 2025

Adani and DGPC Partner for Hydropower Projects in Bhutan

9 May 2025

PSL shifted to UAE over safety fears

9 May 2025
About Us
About Us

The first exclusive digital video media platform for startups and future business leaders, Channel I’M, the brainchild of Mrs. Nisha Krishan, unveils the first glimpse of how Indian startups think/create/market futuristic products and services.

Subscribe to Updates

Get the latest creative news about entrepreneurs, startups, and businesses.

Updates
  • DXB to Close as Dubai Transitions to DWC
  •  Mona Patel’s Fashion Moment at Met Gala
  • Adani and DGPC Partner for Hydropower Projects in Bhutan
  • PSL shifted to UAE over safety fears
  •  Porter secures $200 million funding
YouTube Facebook X (Twitter) Instagram Pinterest LinkedIn RSS
  • Home
  • About Us
  • Promotions
  • Careers
  • Contact Us
© 2025 Likes and Shares Pvt Ltd. Powered By arbaneo

Type above and press Enter to search. Press Esc to cancel.

Change Language
Malayalam
Hindi
Tamil
Change Language
Malayalam
Hindi
Tamil