The Science and Information (SAI) Organization
  • Home
  • About Us
  • Journals
  • Conferences
  • Contact Us

Publication Links

  • IJACSA
  • Author Guidelines
  • Publication Policies
  • Outstanding Reviewers

IJACSA

  • About the Journal
  • Call for Papers
  • Editorial Board
  • Author Guidelines
  • Submit your Paper
  • Current Issue
  • Archives
  • Indexing
  • Fees/ APC
  • Reviewers
  • Apply as a Reviewer

IJARAI

  • About the Journal
  • Archives
  • Indexing & Archiving

Special Issues

  • Home
  • Archives
  • Proposals
  • ICONS_BA 2025

Computer Vision Conference (CVC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Computing Conference

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Intelligent Systems Conference (IntelliSys)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact

Future Technologies Conference (FTC)

  • Home
  • Call for Papers
  • Submit your Paper/Poster
  • Register
  • Venue
  • Contact
  • Home
  • Call for Papers
  • Editorial Board
  • Guidelines
  • Submit
  • Current Issue
  • Archives
  • Indexing
  • Fees
  • Reviewers
  • RSS Feed

DOI: 10.14569/IJACSA.2025.0161160
PDF

HeritageLM: Culturally-Aware Multimodal Language Modeling with Memory-Enhanced Cross-Dialect Adaptation

Author 1: Pasupuleti Venkata Ramana
Author 2: K. K. Sunalini
Author 3: A. Swathi
Author 4: R. Lakshmi
Author 5: Raman Kumar
Author 6: Elangovan Muniyandy
Author 7: Khaled Bedair

International Journal of Advanced Computer Science and Applications(IJACSA), Volume 16 Issue 11, 2025.

  • Abstract and Keywords
  • How to Cite this Article
  • {} BibTeX Source

Abstract: The HeritageLM system solves the acute problem of language loss by proposing a multimodal language model that brings into Generative Memory the cultural context. Manual documentation and linguistic archiving are the traditional ways of preserving dialects, which may not be effective in preserving the phonetic variety and other cultural peculiarities of the face of an endangered dialect. The current NLP models, such as BERT and GPT, are not effective to produce dialectal content because they do not have exposure to under-resourced and historically rich language varieties. These shortcomings are mitigated by training Cultural Contextual Embeddings (CCE), Generative Memory Augmentation (GMA), and Cross-Dialect Contrastive Transfer Learning (CDCP) using reinforcement learning with Cultural Rewards (RLCR). It is a step-by-step process that builds a Multimodal Cultural Knowledge Graph (MCKG), matches dialect embeddings in contrastive learning, and retrieves culturally relevant information in the generation process. The model was trained on the Indian Languages Audio Dataset of Kaggle, which also included phonetic variations of ten languages with preprocessing steps of text-to-speech analysis, phonetic annotation, and semantic tagging. HeritageLM, which was implemented in Python, scored above 98 in its BLEU, ROUGE-L, phonetic accuracy, and cultural embedding, showing that it can effectively generate linguistically accurate, phonetically accurate, and culturally authentic results. These outcomes are a major step towards the resurrection of dying dialects and maintaining their distinct cultural background.

Keywords: Cultural embeddings; Generative Memory; dialect revitalization; contrastive learning; multimodal NLP

Pasupuleti Venkata Ramana, K. K. Sunalini, A. Swathi, R. Lakshmi, Raman Kumar, Elangovan Muniyandy and Khaled Bedair. “HeritageLM: Culturally-Aware Multimodal Language Modeling with Memory-Enhanced Cross-Dialect Adaptation”. International Journal of Advanced Computer Science and Applications (IJACSA) 16.11 (2025). http://dx.doi.org/10.14569/IJACSA.2025.0161160

@article{Ramana2025,
title = {HeritageLM: Culturally-Aware Multimodal Language Modeling with Memory-Enhanced Cross-Dialect Adaptation},
journal = {International Journal of Advanced Computer Science and Applications},
doi = {10.14569/IJACSA.2025.0161160},
url = {http://dx.doi.org/10.14569/IJACSA.2025.0161160},
year = {2025},
publisher = {The Science and Information Organization},
volume = {16},
number = {11},
author = {Pasupuleti Venkata Ramana and K. K. Sunalini and A. Swathi and R. Lakshmi and Raman Kumar and Elangovan Muniyandy and Khaled Bedair}
}



Copyright Statement: This is an open access article licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, even commercially as long as the original work is properly cited.

IJACSA

Upcoming Conferences

Computer Vision Conference (CVC) 2026

21-22 May 2026

  • Amsterdam, The Netherlands

Computing Conference 2026

9-10 July 2026

  • London, United Kingdom

Artificial Intelligence Conference 2026

3-4 September 2026

  • Amsterdam, The Netherlands

Future Technologies Conference (FTC) 2026

15-16 October 2026

  • Berlin, Germany
The Science and Information (SAI) Organization
BACK TO TOP

Computer Science Journal

  • About the Journal
  • Call for Papers
  • Submit Paper
  • Indexing

Our Conferences

  • Computer Vision Conference
  • Computing Conference
  • Intelligent Systems Conference
  • Future Technologies Conference

Help & Support

  • Contact Us
  • About Us
  • Terms and Conditions
  • Privacy Policy

The Science and Information (SAI) Organization Limited is a company registered in England and Wales under Company Number 8933205.