Language Choice in AI Voice Assistants and Its Effect on User Trust in India

Published Paper PDF: Download PDF

DOI: https://doi.org/10.63345/ijrsml.v13.i8.6

Dr. Tushar Mehrotra
DCSE, Galgotias University

Greater Noida, UP, India
tushar.mehrotra@galgotiasuniversity.edu.in

Abstract

This manuscript examines how the choice of language in AI voice assistants affects user trust among Indian consumers, offering both breadth and depth through a mixed‑methods design. In Phase 1, we conducted a large‑scale online survey (N = 500) stratified across six language conditions—English, Hindi, Tamil, Bengali, Marathi, and Telugu—to quantify trust dimensions (competence, reliability, benevolence) using a validated 5‑point Likert scale. Phase 2 comprised six focus groups (8 participants each), conducted in participants’ primary languages, to elicit nuanced perspectives on pronunciation fidelity, cultural resonance, and emotional rapport. Quantitative results demonstrate that regionally localized voice assistants yield significantly higher trust scores (M ≈ 4.10) than both English‑only (M ≈ 3.65) and Hindi‑only (M ≈ 3.78) interfaces (p < 0.001). Within regional languages, Tamil and Bengali achieved the highest mean trust (M = 4.21 and M = 4.15, respectively). Regression analyses controlling for age, education, and prior VA experience confirm language choice as a robust predictor of trust (β = 0.32–0.45, p < 0.001).

Qualitative themes reveal three primary trust drivers: (1) Comprehension Accuracy—users value precise recognition of regional phonetics and idiomatic expressions, reporting frustration and mistrust when errors occur; (2) Cultural Resonance—tailored dialogue that incorporates local metaphors, greetings for regional festivals, and appropriate honorifics fosters familiarity and perceived benevolence; and (3) Emotional Rapport—native‑language voices are perceived as warmer and more empathetic, strengthening affective trust. A joint display of quantitative and qualitative findings illustrates how cognitive assurance (accuracy) and affective bonding (cultural warmth) synergistically enhance overall trust.

These insights underscore the critical role of comprehensive localization—spanning speech recognition models, NLU components, and dialogue content design—in multilingual societies. For AI developers and policymakers, our study provides actionable guidelines: invest in region‑specific speech corpora, engage local voice talent, and embed culturally meaningful content. By doing so, voice assistants can not only bridge linguistic divides but also cultivate deeper user confidence and drive inclusive technology adoption in India’s diverse landscape.

Keywords

AI voice assistants; language choice; user trust; India; localization

References

https://www.researchgate.net/publication/382738268/figure/fig2/AS:11431281272505448@1724120859349/Flow-chart-of-proposed-Trust-model.png
https://www.researchgate.net/publication/380136662/figure/fig5/AS:11431281261863127@1721648577226/Flowchart-for-the-localization-process.tif
Davis, F. D. (1989). Perceived usefulness, perceived ease of use, and user acceptance of information technology. MIS Quarterly, 13(3), 319–340. https://doi.org/10.2307/249008
Esselink, B. (2000). A practical guide to localization (2nd ed.). John Benjamins.
Gkatzia, D., Messenger, B., & Lemon, O. (2017). Beyond QA: Generating explanatory answers for complex questions. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 848–859. https://doi.org/10.18653/v1/P17-1079
Jain, M., Kumar, P., & Bedi, P. (2020). Designing Hindi language interfaces for voice‑enabled applications: Challenges and solutions. International Journal of Human–Computer Studies, 137, 102380. https://doi.org/10.1016/j.ijhcs.2019.102380
Lee, J. D., & See, K. A. (2004). Trust in automation: Designing for appropriate reliance. Human Factors, 46(1), 50–80. https://doi.org/10.1518/hfes.46.1.50_30392
Liao, Q. V., Davis, M., & Mynatt, E. D. (2020). Culturally adaptive voice assistants: A roadmap. ACM Transactions on Computer–Human Interaction, 27(4), Article 25. https://doi.org/10.1145/3418208
Lankton, N., McKnight, D. H., & Tripp, J. (2015). Technology, humanness, and trust: Rethinking trust in technology. Journal of the Association for Information Systems, 16(10), 880–918. https://doi.org/10.17705/1jais.00418
McKnight, D. H., Choudhury, V., & Kacmar, C. (2002). Developing and validating trust measures for e‑commerce: An integrative typology. Information Systems Research, 13(3), 334–359. https://doi.org/10.1287/isre.13.3.334.81
Nasri, W., & Charfeddine, L. (2012). Customer adoption of internet banking in Tunisia: The influence of perceived credibility and innovativeness. Journal of Retailing and Consumer Services, 19(5), 558–565. https://doi.org/10.1016/j.jretconser.2012.06.002
Oliveira, T. (2013). Mobile banking: A bibliometric analysis and literature review. Scientometrics, 98(1), 366–482. https://doi.org/10.1007/s11192-013-1014-1
Porcheron, M., Fischer, J. E., Reeves, S., & Sharples, S. (2018). Voice interfaces in everyday life. In A. Clark (Ed.), Handbook of Human–Computer Interaction (pp. 231–241). MIT Press.
Ranganathan, A., & Shah, V. (2019). Voice assistants in Indian languages: User perceptions and adoption barriers. Proceedings of the 21st International Conference on Human–Computer Interaction, 290–302. https://doi.org/10.1007/978-3-030-22634-1_22
Singh, P., & Sengupta, S. (2021). Accent adaptation in speech recognition for Indian regional languages. Speech Communication, 126, 1–11. https://doi.org/10.1016/j.specom.2020.10.005
Venkatesh, V., & Bala, H. (2008). Technology acceptance model 3 and a research agenda on interventions. Decision Sciences, 39(2), 273–315. https://doi.org/10.1111/j.1540-5915.2008.00192.x
Wang, Y., Yu, C., & Fesenmaier, D. R. (2020). Cultural influence on voice assistant use: A cross‑national study. Tourism Management, 77, 104016. https://doi.org/10.1016/j.tourman.2019.104016
Waseem, S., & de Melo, G. (2021). Investigating trust formation in conversational AI: Effects of language and genre. Journal of Artificial Intelligence Research, 70, 983–1015. https://doi.org/10.1613/jair.1.12152
Xu, B., & Du, J. (2020). Multilingual speech recognition: Techniques and challenges. IEEE Transactions on Audio, Speech, and Language Processing, 28, 1738–1750. https://doi.org/10.1109/TASLP.2020.2986048
Yadaw, S. R., & Bhardwaj, A. (2022). Understanding rural user trust in voice assistants: A field study in India. Proceedings of the ACM on Human–Computer Interaction, 6(CSCW2), Article 137. https://doi.org/10.1145/3555140
Yang, Q., Steinfeld, A., & Zimmerman, J. (2019). Unremarkable AI: Fitting intelligent decision support into critical, clinical decision‑making processes. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 3(3), Article 107. https://doi.org/10.1145/3351268