专栏首页arxiv.org翻译专栏使用梵文语法改善具有数字词源的多语言国家的电子治理和移动治理(CS.CY)
原创

使用梵文语法改善具有数字词源的多语言国家的电子治理和移动治理(CS.CY)

随着数字连接(Wifi,3G,4G)和数字设备的巨大改进,如今已经可以在最偏远的角落访问互联网。农村居民可以轻松地通过PDA,笔记本电脑,智能手机等访问Web或应用程序。这是政府的一个机会,可以在不部署大量人力,物力的情况下,与众多公民接触,获取反馈,将其与电子政务联系起来。或资源。但是,由于农村人口倾向于并倾向于使用母语进行互动,因此多语言国家政府在成功实施政府对公民(G2C)和公民对政府(C2G)治理方面面临许多问题。通过网络或应用程序向不同语言的演讲者群体提供平等的体验是一个真正的挑战。在这项研究中,我们理清了讲印支雅利安语的网民所面临的问题,这些问题通常也适用于任何语言族群或亚群。然后,我们尝试使用词源学给出可能的解决方案。词源用于使用词的词根形式将词相关联。公元前5世纪,帕尼尼(Panini)写了阿斯塔德(Astadhyayi),他在那儿描写经文或规则-单词如何根据人,时态,性别,数字等而变化。后来,这本书在西方国家也得到了推广,以衍生出其较新语言的语法。我们已经训练了我们的系统,可以使用Panian语法规则从单词的表面级别或变形形式中自动提取词根。我们已经测试了超过10000个孟加拉动词的系统,并以98%的精度提取了根形式。现在,我们正在努力扩展程序,以成功地对任何语言的单词进行词素化,并通过在人工神经网络中应用这些规则集将它们相关联。

原文标题:Improvement of electronic Governance and mobile Governance in Multilingual Countries with Digital Etymology using Sanskrit Grammar

原文:With huge improvement of digital connectivity (Wifi,3G,4G) and digital devices access to internet has reached in the remotest corners now a days. Rural people can easily access web or apps from PDAs, laptops, smartphones etc. This is an opportunity of the Government to reach to the citizen in large number, get their feedback, associate them in policy decision with e governance without deploying huge man, material or resourses. But the Government of multilingual countries face a lot of problem in successful implementation of Government to Citizen (G2C) and Citizen to Government (C2G) governance as the rural people tend and prefer to interact in their native languages. Presenting equal experience over web or app to different language group of speakers is a real challenge. In this research we have sorted out the problems faced by Indo Aryan speaking netizens which is in general also applicable to any language family groups or subgroups. Then we have tried to give probable solutions using Etymology. Etymology is used to correlate the words using their ROOT forms. In 5th century BC Panini wrote Astadhyayi where he depicted sutras or rules -- how a word is changed according to person,tense,gender,number etc. Later this book was followed in Western countries also to derive their grammar of comparatively new languages. We have trained our system for automatic root extraction from the surface level or morphed form of words using Panian Gramatical rules. We have tested our system over 10000 bengali Verbs and extracted the root form with 98% accuracy. We are now working to extend the program to successfully lemmatize any words of any language and correlate them by applying those rule sets in Artificial Neural Network.

原文作者:Arijit Das, Diganta Saha

原文地址:https://arxiv.org/abs/2004.00104

原创声明,本文系作者授权云+社区发表,未经许可,不得转载。

如有侵权,请联系 yunjia_community@tencent.com 删除。

我来说两句

0 条评论
登录 后参与评论

相关文章

  • 从模拟到真实的转移以实现光学触觉(CS.RO)

    深度学习和强化学习方法已被证明可以实现灵活而复杂的机器人控制器的学习。但是,对大量训练数据的依赖经常要求在模拟中进行数据收集,近年来,人们开发了许多模拟到真实的...

    蔡小雪7100294
  • 具有学术论文链接的GitHub存储库:开放访问,可追溯性和演进(CS.SE)

    在已发布的科学突破及其实现之间的可追溯性至关重要,尤其是在开源软件将前沿科学实现到其代码中的情况下。但是,对齐GitHub存储库和学术论文之间的链接可能会很困难...

    蔡小雪7100294
  • 由人的密集姿势识别转移到邻近动物类识别(CS.CV)

    最新的研究表明,给定详细注释的大型姿势数据集,可以密集而准确地识别人的姿势。原则上,相同的方法可以扩展到任何动物类别,但是尽管在自然保护,科学和商业中有重要应用...

    蔡小雪7100294
  • Monolithic vs Microservice Architecture- Pros and Cons

    The hassle that large scale enterprise applications under development bring to t...

    用户4822892
  • 论文解读:主视觉大脑皮层的深度层级模型:机器视觉可以从中学到些什么?

    、论文:Deep Hierarchies in the Primate Visual Cortex: What Can We Learn for Compute...

    用户1908973
  • Watson Uses Cognitive Computing To Improve People's Lives

    IDC predicts that by 2018, half of all consumers will interact with services bas...

    首席架构师智库
  • 多处理器系统中具有多个临界段的实时任务的安排(CS OS)

    多处理器同步和锁定协议的性能是在实时约束下利用多处理器系统计算能力的关键因素。虽然在过去的几十年里已经开发了多种协议,但它们的性能在很大程度上取决于任务划分和优...

    邱邱邱
  • 测量异构信息网络的多样性(CS AI)

    多样性是一个与许多研究领域相关的概念,从生态学到信息论,再到经济学,举几个例子。这个概念在信息检索、网络分析和人工神经网络社区中得到了越来越多的关注。虽然在网络...

    用户6853689
  • The Rise of Cognitive Business

    When the original Watson won on the TV quiz show Jeopardy! in 2011, it was one c...

    首席架构师智库
  • 信息访问悖论:共享时代的日益孤立(CS CAS)

    Twitter,Instagram和YouTube等现代在线媒体使任何人都可以成为信息生产者,并提供在线内容供潜在的全球消费。通过增加全球可访问的实时信息的数量...

    时代在召唤

扫码关注云+社区

领取腾讯云代金券