scholarly journals Sintesis Suara Bernyanyi Dengan Teknologi Text-To-Speech untuk Notasi Musik Angka dan Lirik Lagu Berbahasa Indonesia

Author(s):  
Jonathan Jonathan ◽  
Yohanes Suyanto

Singing is a work of art that can not be separated from human life. It then makes a research about develop the art of singing by technology will brings a useful impact for such a wide aspect of human life. This research is trying to synthesize singing voice with TTS (text-to-speech) technology, as it capability to produce sound with certain pronunciation at certain frequency of sound. Inputs that used in the system are texts of song in TXT format that contain the information of numbered musical notation and lyrics in Indonesian. These inputs will converted to a phonetic transcription, for then synthesize of song voice can done based on the transcription. In general, the system made successfully synthesize song voices with some feature that based on the convention of numbered musical notation. Based on 30 people of respondents, the song voice synthesized has 81.71% of accuracy with 6.24% of deviation standard. The syntax of song text also reputed as a user-friendly convention with only up to 3 times re-compilation done to synthesize 8 bar of song text by each of respondents without any error.

Author(s):  
Shubham Jain ◽  
Yash Sharma ◽  
Nishi Sharma

Using the concept of Artificial Intelligence, a virtual assistant named "Gabriel" has been Developed to aid in education, market, business and many other fields.[4] The uniqueness of this bot is that it is programmed in python and is stored in raspberry pi providing the user friendly environment by moving along with user. It uses numerous python Libraries to help perform various functions that enable the Assistant to assist its user in day to Day activities. The Assistant can convert text to speech and vice versa using pyttsx3 and speech Recognition libraries respectively. Also, it can scrape the information from Wikipedia and visit any Website. It can be used to surf YouTube and visit any YouTube channel. The Assistant Comes with games such as flappy bird, tic-tac-toe and snake developed by the programmer. Also, the Assistant has an in-built calculator that too developed by the programmer. This Research is to develop an assistant who is highly compatible with human life. This employee comes with a self-made library called ‘the_gmail_sender’ that sends Gmail taking Voice input from the user


2012 ◽  
Vol 21 (2) ◽  
pp. 60-71 ◽  
Author(s):  
Ashley Alliano ◽  
Kimberly Herriger ◽  
Anthony D. Koutsoftas ◽  
Theresa E. Bartolotta

Abstract Using the iPad tablet for Augmentative and Alternative Communication (AAC) purposes can facilitate many communicative needs, is cost-effective, and is socially acceptable. Many individuals with communication difficulties can use iPad applications (apps) to augment communication, provide an alternative form of communication, or target receptive and expressive language goals. In this paper, we will review a collection of iPad apps that can be used to address a variety of receptive and expressive communication needs. Based on recommendations from Gosnell, Costello, and Shane (2011), we describe the features of 21 apps that can serve as a reference guide for speech-language pathologists. We systematically identified 21 apps that use symbols only, symbols and text-to-speech, and text-to-speech only. We provide descriptions of the purpose of each app, along with the following feature descriptions: speech settings, representation, display, feedback features, rate enhancement, access, motor competencies, and cost. In this review, we describe these apps and how individuals with complex communication needs can use them for a variety of communication purposes and to target a variety of treatment goals. We present information in a user-friendly table format that clinicians can use as a reference guide.


Micromachines ◽  
2021 ◽  
Vol 12 (6) ◽  
pp. 697
Author(s):  
Siming Lu ◽  
Sha Lin ◽  
Hongrui Zhang ◽  
Liguo Liang ◽  
Shien Shen

Respiratory viral infections threaten human life and inflict an enormous healthcare burden worldwide. Frequent monitoring of viral antibodies and viral load can effectively help to control the spread of the virus and make timely interventions. However, current methods for detecting viral load require dedicated personnel and are time-consuming. Additionally, COVID-19 detection is generally relied on an automated PCR analyzer, which is highly instrument-dependent and expensive. As such, emerging technologies in the development of respiratory viral load assays for point-of-care (POC) testing are urgently needed for viral screening. Recent advances in loop-mediated isothermal amplification (LAMP), biosensors, nanotechnology-based paper strips and microfluidics offer new strategies to develop a rapid, low-cost, and user-friendly respiratory viral monitoring platform. In this review, we summarized the traditional methods in respiratory virus detection and present the state-of-art technologies in the monitoring of respiratory virus at POC.


Author(s):  
Reeta Sharma ◽  
P. K. Bhattacharya ◽  
Shantanu Ganguly ◽  
Arun Kumar

Today's world is technology-driven. Technology has penetrated almost every sphere of human life. Digital marking is one of the technologies that have attracted people from different age groups all over the world with their advanced nature of applications and uses. One of the foremost reasons why patrons like to use this technology is because these are not only user-friendly in nature and innovativeness but also carry the knowledge economies. Marketing and branding through digital media channels are very decent ventures that have steadily increased in value and are thereby considered safe and secure investments. In this chapter, the authors discuss a case study of ICDL 2016 conference where social media and other technology is widely used to market this event and catch prospective users.


2020 ◽  
Vol 12 (3) ◽  
pp. 168781401988309
Author(s):  
Zijia Zhong ◽  
Joyoung Lee

Accessible pedestrian signal was proposed as a mean to achieve the same level of service that is set forth by the Americans with Disabilities Act for the visually impaired. One of the major issues of existing accessible pedestrian signals is the failure to deliver adequate crossing information for the visually impaired. This article presents a mobile-based accessible pedestrian signal application, namely, Virtual Guide Dog. Integrating intersection information and onboard sensors (e.g. GPS, compass, accelerometer, and gyroscope sensor) of modern smartphones, the Virtual Guide Dog application can notify the visually impaired: (1) the close proximity of an intersection and (2) the street information for crossing. By employing a screen tapping interface, Virtual Guide Dog can remotely place a pedestrian crossing call to the controller, without the need of using a pushbutton. In addition, Virtual Guide Dog informs VIs the start of a crossing phase using text-to-speech technology. The proof-of-concept test shows that Virtual Guide Dog keeps the users informed about the remaining distance as they are approaching the intersection. It was also found that the GPS-only mode is accompanied by greater distance deviation compared to the mode jointly operating with both GPS and cellular positioning.


2012 ◽  
Vol 433-440 ◽  
pp. 4883-4887
Author(s):  
Hong Li Yang ◽  
Yun Yang ◽  
Zhu Yue

TTS, namely text-to-speech, is a kind of technology who can convert text information into sound signal according to information Speech processing rules. TTS, as the synthetic technology of the pronunciation, is the key technology in the current development of computer technology, and one of the most forward technical in its voice service, telephone banking, and information home appliances, mobile PDA fields. TTS has its extensive applications. In this paper, TTS is applied to electronic speech reader, which changes traditional way to read e-book, and both listening to and novels and learning English. This article introduces a method about how to make use of TTS technology, and how to achieve an electronic Speech reader of programming based on Visual Studio C# 2008 environment bring API and Microsoft SAPI interface.


2020 ◽  
pp. 205-212
Author(s):  
Georgina Kleege

The author recounts her history as an aural reader and argues for her preference for the synthesized voices of text-to-speech technology over analogue recordings of human voices. Legally blind since the age of 11, she developed habits of good listening, which served to elevate her aural reading from the passive reception of oral language to a more active practice of aural discernment. Now, with the widespread popularity of audio books and the ubiquity of synthesized voice technologies in mainstream electronic devices, she perceives progress toward greater social inclusion for people who are blind and visually impaired.


Artificial Intelligence and Machine Learning are driving IT industry to new landscape. This system “The TalkBot” overcomes this problem and provides farmers the better opportunity to obtain the desired information and to scale up with upcoming market trends and technologies in a user friendly manner. TalkBot is actually a chatbot, which is a virtual conversational assistant, through which the users can communicate with the bot as if they are conversing with humans. The focus is on developing the bot in a more intellectual way, that it can even recognize not so well grammatically defined sentences, misspelled words, incomplete phrases, etc.,. This can help people to converse easily with the bot, since this system uses the Natural Language Processing technique to parse the user queries, identify the key words, match them with Knowledge Base and respond with the accurate results. To make the responses more understandable, the responses are generated using classification algorithms and produce non textual responses so that it can be easily perceived by the users. Bot also has an ability to provide voice oriented responses using text to speech techniques..


2011 ◽  
Vol 1 (1) ◽  
pp. 31-53
Author(s):  
Rubén San-Segundo ◽  
Carlos D. Martínez-Hinarejos ◽  
Alfonso Ortega

In the last two decades, there has been an important increase in research on speech technology in Spain, mainly due to a higher level of funding from European, Spanish and local institutions and also due to a growing interest in these technologies for developing new services and applications. This paper provides a review of the main areas of speech technology addressed by research groups in Spain, their main contributions in the recent years and the main focus of interest these days. This description is classified in five main areas: audio processing including speech, speaker characterization, speech and language processing, text to speech conversion and spoken language applications. This paper also introduces the Spanish Network of Speech Technologies (RTTH. Red Temática en Tecnologías del Habla) as the research network that includes almost all the researchers working in this area, presenting some figures, its objectives and its main activities developed in the last years.


Sign in / Sign up

Export Citation Format

Share Document