scholarly journals Optimization of Data Mining and Analysis System for Chinese Language Teaching Based on Convolutional Neural Network

2021 ◽  
Vol 2021 ◽  
pp. 1-8
Author(s):  
Xi Chen

Chinese language is also an important way to understand Chinese culture and an important carrier to inherit and carry forward Chinese traditional culture. Chinese language teaching is an important way to inherit and develop Chinese language. Therefore, in the era of big data, data mining and analysis of Chinese language teaching can effectively sum up experience and draw lessons, so as to improve the quality of Chinese language teaching and promote Chinese language culture. Text clustering technology can analyze and process the text information data and divide the text information data with the same characteristics into the same category. Based on big data, combined with convolutional neural network and K-means algorithm, this paper proposes a text clustering method based on convolutional neural network (CNN), constructs a Chinese language teaching data mining analysis system, and optimizes it so that the system can better mine Chinese character data in Chinese language teaching data in depth and comprehensively. The results show that the optimized k-means algorithm needs 683 iterations to achieve the target accuracy. The average K-measure value of the optimized system is 0.770, which is higher than that of the original system. The results also show that K-means algorithm can significantly improve the clustering effect, optimize the data mining analysis system of Chinese language teaching, and deeply mine the Chinese data in Chinese language teaching, so as to improve the quality of Chinese language teaching.

2019 ◽  
Vol 8 (3) ◽  
pp. 6634-6643 ◽  

Opinion mining and sentiment analysis are valuable to extract the useful subjective information out of text documents. Predicting the customer’s opinion on amazon products has several benefits like reducing customer churn, agent monitoring, handling multiple customers, tracking overall customer satisfaction, quick escalations, and upselling opportunities. However, performing sentiment analysis is a challenging task for the researchers in order to find the users sentiments from the large datasets, because of its unstructured nature, slangs, misspells and abbreviations. To address this problem, a new proposed system is developed in this research study. Here, the proposed system comprises of four major phases; data collection, pre-processing, key word extraction, and classification. Initially, the input data were collected from the dataset: amazon customer review. After collecting the data, preprocessing was carried-out for enhancing the quality of collected data. The pre-processing phase comprises of three systems; lemmatization, review spam detection, and removal of stop-words and URLs. Then, an effective topic modelling approach Latent Dirichlet Allocation (LDA) along with modified Possibilistic Fuzzy C-Means (PFCM) was applied to extract the keywords and also helps in identifying the concerned topics. The extracted keywords were classified into three forms (positive, negative and neutral) by applying an effective machine learning classifier: Convolutional Neural Network (CNN). The experimental outcome showed that the proposed system enhanced the accuracy in sentiment analysis up to 6-20% related to the existing systems.


2021 ◽  
Vol 11 (6) ◽  
pp. 2838
Author(s):  
Nikitha Johnsirani Venkatesan ◽  
Dong Ryeol Shin ◽  
Choon Sung Nam

In the pharmaceutical field, early detection of lung nodules is indispensable for increasing patient survival. We can enhance the quality of the medical images by intensifying the radiation dose. High radiation dose provokes cancer, which forces experts to use limited radiation. Using abrupt radiation generates noise in CT scans. We propose an optimal Convolutional Neural Network model in which Gaussian noise is removed for better classification and increased training accuracy. Experimental demonstration on the LUNA16 dataset of size 160 GB shows that our proposed method exhibit superior results. Classification accuracy, specificity, sensitivity, Precision, Recall, F1 measurement, and area under the ROC curve (AUC) of the model performance are taken as evaluation metrics. We conducted a performance comparison of our proposed model on numerous platforms, like Apache Spark, GPU, and CPU, to depreciate the training time without compromising the accuracy percentage. Our results show that Apache Spark, integrated with a deep learning framework, is suitable for parallel training computation with high accuracy.


2021 ◽  
Author(s):  
Feng He ◽  
Hongjiang Liu ◽  
Chunxue Liu ◽  
Guangjing Bao

Abstract To ensure the proper adoption of new technologies in identifying the potential geologic hazard on tourist routes, convolutional neural network (CNN) technology is applied in the radar image geologic hazard information extraction. A scientific and practical geologic hazard radar identification model is built, which is based on CNN’s image identification and big data algorithm calculation, and it can effectively improve the geologic hazard identification accuracy. By designing experiments, the geologic hazard radar image data are verified, and the practicality of radar image intelligent Identification under CNN and big data technology is also verified. The results show that the images of different resolution sizes all play a significant role in identification of geologic hazard performed by CNN. However, there are differences in the performance of different CNN models. With the continuous increase of training samples, the identification accuracy of various network models is also improved. By means of radar image test, the identification capability of CNN model is the best, the highest precision is 93.61%, and the geologic hazard recall rate is 98.27%. Apriori algorithm is introduced into data processing, and the running speed and efficiency of identification models are improved, with favorable identification effect in variable data sets. This research can provide theoretical ideas and practical value for the development of potential geologic hazard identification on tourist routes.


Sign in / Sign up

Export Citation Format

Share Document