Results | Kerko

Cui, Y., Liang, S., & Zhang, Y. (2024). Multimodal representation learning for tourism recommendation with two-tower architecture. PLOS ONE, 19(2). https://doi.org/10.1371/journal.pone.0299370

Personalized recommendation plays an important role in many online service fields. In the field of tourism recommendation, tourist attractions contain rich context and content information. These implicit features include not only text, but also images and videos. In order to make better use of these features, researchers usually introduce richer feature information or more efficient feature representation methods, but the unrestricted introduction of a large amount of feature information will undoubtedly reduce the performance of the recommendation system. We propose a novel heterogeneous multimodal representation learning method for tourism recommendation. The proposed model is based on a two-tower architecture, in which the item tower handles multimodal latent features: Bidirectional Long Short-Term Memory (Bi-LSTM) is used to extract the text features of items, an External Attention Transformer (EANet) is used to extract the image features of items, and these feature vectors are connected with item IDs to enrich the feature representation of items. In order to increase the expressiveness of the model, we introduce a deep fully connected stack layer to fuse multimodal feature vectors and capture the hidden relationship between them. The model is tested on three different datasets and outperforms the baseline models in NDCG and precision.

Liang, S., Chen, T., Ma, J., Ren, S., Lu, X., & Du, G. (2024). Identification of mild cognitive impairment using multimodal 3D imaging data and graph convolutional networks. Physics in Medicine & Biology, 69(23). https://doi.org/10.1088/1361-6560/ad8c94

Objective. Mild cognitive impairment (MCI) is a precursor stage of dementia characterized by mild cognitive decline in one or more cognitive domains, without meeting the criteria for dementia. MCI is considered a prodromal form of Alzheimer’s disease (AD). Early identification of MCI is crucial for both intervention and prevention of AD. To accurately identify MCI, a novel multimodal 3D imaging data integration graph convolutional network (GCN) model is designed in this paper. Approach. The proposed model utilizes 3D-VGGNet to extract three-dimensional features from multimodal imaging data (such as structural magnetic resonance imaging and fluorodeoxyglucose positron emission tomography), which are then fused into feature vectors as the node features of a population graph. Non-imaging features of participants are combined with the multimodal imaging data to construct a population sparse graph. Additionally, in order to optimize the connectivity of the graph, we employed the pairwise attribute estimation (PAE) method to compute the edge weights based on non-imaging data, thereby enhancing the effectiveness of the graph structure. Subsequently, a population-based GCN integrates the structural and functional features of different modal images into the features of each participant for MCI classification. Main results. Experiments on the AD Neuroimaging Initiative demonstrated accuracies of 98.57%, 96.03%, and 96.83% for the normal controls (NC)-early MCI (EMCI), NC-late MCI (LMCI), and EMCI-LMCI classification tasks, respectively. The AUC, specificity, sensitivity, and F1-score are also superior to state-of-the-art models, demonstrating the effectiveness of the proposed model. Furthermore, the proposed model is applied to the ABIDE dataset for autism diagnosis, achieving an accuracy of 91.43% and outperforming the state-of-the-art models, indicating excellent generalization capabilities of the proposed model. Significance. This study demonstrates the proposed model’s ability to integrate multimodal imaging data and its excellent ability to recognize MCI. This will help achieve early warning for AD and intelligent diagnosis of other brain neurodegenerative diseases.

Al-Razgan, M., Ali, Y. A., Neira-Molina, H., Ma, H., Du, G., & Ain, Q. U. (2024). Optimizing Fetal Health Status Detection Using Quantum Intelligent Deep-Learning Methods on Cardiotocographic Data. SPIN, 15(02). https://doi.org/10.1142/S2010324724400058

This work compares the performance of different algorithms for the classification of fetal health states from fetal heart rate (FHR) signals: quantum Fourier transform (QFT), Gaussian–Newton method (GNM), hyperfast, Metropolis-adjusted Langevin algorithm (MALA), and nonparametric classification and regression trees (NCART). The effectiveness of each algorithm was measured using confusion matrices, which gave information about class precision, recall, and total accuracy in three classes: Normal, Suspect, and Pathological. The QFT algorithm gives an overall accuracy of 90%; it is highly reliable in recognizing Normal (94% F1-score) and Pathological states (91% F1-score), but performs poorly on Suspect cases, at 58% F1-score. The GNM method gives an accuracy of 88%; it performed well on Normal cases, at 93% F1-score, and poorly on Suspect, at 50% F1-score, and Pathological classifications, at 82% F1-score. The hyperfast algorithm yielded an accuracy of 89%, performing well on Normal classifications with an F1-score of 93%, but less well on the Suspect states with an F1-score of 56%. The MALA algorithm outperformed all other algorithms tested in this study, giving an overall accuracy of 91% and adequately classifying Normal, Suspect, and Pathological states with F1-scores of 94%, 63%, and 90%, respectively; therefore, the algorithm is quite robust and reliable for fetal health monitoring. The NCART algorithm achieved an accuracy of 89%, showing great capability for classification in Normal cases with 94% F1-score and in Pathological cases with 88% F1-score; performance is moderate for Suspect cases, with 53% F1-score. Overall, while all algorithms exhibit potential for fetal health classification, MALA stands out as the most effective, offering reliable classification across all health states. These findings highlight the need for further refinement, particularly in enhancing the detection of Suspect conditions, to ensure comprehensive and accurate fetal health monitoring.

Nizamani, A. H., Chen, Z., Nizamani, A. A., Bhatti, M. A., Ma, H., & Du, G. (2024). Trans-EffNet: A Hybrid Model for Brain Tumor Detection Using EfficientNet and Transformer Encoder. 2024 IEEE Smart World Congress (SWC). https://doi.org/10.1109/SWC62898.2024.00260

Accurate classification of brain tumors from MRI is critical for effective diagnosis and treatment. In this study, we introduce Trans-EffNet, a hybrid model combining pre-trained EfficientNet architectures with a transformer encoder to enhance brain tumor classification accuracy. By leveraging EfficientNet's deep CNN capabilities for localized feature extraction and the transformer encoder for capturing global contextual relationships, our model improves the identification of intricate tumor characteristics. Fine-tuned with ImageNet-derived weights and utilizing extensive data augmentation, Trans-EffNet was validated on both multi-class and binary datasets. Trans-EffNetB1 achieved 99.49% accuracy on the multi-class dataset, while Trans-EffNetB2 recorded 99.83% accuracy on the binary dataset, with perfect precision, recall, and F1-Score. These results underscore Trans-EffNet's robustness and potential as a significant advancement in brain tumor detection and classification.

Wang, H., Chen, Q., Wang, X., Du, G., Li, X., & Nallanathan, A. (2025). Adaptive Block Sparse Backtracking-Based Channel Estimation for Massive MIMO-OTFS Systems. IEEE Internet of Things Journal, 12(1). https://doi.org/10.1109/JIOT.2024.3466911

Orthogonal time frequency space (OTFS) modulation, combined with massive multiple-input–multiple-output (MIMO) technology, offers robust performance in high-mobility environments and high-user densities by capturing the full diversity of the wireless channel and effectively utilizing spatial multiplexing. This article introduces an adaptive block sparse backtracking (ABSB) algorithm designed to enhance channel estimation in OTFS with massive MIMO (massive MIMO-OTFS) systems. The proposed ABSB algorithm features dynamic block size adjustment based on the residual signal, improving its adaptability to the varying sparsity structure of the channel. Additionally, the algorithm extends the selection range of related block atoms to increase redundancy, reducing the risk of underfitting. Comprehensive simulation results demonstrate that the ABSB algorithm significantly outperforms traditional pilot-based methods in terms of channel estimation accuracy. It also surpasses the block orthogonal matching pursuit (BOMP) method as well as other classical compressed sensing methods. Specifically, the ABSB algorithm achieves up to a 20% reduction in estimation error compared to some of these traditional methods. The enhanced adaptability and robustness of the ABSB algorithm make it a promising solution for channel estimation in massive MIMO-OTFS systems, paving the way for more reliable and efficient next-generation wireless communications.

Ma, J., Du, W., & Lu, W. (2022). Message from Program Chairs. Proceedings - 2022 IEEE/ACIS 22nd International Conference on Computer and Information Science, ICIS 2022, X. Scopus. https://doi.org/10.1109/ICIS54925.2022.9882414

Ma, J., Du, W., & Lu, W. (2023). Foreword. Studies in Computational Intelligence, 1055, v–vii. Scopus.

Du, W., Chen, J., & Xu, S. (2023). Exploring the Current Landscape and Future Directions of Information Technology in Dance Education. 2023 26th ACIS International Winter Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD-Winter), 120–126. https://doi.org/10.1109/SNPD-Winter57765.2023.10224030

Dance education has undergone significant changes with the integration of information technology. Traditional dance pedagogy is now complemented by innovative digital software tools and applications. This work surveys the diverse applications of information technology in dance education at the college or university level and the impact it has on teaching and learning processes. We discuss the integration of technology in various aspects of dance education, including skill development, choreography, performance analysis, VR/AR, online virtual learning, and collaborative learning. Additionally, the benefits and challenges associated with the use of information technology are examined, and future directions for research and practice in this field are proposed.

Liang, S. (梁胜彬). (2023). Java Object-Oriented Programming (Java面向对象程序设计). Tsinghua University Press (清华大学出版社). http://www.tup.tsinghua.edu.cn/Wap/tsxqy.aspx?id=09722601

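Explains Java syntax, object-oriented programming techniques, and the core APIs through practical case studies. The book has 10 chapters covering an overview of Java, Java syntax basics, object-oriented fundamentals, advanced object-oriented techniques, the Java API, exception handling, Java I/O streams, multithreading, Java GUI programming, and Java network programming. It is rich in examples and built around popular development environments such as JDK 17 and IntelliJ IDEA, aiming to help readers master Java programming through case studies. Another distinctive feature is that ideological and political education elements are woven naturally into the technical content, giving the book a distinctly contemporary and guiding character. It can serve as a textbook for "Object-Oriented Programming" and "Java Programming" courses in computer science, software engineering, artificial intelligence, and related majors at general higher-education institutions, and is also suitable for self-study by programming enthusiasts and for training use.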

Ma, H., Ma, J., Liang, S., & Du, W. (2022). A Model of Integrating Bert and BiGRU+ Attention Dual-channel Mechanism for Investor Sentiment Analysis of Stock Price Forecast. 2022 IEEE/ACIS 23rd International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 126–131. https://doi.org/10.1109/SNPD54884.2022.10051779

Investor sentiment and emotions have a strong impact on financial markets. In recent years there has been increasing interest in analyzing the sentiment of investors for stock price prediction using machine learning. Existing prediction models mostly depend on the analysis of trading data and company profit. Few prediction theories have been built based on individual investors' sentiments. The fundamental reason is the difficulty of measuring individual investors' sentiment.

Li, X., Jiao, T., Ma, J., Duan, D., & Liang, S. (2023). LSDA-APF: A Local Obstacle Avoidance Algorithm for Unmanned Surface Vehicles Based on 5G Communication Environment. Computer Modeling in Engineering & Sciences, 138(1), 595–617. https://doi.org/10.32604/cmes.2023.029367

In view of the complex marine environment of navigation, especially in the case of multiple static and dynamic obstacles, the traditional obstacle avoidance algorithms applied to unmanned surface vehicles (USV) are prone to fall into the trap of local optimization. Therefore, this paper proposes an improved artificial potential field (APF) algorithm, which uses 5G communication technology to communicate between the USV and the control center. The algorithm introduces a USV discrimination mechanism to prevent the USV from falling into local optimization when it encounters different obstacles in different scenarios. Considering the various scenarios between the USV and other dynamic obstacles such as vessels in the process of performing tasks, the algorithm introduces the concept of a dynamic artificial potential field. For the multiple obstacles encountered in the process of USV sailing, based on the International Regulations for Preventing Collisions at Sea (COLREGS), the USV determines whether the next step will fall into local optimization through the discrimination mechanism. The local potential field of the USV will dynamically adjust, and the reverse virtual gravitational potential field will be added to prevent it from falling into local optimization and to avoid collisions. The objective function and cost function are designed at the same time, so that the USV can smoothly switch between the global path and local obstacle avoidance. The simulation results show that the improved APF algorithm proposed in this paper can successfully avoid various obstacles in the complex marine environment while taking navigation time and economic cost into account.

Liang, S., Jin, J., Du, W., & Qu, S. (2023). A Multi-Channel Text Sentiment Analysis Model Integrating Pre-training Mechanism. Information Technology and Control, 52(2), 263–275. https://doi.org/10.5755/j01.itc.52.2.31803

The number of tourist attraction reviews, travel notes, and other texts has grown exponentially in the Internet age. Effectively mining users’ potential opinions and emotions on tourist attractions, and thereby helping to provide users with better recommendation services, is of great practical significance. This paper proposes a multi-channel neural network model called Pre-BiLSTM combined with a pre-training mechanism. The model uses a combination of coarse- and fine-granularity strategies to extract the features of text information such as reviews and travel notes to improve the performance of text sentiment analysis. First, we construct three channels and use the improved BERT and skip-gram methods with negative sampling to vectorize the word-level and vocabulary-level text, respectively, so as to obtain more abundant textual information. Second, we use the pre-training mechanism of BERT to generate deep bidirectional language representation relationships. Third, the vectors of the three channels are input into the BiLSTM network in parallel to extract global and local features. Finally, the model fuses the text features of the three channels and classifies them using a SoftMax classifier. Furthermore, numerical experiments demonstrate that Pre-BiLSTM outperforms the baselines by 6.27%, 12.83% and 18.12% on average in terms of accuracy, precision and F1-score.

Liang, S., Sun, F., Sun, H., Chen, T., & Du, W. (2023). A medical text classification approach with ZEN and capsule network. The Journal of Supercomputing. https://doi.org/10.1007/s11227-023-05612-6

Text classification is an important topic in natural language processing. With the development of social networks, many question-and-answer pairs regarding health care and medicine flood social platforms. It is of great social value to mine and classify medical text and provide targeted medical services for patients. Existing text classification algorithms can deal with semantically simple text, but in the field of Chinese medical text in particular, the text structure is complex and includes a large number of medical nomenclature and professional terms, which are difficult for patients to understand. We propose a Chinese medical text classification model using a BERT-based Chinese text encoder by N-gram representations (ZEN) and a capsule network, which represents features using the ZEN model and extracts features with the capsule network. We also design an N-gram medical dictionary to enhance medical text representation and feature extraction. The experimental results show that the precision, recall and F1-score of our model are improved by 10.25%, 11.13% and 12.29%, respectively, on average compared with the baseline models, which shows that our model has better performance.

Li, W., Yang, Q., & Du, W. (2022). Tourist Sentiment Mining Based on Deep Learning. In C. Thomas (Ed.), Artificial Intelligence (Vol. 8). IntechOpen. https://doi.org/10.5772/intechopen.98836

Mining the sentiment of the user on the internet via the context plays a significant role in uncovering the human emotion and in determining the exactness of the underlying emotion in the context. An increasingly enormous volume of user-generated content (UGC) in social media and online travel platforms has led to the development of data-driven sentiment analysis (SA), and most extant SA in the domain of tourism is conducted using document-based SA (DBSA). However, unlike aspect-based SA (ABSA), DBSA cannot be used to examine what specific aspects need to be improved or to disclose the unknown dimensions that affect the overall sentiment. ABSA requires accurate identification of the aspects and sentiment orientation in the UGC. In this book chapter, we illustrate the contribution of data mining based on deep learning in sentiment and emotion detection.

Li, N., Yang, X., Du, W., Ogihara, A., Zhou, S., Ma, X., Wang, Y., Li, S., & Li, K. (2022). Exploratory Research on Key Technology of Human-Computer Interactive 2.5-Minute Fast Digital Early Warning for Mild Cognitive Impairment. Computational Intelligence and Neuroscience, 2022, 1–15. https://doi.org/10.1155/2022/2495330

Objective. As the preclinical stage of Alzheimer’s disease (AD), Mild Cognitive Impairment (MCI) is characterized by hidden onset, which is difficult to detect early. Traditional neuropsychological scales are the main tools used for assessing MCI. However, due to their strong subjectivity and the influence of many factors such as subjects’ educational background, language and hearing ability, and time cost, their accuracy as the standard of early screening is low. Therefore, the purpose of this paper is to propose a new key technology of fast digital early warning for MCI based on eye movement objective data analysis. Methodology. Firstly, four exploratory indexes (test durations, correlation degree, lengths of gaze trajectory, and drift rate) of MCI early warning are determined based on the relevant literature research and semistructured expert interviews; secondly, the eye movement state is captured with an eye tracker to realize the data extraction of the four exploratory indexes. On this basis, the human-computer interactive 2.5-minute fast digital early warning paradigm for MCI is designed; thirdly, the rationality of the four early warning indexes proposed in this paper and their early warning effectiveness on MCI are verified. Results. Through a small sample test of the human-computer interactive 2.5-minute fast digital early warning paradigm for MCI conducted with 32 elderly people aged 70–90 in a medical institution in Hangzhou, the two indexes of “correlation degree” and “drift rate” with statistical differences are selected. The experiment results show that the AUC of this MCI early warning paradigm is 0.824. Conclusion. The key technology of human-computer interactive 2.5-minute fast digital early warning for MCI proposed in this paper overcomes the limitations of the existing MCI early warning tools, such as low objectification level, high dependence on professional doctors, long test time, and the requirement of a high educational level. The experiment results show that the early warning technology, as a new generation of objective and effective digital early warning tool, can realize 2.5-minute fast and high-precision preliminary screening and early warning for MCI in the elderly.

Ma, X., Chen, Y., Li, S., Du, W., Tan, Z., Li, Q., Ma, Y., & Deng, H. (2021). Quality Detection Method for Controller Process Based on Mask R-CNN Network Model. 2021 8th International Conference on Computational Science/Intelligence and Applied Informatics (CSII), 18–22. https://doi.org/10.1109/CSII54342.2021.00012

Han, W., Shasha, L., Shaobin, L., Wencai, D., Qinggu, L., & Bo, W. (2021). Research on Quality Control of Intelligent Injection Molding System Based on Process Monitoring. 2021 8th International Conference on Computational Science/Intelligence and Applied Informatics (CSII), 1–6. https://doi.org/10.1109/CSII54342.2021.00013

Barzinji, A. O., Ma, C., Du, W., & Ma, J. (2021). A Machine Learning Approach to Predict the Trend of Obesity Prevalence at a Global Level. 2021 IEEE/ACIS 6th International Conference on Big Data, Cloud Computing, and Data Science (BCD), 25–30. https://doi.org/10.1109/BCD51206.2021.9581579

Wang, H., Yang, J., Wu, Y., Du, W., Fong, S., Duan, Y., Yao, X., Zhou, X., Li, Q., Lin, C., Liu, J., Huang, L., & Wu, F. (2021). A Fast Lightweight Based Deep Fusion Learning for Detecting Macula Fovea Using Ultra-Widefield Fundus Images [Preprint]. MATHEMATICS & COMPUTER SCIENCE. https://doi.org/10.20944/preprints202108.0469.v2

Macula fovea detection is a crucial prerequisite for screening and diagnosing macular diseases. Without early detection and proper treatment, any abnormality involving the macula may lead to blindness. However, with the ophthalmologist shortage and time-consuming manual evaluation, neither the accuracy nor the effectiveness of the diagnostic process can be guaranteed. In this project, we proposed a deep learning approach on ultra-widefield fundus (UWF) images for macula fovea detection. This study collected 2300 ultra-widefield fundus images from Shenzhen Aier Eye Hospital in China. Methods based on U-shape network (Unet) and Fully Convolutional Networks (FCN) are implemented on 1800 (before amplifying process) training fundus images, 400 (before amplifying process) validation images and 100 test images. Three professional ophthalmologists were invited to mark the fovea. A method from the anatomy perspective is also investigated. This approach is derived from the spatial relationship between the macula fovea and the optic disc center in UWF. A set of parameters for this method is set based on the experience of ophthalmologists and verified to be effective. Results are measured by calculating the Euclidean distance between the proposed approaches and the accurate grounded standard, which is detected by the ultra-widefield swept-source optical coherence tomography (UWF-OCT) approach. Through a comparison of the proposed methods, we conclude that the Unet deep learning approach outperformed the other methods on macula fovea detection tasks, yielding outcomes comparable to the grounded standard method.

Gois, F. N. B., Marques, J. A. L., De Oliveira Dantas, A. B., Santos, M. C., Neto, J. V. S., De Macêdo, J. A. F., Du, W., & Li, Y. (2023). Malaria Blood Smears Object Detection Based on Convolutional DCGAN and CNN Deep Learning Architectures. In R. Lee (Ed.), Computer and Information Science (Vol. 1055, pp. 197–212). Springer International Publishing. https://doi.org/10.1007/978-3-031-12127-2_14
