Your search
Results 34 resources
-
—Orthogonal time frequency space (OTFS) modulation, combined with massive multiple-input–multiple-output (MIMO) technology, offers robust performance in high-mobility environments and high-user densities by capturing the full diversity of the wireless channel and effectively utilizing spatial multiplexing. This article introduces an adaptive block sparse backtracking (ABSB) algorithm designed to enhance channel estimation in OTFS with massive MIMO (massive MIMO-OTFS) systems. The proposed ABSB algorithm features dynamic block size adjustment based on the residual signal, improving its adaptability to the varying sparsity structure of the channel. Additionally, the algorithm extends the selection range of related block atoms to increase redundancy, reducing the risk of underfitting. Comprehensive simulation results demonstrate that the ABSB algorithm significantly outperforms traditional pilot-based methods in terms of channel estimation accuracy. It also surpasses the block orthogonal matching pursuit (BOMP) method as well as other classical compressed sensing methods. Specifically, the ABSB algorithm achieves up to a 20% reduction in estimation error compared to some of these traditional methods. The enhanced adaptability and robustness of the ABSB algorithm make it a promising solution for channel estimation in massive MIMO-OTFS systems, paving the way for more reliable and efficient next-generation wireless communications.
-
<jats:p> This work compares the performance of different algorithms — quantum Fourier transform, Gaussian–Newton method, hyperfast, metropolis-adjusted Langevin algorithm, and nonparametric classification and regression trees — for the classification of fetal health states from FHR signals. In the conducted research, the effectiveness of each algorithm was measured using confusion matrices, which gave information about class precision, recall, and total accuracy in three classes: Normal, Suspect, and Pathological. The QFT algorithm gives an overall accuracy of 90%, where it is highly reliable in recognizing Normal (94% F1-score) and Pathological states (91% F1-score), but performs poorly regarding the Suspect cases, at 58% F1-score. On the other hand, using the GNM method gives an accuracy of 88%, whereby it performed well on Normal cases, at 93% F1-score, and poor performance with Suspect, at 50% F1-score, and Pathological classifications, at 82% F1-score. The hyperfast algorithm yielded an accuracy of 89%, thus performing well on Normal classifications with an F1-score of 93%, but less well on the Suspect states with an F1-score of 56%. The MALA algorithm outperformed all other algorithms tested in this study, giving an overall accuracy of 91% and adequately classifying Normal, Suspect, and Pathological states with corresponding F1-scores of 94%, 63%, and 90%, respectively; therefore, the algorithm is quite robust and reliable for fetal health monitoring. The NCART algorithm achieved an accuracy of 89%, thus showing great capability for classification in Normal cases with 94% F1-score and in Pathological cases with 88% F1-score; this is moderate for Suspect cases with 53% F1-score. Overall, while all algorithms exhibit potential for fetal health classification, MALA stands out as the most effective, offering reliable classification across all health states. These findings highlight the need for further refinement, particularly in enhancing the detection of Suspect conditions, to ensure comprehensive and accurate fetal health monitoring. </jats:p>
-
Accurate classification of brain tumors from MRI is critical for effective diagnosis and treatment. In this study, we introduce Trans-EffNet, a hybrid model combining pre-trained EfficientNet architectures with a transformer encoder to enhance brain tumor classification accuracy. By leveraging EfficientNet's deep CNN capabilities for localized feature extraction and the transformer encoder for capturing global contextual relationships, our model improves the identification of intricate tumor characteristics. Fine-tuned with ImageNet-derived weights and utilizing extensive data augmentation, Trans-EffNet was validated on both multi-class and binary datasets. Trans-EffNetB1 achieved 99.49 % accuracy on the multi-class dataset, while Trans-EffNetB2 recorded 99.83 % accuracy on the binary dataset, with perfect precision, recall, and F1-Score. These results underscore Trans-EffNet's robustness and potential as a significant advancement in brain tumor detection and classification.
-
<jats:title>Abstract</jats:title> <jats:p> <jats:italic>Objective.</jats:italic> Mild cognitive impairment (MCI) is a precursor stage of dementia characterized by mild cognitive decline in one or more cognitive domains, without meeting the criteria for dementia. MCI is considered a prodromal form of Alzheimer’s disease (AD). Early identification of MCI is crucial for both intervention and prevention of AD. To accurately identify MCI, a novel multimodal 3D imaging data integration graph convolutional network (GCN) model is designed in this paper. <jats:italic>Approach.</jats:italic> The proposed model utilizes 3D-VGGNet to extract three-dimensional features from multimodal imaging data (such as structural magnetic resonance imaging and fluorodeoxyglucose positron emission tomography), which are then fused into feature vectors as the node features of a population graph. Non-imaging features of participants are combined with the multimodal imaging data to construct a population sparse graph. Additionally, in order to optimize the connectivity of the graph, we employed the pairwise attribute estimation (PAE) method to compute the edge weights based on non-imaging data, thereby enhancing the effectiveness of the graph structure. Subsequently, a population-based GCN integrates the structural and functional features of different modal images into the features of each participant for MCI classification. <jats:italic>Main results.</jats:italic> Experiments on the AD Neuroimaging Initiative demonstrated accuracies of 98.57%, 96.03%, and 96.83% for the normal controls (NC)-early MCI (EMCI), NC-late MCI (LMCI), and EMCI-LMCI classification tasks, respectively. The AUC, specificity, sensitivity, and F1-score are also superior to state-of-the-art models, demonstrating the effectiveness of the proposed model. Furthermore, the proposed model is applied to the ABIDE dataset for autism diagnosis, achieving an accuracy of 91.43% and outperforming the state-of-the-art models, indicating excellent generalization capabilities of the proposed model. <jats:italic>Significance.</jats:italic> This study demonstrate<jats:bold>s</jats:bold> the proposed model’s ability to integrate multimodal imaging data and its excellent ability to recognize MCI. This will help achieve early warning for AD and intelligent diagnosis of other brain neurodegenerative diseases.</jats:p>
-
<jats:p>Personalized recommendation plays an important role in many online service fields. In the field of tourism recommendation, tourist attractions contain rich context and content information. These implicit features include not only text, but also images and videos. In order to make better use of these features, researchers usually introduce richer feature information or more efficient feature representation methods, but the unrestricted introduction of a large amount of feature information will undoubtedly reduce the performance of the recommendation system. We propose a novel heterogeneous multimodal representation learning method for tourism recommendation. The proposed model is based on two-tower architecture, in which the item tower handles multimodal latent features: Bidirectional Long Short-Term Memory (Bi-LSTM) is used to extract the text features of items, and an External Attention Transformer (EANet) is used to extract image features of items, and connect these feature vectors with item IDs to enrich the feature representation of items. In order to increase the expressiveness of the model, we introduce a deep fully connected stack layer to fuse multimodal feature vectors and capture the hidden relationship between them. The model is tested on the three different datasets, our model is better than the baseline models in NDCG and precision.</jats:p>
-
结合实用案例讲解Java语法、面向对象程序设计技术和核心API。 全书共10章,内容涵盖Java概述、Java语法基础、面向对象基础、面向对象高级技术、 Java API、异常处理机制、Java I/O流、多线程、Java GUI编程和Java网络编程等知识要点。案例丰富,以JDK 17和IntelliJ IDEA等流行的开发环境为依托,力求让读者通过案例 掌握Java编程技术。另一个特色是在阐释专业内容的同时自然融入思政元素, 具有鲜明的时代性和引领性。 可作为普通高等院校计算机、软件工程、人工智能等专业“面向对象程序设计”“Java程序设计”课程的教材,也适 合编程爱好者自学和培训使用。
-
Text classification is an important topic in natural language processing, with the development of social network, many question-and-answer pairs regarding health-care and medicine flood social platforms. It is of great social value to mine and classify medical text and provide targeted medical services for patients. The existing algorithms of text classification can deal with simple semantic text, especially in the field of Chinese medical text, the text structure is complex and includes a large number of medical nomenclature and professional terms, which are difficult for patients to understand. We propose a Chinese medical text classification model using a BERT-based Chinese text encoder by N-gram representations (ZEN) and capsule network, which represent feature uses the ZEN model and extract the features by capsule network, we also design a N-gram medical dictionary to enhance medical text representation and feature extraction. The experimental results show that the precision, recall and F1-score of our model are improved by 10.25%, 11.13% and 12.29%, respectively, compared with the baseline models in average, which proves that our model has better performance.
-
The number of tourist attractions reviews, travel notes and other texts has grown exponentially in the Internet age. Effectively mining users’ potential opinions and emotions on tourist attractions, and helping to provide users with better recommendation services, which is of great practical significance. This paper proposes a multi-channel neural network model called Pre-BiLSTM combined with a pre-training mechanism. The model uses a combination of coarse and fine- granularity strategies to extract the features of text information such as reviews and travel notes to improve the performance of text sentiment analysis. First, we construct three channels and use the improved BERT and skip-gram methods with negative sampling to vectorize the word-level and vocabulary-level text, respectively, so as to obtain more abundant textual information. Second, we use the pre-training mechanism of BERT to generate deep bidirectional language representation relationships. Third, the vectors of the three channels are input into the BiLSTM network in parallel to extract global and local features. Finally, the model fuses the text features of the three channels and classifies them using SoftMax classifier. Furthermore, numerical experiments are conducted to demonstrate that Pre-BiLSTM outperforms the baselines by 6.27%, 12.83% and 18.12% in average in terms of accuracy, precision and F1-score.
-
Dance education has undergone significant changes with the integration of information technology. Traditional dance pedagogy is now complemented by innovative digital software tools and applications. This work surveys the diverse applications of information technology in dance education at college or university level and the impact it has on teaching and learning processes. We discuss the integration of technology in various aspects of dance education, including skill development, choreography, performance analysis, VR/AR, online virtual learning, and collaborative learning. Additionally, the benefits and challenges associated with the use of information technology are also examined and the future research directions for research and practice in this field are proposed.
-
Recently, a lot of Chinese patients consult treatment plans through social networking platforms, but the Chinese medical text contains rich information, including a large number of medical nomenclatures and symptom descriptions. How to build an intelligence model to automatically classify the text information consulted by patients and recommend the correct department for patients is very important. In order to address the problem of insufficient feature extraction from Chinese medical text and low accuracy, this paper proposes a dual channel Chinese medical text classification model. The model extracts feature of Chinese medical text at different granularity, comprehensively and accurately obtains effective feature information, and finally recommends departments for patients according to text classification. One channel of the model focuses on medical nomenclatures, symptoms and other words related to hospital departments, gives different weights, calculates corresponding feature vectors with convolution kernels of different sizes, and then obtains local text representation. The other channel uses the BiGRU network and attention mechanism to obtain text representation, highlighting the important information of the whole sentence, that is, global text representation. Finally, the model uses full connection layer to combine the representation vectors of the two channels, and uses Softmax classifier for classification. The experimental results show that the accuracy, recall and F1-score of the model are improved by 10.65%, 8.94% and 11.62% respectively compared with the baseline models in average, which proves that our model has better performance and robustness.
-
Traditional text classification models have some drawbacks, such as the inability of the model to focus on important parts of the text contextual information in text processing. To solve this problem, we fuse the long and short-term memory network BiGRU with a convolutional neural network to receive text sequence input to reduce the dimensionality of the input sequence and to reduce the loss of text features based on the length and context dependency of the input text sequence. Considering the extraction of important features of the text, we choose the long and short-term memory network BiLSTM to capture the main features of the text and thus reduce the loss of features. Finally, we propose a BiGRU-CNN-BiLSTM model (DCRC model) based on CNN, GRU and LSTM, which is trained and validated on the THUCNews and Toutiao News datasets. The model outperformed the traditional model in terms of accuracy, recall and F1 score after experimental comparison.
-
In view of the complex marine environment of navigation, especially in the case of multiple static and dynamic obstacles, the traditional obstacle avoidance algorithms applied to unmanned surface vehicles (USV) are prone to fall into the trap of local optimization. Therefore, this paper proposes an improved artificial potential field (APF) algorithm, which uses 5G communication technology to communicate between the USV and the control center. The algorithm introduces the USV discrimination mechanism to avoid the USV falling into local optimization when the USV encounter different obstacles in different scenarios. Considering the various scenarios between the USV and other dynamic obstacles such as vessels in the process of performing tasks, the algorithm introduces the concept of dynamic artificial potential field. For the multiple obstacles encountered in the process of USV sailing, based on the International Regulations for Preventing Collisions at Sea (COLREGS), the USV determines whether the next step will fall into local optimization through the discrimination mechanism. The local potential field of the USV will dynamically adjust, and the reverse virtual gravitational potential field will be added to prevent it from falling into the local optimization and avoid collisions. The objective function and cost function are designed at the same time, so that the USV can smoothly switch between the global path and the local obstacle avoidance. The simulation results show that the improved APF algorithm proposed in this paper can successfully avoid various obstacles in the complex marine environment, and take navigation time and economic cost into account.
-
Medical classification is affected by many factors, and the traditional medical classification is usually restricted by factors such as too long text, numerous categories and so on. In order to solve these problems, this paper uses word vector and word vector to mine the text deeply, considering the problem of scattered key features of medical text, introducing long-term and short-term memory network to effectively retain the features of historical information in long text sequence, and using the structure of CNN to extract local features of text, through attention mechanism to obtain key features, considering the problems of many diseases, by using hierarchical classification. To stratify the disease. Combined with the above ideas, a deep DLCF model suitable for long text and multi-classification is designed. This model has obvious advantages in CMDD and other datasets. Compared with the baseline models, this model is superior to the baseline model in accuracy, recall and other indicators.
-
Investor sentiment and emotions have a strong impact on financial markets. In recent years there has been increasing interest in analyzing the sentiment of investors for stock price prediction using machine learning. Existing prediction models mostly depend on the analysis of trading data and company profit. few prediction theories have been built based on individual investors' sentiments. The fundamental reason is the difficulty to measure individual investors' sentiment.
-
Convolutional neural network (CNN) model based on deep learning has excellent performance for target detection. However, the detection effect is poor when the object is circular or tubular because most of the existing object detection methods are based on the traditional rectangular box to detect and recognize objects. To solve the problem, we propose the circular representation structure and RepVGG module on the basis of CenterNet and expand the network prediction structure, thus proposing a high-precision and high-efficiency lightweight circular object detection method RebarDet. Specifically, circular tubular type objects will be optimized by replacing the traditional rectangular box with a circular box. Second, we improve the resolution of the network feature map and the upper limit of the number of objects detected in a single detect to achieve the expansion of the network prediction structure, optimized for the dense phenomenon that often occurs in circular tubular objects. Finally, the multibranch topology of RepVGG is introduced to sum the feature information extracted by different convolution modules, which improves the ability of the convolution module to extract information. We conducted extensive experiments on rebar datasets and used AB-Score as a new evaluation method to evaluate RebarDet. The experimental results show that RebarDet can achieve a detection accuracy of up to 0.8114 and a model inference speed of 6.9 fps while maintaining a moderate amount of parameters, which is superior to other mainstream object detection models and verifies the effectiveness of our proposed method. At the same time, RebarDet’s high precision detection of round tubular objects facilitates enterprise intelligent manufacturing processes.
Explore
Academic Units
Resource type
- Book (1)
- Book Section (6)
- Conference Paper (7)
- Journal Article (19)
- Report (1)
Cooperation
-
China
(2)
- Henan University (2)
- Macau (1)