Browse Preprints - Authorea

2012 signal processing and analysis Preprints

Please note: These are preprints and have not been peer reviewed. Data may be preliminary.

A ConvTransNet Model Based on I/Q-Language Mutual Learning and Supervised Learning fo...

Wenhan Li

and 4 more

June 07, 2024

Automatic modulation recognition (AMR) is an important signal classification technology in cognitive radio. As AMR advances, an increasing number of artificial neural networks are being employed in the field to enhance its performance. In order to further improve its performance, a ConTransNet model based on I/Q-language mutual learning and supervised learning is proposed in this work. First, a ConTransNet model is introduced to handle modulation signals. The model consists of two branches: one is CNN, and the other is transformer. To facilitate information exchange between the two branches, an information interaction module is introduced, implemented with a bridge connection. To enhance the model's performance, a training algorithm called I/Q-language mutual learning and supervised learning is designed. This method utilizes mutual supervision between the output of one branch of the ConTransNet model and the output of a language feature extraction model, while the other branch adopts supervised learning. Finally, through experimental comparisons with five other algorithms (CE-FuFormer, ConvLSTMAE, DAE, FEAT , and MCLDNN), the effectiveness of the proposed method is validated.

Joint Gray-Mapping for Multilayer Multicast NOMA with Arbitrary Modulation Orders

Hamad Yahya

and 2 more

June 07, 2024

This work considers the design of a generalized Gray-mapping process to multilayer multicast non-orthogonal multiple access (NOMA) transmission with arbitrary modulation orders. Unlike orthogonal multiple access, joint-multilayer Graymapping can provide significant energy savings and bit error rate (BER) improvements, which can be used to alleviate the degradation caused by the inherent multilayer interference in NOMA. The obtained improvement is due to the increased Euclidean distance that Gray-mapping provides for certain layers/symbols. To evaluate the impact of Gray-mapping, closed-form expressions are derived for the exact BER with imperfect successive interference cancellation. The obtained analytical and simulation results demonstrate that the proposed scheme can offer up to 10 dB gain and 94% energy saving compared to conventional NOMA in certain scenarios, where such a performance gain can be shared between the layers by selecting an appropriate power assignment. Moreover, feasibility maps are generated to demonstrate the additional flexibility that Gray-mapping can offer in terms of quality of service satisfaction for the various layers at lower signal to noise ratios.

Complex phase, asymptotic parameter and integrals

Nicolas Vacca

and 1 more

June 07, 2024

Asymptotic signals have been introduced to approximate integrals with the stationary phase approximation. However, the deviation between integrals and their approximations is not bounded without further constraints. We propose an alternative with the introduction of a parameter defined in the complex phase representation. The magnitude of this new parameter controls the local harmonicity of the signal. It therefore controls the deviation between integrals and their approximations, which do not rely on the stationary phase principle. Approximations of Fourier transforms illustrate the results. The parameter is also involved in additional properties of diverse nature.

Combining-Based Spatial Domain SIC Method in a FD MIMO System by Addition of an RF Ch...

Xuan Chen

and 4 more

June 07, 2024

In this paper, we propose and evaluate a spatial domain self-interference cancellation (SIC) method based on the principle of zero-forcing (ZF) beamforming for a generic multiple-input multiple-output (MIMO) full-duplex (FD) transceiver system in a large-scale communication scenario. The core of the proposed SIC method is to implement an additional radio-frequency (RF) chain at the receiver (RX) to induce an additional dimension on the channel matrix, so that the inversion of the channel matrix becomes feasible. To this end, we adapt, as an example, a simple ZF digital combiner in our system model to simultaneously perform SIC and enhance the communication link performance. Simulations results further confirm our proposal. In particular, our method demonstrates a strong SIC capability, even with an erroneous knowledge of the channel state information (CSI), and effectively improves the communication link performance of the FD system.

Enhanced Target Localization based on Double RIS

Lei He

and 4 more

June 07, 2024

Reconfigurable Intelligent Surface (RIS) has become an enabling technology for economically and efficiently realizing intelligent and reconfigurable wireless communication environments. Previous positioning research based on RIS has mainly focused on single RIS without considering the improvement in positioning accuracy brought by double RIS, thus failing to reveal the full potential of a double RIS-assisted wireless communication system. This paper investigates a target positioning system assisted by double RIS, where double RIS is used to enhance communication between users and the receiver. Firstly, the reception signal model based on the collaborative effect of double RIS is constructed, and then we formulate the target positioning problem based on this model, aiming to obtain the optimal target signal while minimizing system noise. Moreover, to verify the effectiveness of the localization system with double RIS assistance, we also configured a scenario of a single RIS localization system for comparative analysis. Simulation results indicate that the proposed target positioning method based on double RIS exhibits certain superiority in improving system positioning performance compared to single RIS.

Reconfigurable Intelligent Surface-Assisted Massive MIMO System to Reduce Pilot Conta...

Baohe Pang

and 3 more

June 07, 2024

In massive multiple-input multiple-output (MIMO) systems, cellular users are affected by inter-cell coherent interference, especially pilot contamination. To solve this problem and improve spectral efficiency (SE) of each cell, we propose a reconfigurable intelligent surface (RIS)-assisted cellular massive MIMO system. Based on the uplink pilot information and minimum mean square error (MMSE) method, base stations (BSs) can estimate all the uplink channels for local decoding. Then, we give an uplink signal-to-noise-plus-interference ratio (SINR) expression and an each user's SE expression based on maximum ratio combining (MRC) processing method. The effects of pilot contamination, beamforming gain, interference, and noise on the system can be seen from the expressions. Power control and RIS phase shift optimization methods are also proposed to reduce pilot contamination. The power control method is formulated to maximize each user's rate. The phase shift optimization is also formulated to maximize the target area's average power. Numerical results show that power control method and phase shift matrix optimization can improve the system performance.

On the Origin of Cardiovascular Sounds Recorded from the Ear

Bjarke Gårdbæk

and 1 more

June 07, 2024

It has been conjectured that sounds recorded at the ear, using microphones, originate from either heart sounds propagating from the heart to the recording site, through tissue and bone, or vascular activity at the recording site, such as vasodilation. However, prior studies have not been able to verify these conjectures. The aim of this study is to gain a deeper understanding of the signals measured in the ear using so called body-coupled microphones. Method: A study was conducted on 10 subjects, who were instructed to stand up, lie down and exercise with a body-coupled microphone mounted in each ear, using personalised ear-pieces. The subjects were additionally instructed to follow a guided breathing session. Recorded body-coupled microphone signals were evaluated against standard measures, such as electrocardiography, photoplethysmography, and respiration flow and effort. All signal modalities were synchronised to a common trigger signal. For the analysis, the body-coupled microphone and photoplethysmography signals were epoched in accordance with the electrocardiography R-peak. Results: Cluster-based permutation test showed that the body-coupled microphone signals were time-locked to the electrocardiography R-peak. No difference was found between ears. Discussion: Comparing the timing of the electrocardiography and photoplethysmography events suggests that signals recorded using body-coupled microphones in the ears reflect blood pressure waves in the arteries in the proximity of the body-coupled microphones.

Processing of Incomplete Multichannel SAR Data for HRWS Imaging

Govinda Behera

and 1 more

June 04, 2024

The multichannel SAR system is an advanced imaging technique that enables achieving high resolution and wide swath simultaneously. However, the fundamental hindrance in the multichannel SAR system is the failure of reception by the multiantenna system due to being in hostile working conditions. As a result of this, the received multichannel SAR data often suffer from the presence of missing samples in the azimuth domain. Besides this problem, the multichannel SAR system also has to deal with the azimuth ambiguities to obtain high azimuth resolution and wide swath. This kind of received multichannel SAR data consisting of missing azimuth samples and azimuth ambiguities cannot be processed with the conventional multichannel SAR algorithms. Therefore, we propose a novel multichannel reconstruction method to address both challenges, enabling imaging of wide swath width with high resolution from the received incomplete multichannel SAR data. Our method uses the minimum variance distortionless response (MVDR) algorithm to reconstruct an unambiguous wide azimuth spectrum. In the end, we have provided simulation results that validate the efficiency of the proposed reconstruction method.

Insights on 'Complex-Valued Iris Recognition Network'

Ajay Kumar

June 03, 2024

This paper comments on a recently published TPAMI paper presenting an iris recognition algorithm. While the approach is intriguing, we can identify several inconsistencies and errors in the exposition. Additionally, their comparison with the state-of-the-art methods lacks fairness. I take this opportunity to clarify and underline these errors, aiming to assist fellow researchers like me who are interested in advancing biometrics research.

Real-time Hand-Tracking with Skin-Conformal Force Myography based on Wrist-Worn Laser...

Vinay Kammarchedu

and 2 more

June 04, 2024

Force myography (FMG) measures the movement of limbs or appendages by monitoring force at their surface to characterize the state of the underlying musculotendinous complex. Compared to electromyography methods, FMG offers a better alternative for monitoring the muscular activity due to direct measurement of force, less sensitivity to skin preparation and skin impedance, and high compatibility with wearable prosthetics and orthotics. Despite significant progress in developing FMG systems, existing devices are still bulky and restrictive to the user or to the placement of the exoskeleton systems. In this work, we develop a wristband integrating an array of ten skin-conformal and wearable strain sensors based on laser induced graphene optimized for continuous measurement of FMG signals. We characterize the device to identify several hand gestures and tasks while simultaneously using an optical camera-based hand tracking system to estimate the joint locations for ground truth generation. We develop machine learning models to predict the gestures as well as specific hand joint angles with a high accuracy (> 90% and > 95%, respectively). We find that sensors that are placed closer to actuation specific anatomical features contribute more towards the high accuracy. We also integrate the sensor array with a wearable readout system that wirelessly transmits the data in real-time, which is used to control a robotic arm as a proof of concept for human-robot interaction applications. The developed skin-conformal FMG device is expected to find wide applications in rehabilitation, sports sciences, and humancomputer interaction, paving the way for low profile prosthetic and orthotic control systems.

Synergizing AI and CPU: Empowering Next-Generation Computing

Karthikeya Tallapaneni

and 5 more

June 04, 2024

The aim of this study, therefore, is to reinvent the future of computing systems in terms of performance, efficiency, and adaptability by identifying the "frontier" of AI-driven innovations in CPU design. * This document surveys front-end optimizations driven by AI, new architectural ideas, and upcoming paradigms that might revolutionize CPU technology through a thorough literature review as well as empirical evaluations. * With the help of advanced artificial intelligence methods like as machine learning, deep learning, and reinforcement learning, the exploration of new CPU architectures and optimization techniques permits the unleashing of computational powers beyond anything previously imagined. * AI methods on CPU architecture and address significant concerns such as scalability, energy efficiency, security, and reliability have been systematically explored. Furthermore, the transformative ability of AI-driven CPUs is also illustrated through real-world-case applications in several fields, e.g. autonomous systems, healthcare informatics, edge computing. This paper helps clear the roadmap for the next system of creative revolution in AI-imbued computing, defining potential directions for additional research and cooperation.

Low-Complexity K-Beams Clustering for Intra-Cell Pilot Reuse in Massive MIMO Communic...

Dariel Pereira-Ruisánchez

and 3 more

June 03, 2024

Massive MIMO (mMIMO) communication systems are recognized as key enablers of next-generation wireless networks. However, the orthogonal pilot assignments typical of multiple-input multiple-output (MIMO) systems are not wellsuited to emerging use cases characterized by short channel coherence intervals and high number of connected user equipments (UEs). In this work, we propose a novel approach for intracell pilot reuse that leverages the spatial features of correlated mMIMO channels to attain low pilot contamination while using a small number of pilot sequences. The first part of the proposed solution is a groundbreaking clustering algorithm termed Kbeams which splits the complex intra-cell pilot allocation into tractable problems without significant loss of optimality. Then, we introduce a heuristic approach called best-first pilot assignment (BFPA) designed to manage intra-cluster pilot assignments by minimizing interference among the most contaminating UEs. We evaluate the performance of our proposed solution (K-beams+BFPA) in terms of sum-normalized mean-squared error (NMSE) and sum-rate under various challenging network setups. Simulation results show that our approach is a robust alternative to more computationally demanding benchmarks.

Motion robustness validation of a Phase-Locked Loop for EEG phase tracking in Brain-C...

Le Xing

and 3 more

May 30, 2024

Background. Closed loop brain-computer interfaces dynamically adjust stimulation settings and/or timings based upon concurrently measured data. EEG (electroencephalography) is a widely used input. Particularly for closed loop applications in sleep monitoring, Phase Locked Loops have been used to track the narrowband phase of the EEG signal in real-time to provide a reference signal for closing the loop. During sleep, motion artifacts are minimal. However, there are many potential applications of real-time phase tracking of EEG when using more mobile EEG where artifacts are not necessarily negligible. Objective. To evaluate the robustness of PLL based EEG phase tracking when used with artifact corrupted EEG signals. We hypothesize that the intrinsic flywheel action of a PLL means that the tracked phase will not be perturbed by transient artifacts, leading to good tracking performance even in EEG situations without dedicated artifact removal stages being applied. Approach. We tested a PLL algorithm by using single channel (Fp1) EEG data from two datasets, each of which contains both cleaned and artifact contaminated versions of the same underlying EEG signal. We explored whether the PLL has similar phase tracking performance on both versions of the data. The Phase-Locked Value (PLV) was used for evaluating the phase tracking performance. Results. In general, PLV values in excess of 0.7 were obtained for phase tracking performance, irrespective of whether clean or artifact contaminated EEG data was passed to the PLL. Averaged across all EEG frequency bands, we found no statistically significant difference (p < 0.01) between the PLV values generated from clean and contaminated versions of the EEG signals. The phase tracking performance remained stable even as the number of artifacts present increased, with the decrease in average PLV being less than 0.1 even in high artifact cases. Tracking each EEG frequency band in isolation, Delta band tracking was more sensitive to low-frequency and high-amplitude artifacts, such as EOG and jump artifacts, with many PLV values below 0.6 being obtained. Differences in PLV values were much lower for the Theta, Alpha, and Beta bands, with differences increasing in the higher frequency Gamma band. Significance: Robust phase tracking in the presence of artifacts means that dedicated artifact removal processing may not be necessary for closed loop EEG, unless working with Delta or high Gamma band signals. Artifact removal algorithms are computationally complex and resource intensive, and operating without a dedicated removal step may reduce the processing time required, allowing a faster closing of the loop.

IIST BCI Dataset-6 for Selected Common Odia words

Shivani Sahoo

and 4 more

May 30, 2024

Brain-Computer Interface (BCI) is a technology that enables direct communication between the brain and external devices, typically by interpreting neural signals. BCI-based solutions for neurodegenerative disorders need datasets with patients’ native languages. However, research in BCI lacks insufficient language-specific datasets, as seen in Odia, spoken by 35-40 million individuals in India. To address this gap, we developed an Electroencephalograph (EEG) based BCI dataset featuring EEG signal samples of commonly spoken Odia words. Using the OpenBCI Cyton device, EEG recordings are collected from a volunteer who speaks Odia language. The dataset is divided into 4 parts: (i) vocal Odia words, (ii) English translations of these Odia words, (iii) sub-vocalization of the Odia words, and (iv) sub-vocalization of English words. The dataset contains information about 100 different words. Each word is recorded with ten trials. By training the dataset using Machine Learning (ML) and Deep Learning (DL) methods, a BCI system can be designed to translate EEG signals into both vocal and subvocal for the Odia and English languages. This can enhance the communication and quality of Odia-speaking patients with neurodegenerative diseases.

YOLO/SBHK-Net: Occlusion Aware Robot Vision Neural Network with Compound Model Scalin...

Sheekar Banerjee

and 1 more

May 30, 2024

Robot Vision is the technique of enabling robots to process visual data from the environment by utilizing a combination of camera hardware and computer algorithms. Advanced deep neural networks have significantly played a vital role in indulging robots to make more sense out of complex visual data at different circumstances, especially in object detection and continuous tracking. In this research, we initiated a unique and cutting-edge backbone neural network for the conventional YOLO algorithm which we named as SBHK-Net. The network boosted up the performance of the existing YOLO algorithm drastically which manifests a strong potential of improving tracking and recognition accuracies of other conventional algorithms in the robot vision industry as well. It has the greatest accuracy 59.2% AP among all known real-time object detectors with 30 FPS or above on GPU RTX3060, and it outperforms all other known object detectors in the range of 5 FPS to 160 FPS. We used YOLOv7 as our reference point for the core research. The transformer-based detector SWINL Cascade-Mask R-CNN (9.2 FPS A100, 53.9% AP) and the convolutional detector ConvNeXt-XL Cascade-Mask R-CNN (8.6 FPS A100, 55.2% AP) are both outperformed by the SBHK-Net core object detector (56 FPS RTX3060, 56.4% AP) in terms of speed and accuracy, respectively. In terms of speed and accuracy, it surpasses a number of other object detectors, including DINO-5scale-R50, ViT-Adapter-B, Scaled-YOLOv4, YOLOv5, DETR, Deformable DETR, and YOLOR and YoLOX. The source code of this research is available at https://github.com/ac005sheekar/SBHKNet.

CNNs Improve Decoding of Selective Attention to Speech in Cochlear Implant Users

Constantin Jehn

and 4 more

May 30, 2024

Understanding speech in the presence of background noise such as other speech streams is a difficult problem for people with hearing impairment, and in particular for users of cochlear implants (CIs). To improve their listening experience, auditory attention decoding (AAD) aims to decode the target speaker of a listener from electroencephalography (EEG), and then use this information to steer an auditory prosthesis towards this speech signal. In normal-hearing individuals, deep neural networks (DNNs) have been shown to improve AAD compared to simpler linear models. AAD has also been shown to be feasible in CI users using linear models, however, it has not yet been shown that DNNs can yield enhanced decoding accuracies for this patient group. Here we show that attention decoding in CI users can be significantly improved through the usage of a convolutional neural network (CNN). To this end, we first collected an EEG dataset on selective auditory attention from 25 CI users, and then implemented both a linear model as well as a CNN for attention decoding. We observed superior performance of the CNN across all considered decision window sizes, ranging from 1 s to 60 s. Boosted by a Support Vector Machine (SVM) as a trainable classifier, the CNN decoder achieved a maximal mean decoding accuracy of 74% at the population level for a decision window of 60 s duration. Our findings illustrate that the progress made in AAD among normal hearing participants, facilitated by the integration of DNNs, extends to cochlear implant (CI) users.

Machine Learning Identification and Classification of Cancer Cell Behaviors in a Lab-...

Ching-Yi Lin

and 1 more

May 30, 2024

Cell culture assays play a vital role in various fields of biology. Conventional assay techniques like immunohistochemistry, immunofluorescence, and flow cytometry offer valuable insights into cell phenotype and behavior. However, each of these techniques requires labeling or staining, and this is a major drawback, specifically in applications that require compact and integrated analytical devices. To address this shortcoming, CMOS capacitance sensors capable of conducting label-free cell culture assays have been proposed. In this paper, we present a computational framework for further augmenting the capabilities of these capacitance sensors. In our framework, identification and classification of mitosis and migration are achieved by leveraging observations from measured capacitance time series data. Specifically, we engineered two time series features that enable discriminating cell behaviors at the single-cell level. Our feature representation achieves an area under curve (AUC) of 0.719 in the receiver operating characteristic (ROC) curve. Additionally, we show that our feature representation technique is applicable across arbitrary experiments, as validated by a leaveone-run-out test yielding an F-1 score of 0.803 and a G-Mean of 0.647.

Temporal Low-Rank based k-space Sampling Pattern Optimization for MR Fingerprinting

Felix Horger

and 4 more

May 30, 2024

Magnetic Resonance Fingerprinting (MRF) is theoretically more efficient than steady-state quantitative MRI techniques because it exploits dynamic behavior to enhance differences in signals obtained from tissues with different relaxation parameters. In practice, MRF often struggles to deliver the predicted performance, requiring careful adjustment of sequence parameters such as flip-angles, repetition times and k-space sampling patterns. MRF sequences result in a highly undersampled dynamic image series; state-of-the-art methods now exploit a temporal low-rank (TLR) reconstruction approach to help resolve some of the resulting undersampling artifacts. While successful, the TLR reconstruction mixes signals across space and through time, obscuring how sampling might be optimized for best results. This work explores optimal sampling for TLR reconstruction of MRF. We examine conditioning of the reconstruction problem as a predictor of image quality, and propose an effective optimization algorithm for k-space sampling on regular grids. Based on this, we compare different sampling schemes in simulations and real phantom experiments. We explain how undersampling generates aliasing, enhances noise and errors, and demonstrate how parallel-imaging improves this. We also conclude that the final reconstruction quality depends on a combination of undersampling, the expected signal distribution and errors present in real scans, and point towards how these might be included in further optimization efforts.

Contribution of EEG biosignals for Stress Detection

Jonah Fernandez

and 3 more

May 30, 2024

Stress is a prevalent global concern impacting individuals across various life aspects. This paper investigates stress detection using electroencephalographic (EEG) signals, which have proven valuable for studying neural correlates of stress. Stress was induced in participants, and physiological data was recorded as part of the experimental setup. Different feature sets were extracted and four machine learning models, including LightGBM, Convolutional Neural Network (CNN), K-Nearest Neighbors (KNN), and Support Vector Machine (SVM), were utilized for classification tasks. The findings indicate that the mean and standard deviation of 19 channels consistently outperform other feature sets. LightGBM demonstrates superior performance across all scenarios compared to CNN, KNN, and SVM. Overall, this study presents an effective stress detection approach using EEG signals and demonstrates the potential of integrating simple statistical features for enhanced classification accuracy. The findings contribute to the advancement of stress monitoring technologies, with potential applications in wearables and BCIs for real-time stress management.

SSNet: Novel Approach for Fingerprint Recognition in Data-Scarce Scenarios

Saket Pateriya

and 5 more

May 25, 2024

State-of-the-art (SOTA) models typically use large datasets for pre-training and then fine-tune on smaller datasets for better performance. However, the high computational cost can be a barrier for many researchers. There is a need to focus on data size-independent models suited for data-scarce scenarios, which is essential for tasks like fingerprint recognition and could make research more accessible and generalizable in resource-limited environments. With this aim, this paper presents a novel approach to the difficulties in contactless fingerprint recognition, particularly with scarce and poor-quality challenging dataset images due to contactless acquisition. Our proposed system uses a 'Scattering using a Shearlet Network (SSNet)' to extract fingerprint features and a score-level fusion scheme to improve authentication accuracy. In contrast to the computationally expensive and mathematically less transparent dense deep learning networks such as vision transformers, attention networks, deep learning-based hybrid approaches, etc., SSNet is an economical framework with fixed filters. The SSNet is a replacement to the Scattering Wavelet Network (SWN) that utilizes a Complex Morlet Wavelet (CMW). Our model significantly improves verification and identification accuracy over SOTA approaches, particularly with scarce and poor-quality challenging datasets.

Semi-Autonomous Continuous Robotic Arm Control Using an Augmented Reality Brain-Compu...

Kirill Kokorin

and 5 more

May 22, 2024

Noninvasive augmented-reality (AR) brain-computer interfaces (BCIs) that use steady-state visually evoked potentials (SSVEPs) typically adopt a fully autonomous goal-selection framework to control a robot, where automation is used to compensate for the low information transfer rate of the BCI. This scheme improves task performance but users may prefer direct control (DC) of robot motion. To provide users with a balance of autonomous assistance and manual control, we developed a shared control (SC) system for continuous control of robot translation using an SSVEP AR-BCI, which we tested in a 3D reaching task. The SC system used the BCI input and robot sensor data to continuously predict which object the user wanted to reach, generated an assistance signal, and regulated the level of assistance based on prediction confidence. Eighteen healthy participants took part in our study and each completed 24 reaching trials using DC and SC. Compared to DC, SC significantly improved (paired two-tailed t-test, Holm-corrected α<0.05) mean task success rate (p<0.0001, µ=36.1%, 95% CI [25.3%, 46.9%]), normalised reaching trajectory length (p<0.0001, µ=-26.8%, 95% CI [-36.0%,-17.7%]), and participant workload (p<0.02, µ=-11.6, 95% CI [-21.1,-2.0]) measured with the NASA Task Load Index. Therefore, users of SC can control the robot effectively, while experiencing increased agency. Our system can personalise assistive technology by providing users with the ability to select their preferred level of autonomous assistance.

Decentralized Privacy-Preserving Federated Learning for Ultrasonic Nerve Image Segmen...

Gowtham Vinjamuri

and 4 more

May 21, 2024

Currently, Federated Learning is a research approach where multiple parties can train a model together without sharing each other's data to solve complex problems in machine learning. Ultrasound Nerve Segmentation is a computer vision technique that automatically identifies and segments nerve structures in ultrasound images. This technique is particularly important in medical applications where accurate localization of nerves is crucial, such as during anesthesia, nerve blocks, or surgical procedures. Ultrasound Nerve Segmentation can help doctors find nerves better during medical procedures. This could make patients feel better and have better results. Talking about surgery can make even brave patients scared because it hurts and can cause a lot of pain afterward. To reduce pain, people use drugs called narcotics, but these drugs can have bad side effects. The goal of this project is to improve pain management by using indwelling catheters that block or reduce pain at its source. These catheters reduce the need for painkillers and hasten patient healing. It is crucial to precisely identify nerve structures in ultrasound images to guarantee the exact insertion of a patient's pain management catheter. We attempt to build an algorithm that can recognize nerve structures in a dataset of neck ultrasound images in this study. To do this, we created a U-net architecture model that will accept an image as input and forecast an image with the source of the pain highlighted as the output. Achieving this objective would improve catheter placement precision and help in the future with reduced pain.

Fortifying SplitFed Learning: Strengthening Resilience Against Malicious Clients

Ashwin Kumaar

and 2 more

May 21, 2024

This article focuses on analyzing SplitFed Learning against model poisoning vulnerability and developing methods to protect such a system against these attacks. SplitFed learning is a distributed learning paradigm where a neural network model is split between clients and the server, contrasting with traditional Federated Learning. SplitFed learning enables enhanced security and privacy of data, and clients do not need to perform heavy computation in model training, as they only need to train a part of the model. This approach ensures that the model can make precise predictions while maintaining the confidentiality of sensitive information. In addition to implementing a SplitFed Model, the paper proposes a distance-based method that can poison SplitFed Learning-based systems. Subsequently, this paper develops a novel prevention strategy based on robust statistical properties of the sample. To test the proposed methodology, as a test case, we have employed the image cell dataset of the malaria parasite. By addressing the impacts of adversarial attacks, this paper contributes to the advancement of deep learning techniques.

User location uncertainty in RIS-aided channel optimization

Sanaz Kianoush

and 4 more

May 20, 2024

This paper considers an indoor smart radio environment (SRE) where a Base Station (BS) communicates with a set of user equipment (UE) in the sub-THz band by reconfigurable intelligent surfaces (RISs), as the direct BS-UE links are obstructed. Motivated by the sparsity of the sub-THz channel, we model each RIS as an electronically steerable reflector, which can be described by a single parameter, i.e., the steering angle. We focus on the case where the positions of the UEs are unknown and have to be estimated. Specifically, we propose a novel approach to use RISs for jointly localizing the UEs and optimizing the communication performance. The UE localization is made possible by a dedicated RIS and is handled by a machine learning (ML) algorithm, which exploits the signal transmitted by the UEs and received at the BS. Once the estimates of the UE positions are available, the downlink communication between BS and UEs is optimized by properly selecting the electronic steering angle of the RISs so as to maximize the network throughput. By numerical simulations, the paper shows how the system performance is affected by the area of the RISs, by the number of antennas available at the BS, and by the number of steering angles scanned by the RIS used for localization.