Recent Articles

Open access

ISSN: 2096-5796
CN: 10-1561/TP
e-ISSN: 2666-1209

Efficient and lightweight 3D building reconstruction from drone imagery using sparse line and point clouds

Efficient three-dimensional (3D) building reconstruction from drone imagery often faces data acquisition, storage, and computational challenges because of its reliance on dense point clouds. In this...

Deconfounded fashion image captioning with transformer and multimodal retrieval

The annotation of fashion images is a significantly important task in the fashion industry as well as social media and e-commerce. However, owing to the complexity and diversity of fashion images, this...

DeepSafe: Two-level deep learning approach for disaster victims detection

Efficient disaster victim detection (DVD) in urban areas after natural disasters is crucial for minimizing losses. However, conventional search and rescue (SAR) methods often experience delays, which...

A haptic feedback glove for virtual piano interaction

Haptic feedback plays a crucial role in virtual reality (VR) interaction, helping to improve the precision of user operation and enhancing the immersion of the user experience. Instrumental haptic feedback...

YGC-SLAM: A visual SLAM based on improved YOLOv5 and geometric constraints for dynamic indoor environments

As visual simultaneous localization and mapping (SLAM) is primarily based on the assumption of a static scene, the presence of dynamic objects in the frame causes problems such as a deterioration of...

Chasing in virtual environment: Dynamic alignment for multi-user collaborative redirected walking

The redirected walking (RDW) method for multi-user collaboration requires maintaining the relative position between users in a virtual environment (VE) and physical environment (PE). A chasing game...

Finger tracking for wearable VR glove using flexible rack mechanism

With the increasing prominence of hand and finger motion tracking in virtual reality (VR) applications and rehabilitation studies, data gloves have emerged as a prevalent solution. In this study, we...

FDCPNet: feature discrimination and context propagation network for 3D shape representation

Three-dimensional (3D) shape representation using mesh data is essential in various applications, such as virtual reality and simulation technologies. Current methods for extracting features from mesh...

Optimizing wireless sensor network topology with node load consideration

With the development of the Internet, the topology optimization of wireless sensor networks has received increasing attention. However, traditional optimization methods often overlook the energy imbalance...

Survey of neurocognitive disorder detection methods based on speech, visual, and virtual reality technologies

The global trend of population aging poses significant challenges to society and healthcare systems, particularly because of neurocognitive disorders (NCDs) such as Parkinson's disease (PD) and Alzheimer's...

Previs-Real: Interactive virtual previsualization system for news shooting rehearsal and evaluation

In the demanding field of live news broadcasting, the intricate studio production procedures and tight schedules pose significant challenges for physical rehearsals by cameramen. This paper explores...

InputJump: Augmented reality-facilitated cross-device input fusion based on spatial and semantic information

The proliferation of computing devices requires seamless cross-device interactions. Augmented reality (AR) headsets can facilitate interactions with existing computers owing to their user-centered views...

Automatic piano performance interaction system based on greedy algorithm for dexterous manipulator

With continuous advancements in artificial intelligence (AI), automatic piano-playing robots have become subjects of cross-disciplinary interest. However, in most studies, these robots served merely...

MatStick: Changing the material sensation of objects upon impact

An increasing number of studies have focused on providing rich tactile feedback in virtual reality interactive scenarios. In this study, we addressed a tapping scenario in virtual reality by designing...

Pre-training transformer with dual-branch context content module for table detection in document images

Document images such as statistical reports and scientific journals are widely used in information technology. Accurate detection of table areas in document images is an essential prerequisite for tasks...

Music-stylized hierarchical dance synthesis with user control

Synthesizing dance motions to match musical inputs is a significant challenge in animation research. Compared to functional human motions, such as locomotion, dance motions are creative and artistic,...

Mesh representation matters: investigating the influence of different mesh features on perceptual and spatial fidelity of deep 3D morphable models

Deep 3D morphable models (deep 3DMMs) play an essential role in computer vision. They are used in facial synthesis, compression, reconstruction and animation, avatar creation, virtual try-on, facial...

Co-salient object detection with iterative purification and predictive optimization

Co-salient object detection (Co-SOD) aims to identify and segment commonly salient objects in a set of related images. However, most current Co-SOD methods encounter issues with the inclusion of irrelevant...

CURDIS: A template for incremental curve discretization algorithms and its application to conics

We introduce CURDIS, a template for algorithms to discretize arcs of regular curves by incrementally producing a list of support pixels covering the arc. In this template, algorithms proceed by finding...

Generating animatable 3D cartoon faces from single portraits

With the development of virtual reality (VR) technology, there is a growing need for customized 3D avatars. However, traditional methods for 3D avatar modeling are either time-consuming or fail to retain...

Robust blind image watermarking based on interest points

Digital watermarking technology plays an essential role in the work of anti-counterfeiting and traceability. However, image watermarking algorithms are weak against hybrid attacks, especially geometric...

S2ANet: Combining local spectral and spatial point grouping for point cloud processing

Despite the recent progress in 3D point cloud processing using deep convolutional neural networks, the inability to extract local features remains a challenging problem. In addition, existing methods...

MKEAH: Multimodal knowledge extraction and accumulation based on hyperplane embedding for knowledge-based visual question answering

External knowledge representations play an essential role in knowledge-based visual question and answering to better understand complex scenarios in the open world. Recent entity-relationship embedding...

Multi-scale context-aware network for continuous sign language recognition

The hands and face are the most important parts for expressing sign language morphemes in sign language videos. However, we find that existing Continuous Sign Language Recognition (CSLR) methods lack...

Automatic detection of breast lesions in automated 3D breast ultrasound with cross-organ transfer learning

Deep convolutional neural networks have garnered considerable attention in numerous machine learning applications, particularly in visual recognition tasks such as image and video analyses. There is...
