posted by user: NCAA || 1502 views || tracked by 1 users: [display]

Image and Vision Computing Special Issue 2020 : Image and Vision Computing (IVC) Special Issue on Deep Cross-Media Neural Model for Generating Image Descriptions

FacebookTwitterLinkedInGoogle

Link: https://www.journals.elsevier.com/image-and-vision-computing/call-for-papers/neural-model-for-generating-image-descriptions
 
When N/A
Where N/A
Submission Deadline TBD
Categories    deep cross-media neural model   image understanding   image descriptions
 

Call For Papers

Summary and Scope:

Understanding and generating image descriptions (UGID) are hot topics that combines the computer vision (CV) and natural language processing (NLP). UGID has broad application prospects in many fields of AI. Different from coarse-grained image understanding of independent labeling, the image description task needs to learn the natural language descriptions of images. This requires not only the model to recognize the objects in the image, but also other visual elements (e.g., actions and attributes of objects), but also understand the interrelationships between objects and generate human-readable description sentences, which is challenging. The real image understanding is to describe image with natural language and let the machine emulate humans for better human-computer interaction. With the fast development of deep learning in the fields of CV and NLP, the encoder-decoder based deep neural models have obtained breakthrough results in generating image descriptions in cross-media domains. As such, the image understanding may become a reality in future. However, current models can only provide a simple description about image, i.e., the number of descriptive words is usually limited and even the sentences are logically wrong.

In this special issue, we invite the original contributions from diverse research fields, developing new deep cross-media neural model for understanding and generating image descriptions, which aims to reduce the gap between image understanding and natural language descriptions.

The topics of interest include, but are not limited to:

Attention guided UGID Visual relationship in UGID Compositional architectures for UGID Multimodal learning for UGID Describing novel objects in UGID Natural language processing model New datasets for UGID Novel encoder-decoder based architecture Deep cross-media neural model with applications of UGID, e.g., early childhood education, medical image analysis, assisted blinding and news automation, etc.
Important Dates:

Paper submission due: Oct 20, 2020

First notification: Dec 20, 2020

Final decision made on all manuscripts: Mar 30, 2021

Managing Guest Editor:

Prof. Zhao Zhang, Hefei University of Technology, China

Other Guest Editors:

Dr. Sheng Li, University of Georgia, USA

Prof. Meng Wang, Hefei University of Technology, China

Prof. Shuicheng Yan, National University of Singapore, Singapore

Related Resources

IEEE-Ei/Scopus-ITCC 2025   2025 5th International Conference on Information Technology and Cloud Computing (ITCC 2025)-EI Compendex
3CVIP 2025   2025 4th Asia Conference on Cloud Computing, Computer Vision and Image Processing (3CVIP 2025)
SPIE-Ei/Scopus-DMNLP 2025   2025 2nd International Conference on Data Mining and Natural Language Processing (DMNLP 2025)-EI Compendex&Scopus
CMVIT-Maldives 2025   2025 9th International Conference on Machine Vision and Information Technology (CMVIT 2025)
MobiCASE 2025   16th EAI International Conference on Mobile Computing, Applications and Services
CGIP 2025   2025 3rd International Conference on Computer Graphics and Image Processing (CGIP 2025)
IEEE ICMVA 2025   IEEE--2025 The 8th International Conference on Machine Vision and Applications (ICMVA 2025)
IWIP 2025   2025 5th International Workshop on Image Processing (IWIP 2025)
Maldives-CMVIT 2025   2025 9th International Conference on Machine Vision and Information Technology (CMVIT 2025)
ICGSP 2025   2025 The 9th International Conference on Graphics and Signal Processing (ICGSP 2025)