• A two-stage framework for multimodal video classification is proposed.
• The model is built on stacked contractive autoencoders.
• The first stage is single-modal pre-training.
• The second stage is multimodal fine-tuning.
• The objective functions are optimized by stochastic gradient descent.
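
As a rough illustration of the building block named above, the following is a minimal sketch of a single contractive autoencoder layer trained by stochastic gradient descent. It is not the authors' implementation. The sigmoid encoder, tied decoder weights, squared-error reconstruction term, penalty weight lam, learning rate lr, toy data, and finite-difference gradients are all assumptions made to keep the example short and dependency-free; all function names are hypothetical. The multimodal fine-tuning stage is not sketched because the highlights do not specify it.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def unpack(theta, n_in, n_hid):
    """Split a flat parameter list into W (n_hid x n_in), b (n_hid), c (n_in)."""
    W = [theta[j * n_in:(j + 1) * n_in] for j in range(n_hid)]
    b = theta[n_hid * n_in:n_hid * n_in + n_hid]
    c = theta[n_hid * n_in + n_hid:]
    return W, b, c

def encode(x, W, b):
    """Hidden code h = sigmoid(W x + b)."""
    return [sigmoid(sum(w * xi for w, xi in zip(row, x)) + bj)
            for row, bj in zip(W, b)]

def contractive_penalty(h, W):
    """Squared Frobenius norm of dh/dx for a sigmoid encoder.

    The Jacobian is diag(h * (1 - h)) W, so the norm factorizes per hidden unit.
    """
    return sum((hj * (1.0 - hj)) ** 2 * sum(w * w for w in row)
               for hj, row in zip(h, W))

def cae_loss(theta, x, n_hid, lam):
    """Reconstruction error plus lam times the contractive penalty (one sample)."""
    n_in = len(x)
    W, b, c = unpack(theta, n_in, n_hid)
    h = encode(x, W, b)
    # Decoder reuses W transposed (tied weights, an assumption of this sketch).
    x_hat = [sigmoid(sum(W[j][i] * h[j] for j in range(n_hid)) + c[i])
             for i in range(n_in)]
    recon = 0.5 * sum((a - t) ** 2 for a, t in zip(x_hat, x))
    return recon + lam * contractive_penalty(h, W)

def sgd_step(theta, x, n_hid, lam=0.1, lr=0.5, eps=1e-5):
    """One SGD update on a single sample, using central finite differences.

    Finite differences stand in for backpropagation only to keep the sketch tiny.
    """
    grad = []
    for k in range(len(theta)):
        up = theta[:k] + [theta[k] + eps] + theta[k + 1:]
        down = theta[:k] + [theta[k] - eps] + theta[k + 1:]
        grad.append((cae_loss(up, x, n_hid, lam)
                     - cae_loss(down, x, n_hid, lam)) / (2 * eps))
    return [t - lr * g for t, g in zip(theta, grad)]

# Toy usage: two complementary 4-dimensional patterns, 2 hidden units.
random.seed(0)
n_in, n_hid = 4, 2
data = [[1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0]]
theta = [random.uniform(-0.5, 0.5) for _ in range(n_hid * n_in + n_hid + n_in)]
for _ in range(200):
    theta = sgd_step(theta, random.choice(data), n_hid)
```

In a stacked setting, the hidden code returned by encode for one trained layer would serve as the input to the next layer; that stacking, and any per-modality networks, are left out here.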