A SCENE RECOGNITION ALGORITHM BASED ON MULTI-INSTANCE LEARNING

Publications

Share / Export Citation / Email / Print / Text size:

International Journal on Smart Sensing and Intelligent Systems

Professor Subhas Chandra Mukhopadhyay

Exeley Inc. (New York)

Subject: Computational Science & Engineering , Engineering, Electrical & Electronic

GET ALERTS

eISSN: 1178-5608

DESCRIPTION

0
Reader(s)
0
Visit(s)
0
Comment(s)
0
Share(s)

VOLUME 7 , ISSUE 4 (December 2014) > List of articles

A SCENE RECOGNITION ALGORITHM BASED ON MULTI-INSTANCE LEARNING

Tao Wang * / Wenqing Chen / Bailing Wang

Keywords : Image Classification, Multiple Kernel Learning, Bag of Visual Words, Spatial Pyramid Matching

Citation Information : International Journal on Smart Sensing and Intelligent Systems. Volume 7, Issue 4, Pages 1,470-1,492, DOI: https://doi.org/10.21307/ijssis-2017-716

License : (CC BY-NC-ND 4.0)

Received Date : 17-May-2014 / Accepted: 12-October-2014 / Published Online: 01-December-2014

ARTICLE

ABSTRACT

In Bag of Words image presentation model, visual words are generated by unsupervised
clustering, which leaves out the spatial relations between words and results in such shorting comings as
limited semantic description and weak discrimination. To solve this problem, we propose to substitute
visual words by visual phrases in this article. Visual phrases built according to spatial relations between
words are semantic distrainable, and they can improve the accuracy of Bag of Words model.
Considering the traditional classification method based on Bag of Words model is vulnerable to the
background, block and scalar variance of an image, we propose in this article a multiple visual words
learning method for image classification, which is based on the concept of visual phrases combined
with Multiple Instance Learning. The final classification model is able to show the spatial features of
image classes. Experiments performed on standard image testing sets, Caltech 101 and Scene 15, show
the satisfying performance of this algorithm.

Content not available PDF Share

FIGURES & TABLES

REFERENCES

[1] Iliadis, M. ; Seunghwan Yoo ; Xin Xin ; Katsaggelos, A.K,Virtual touring: A Content Based
Image Retrieval application , 2013 IEEE International Conference on Multimedia and Expo
Workshops (ICMEW), pp.1 - 4,2013..
[2] Chucai Yi ; YingLi Tian,Localizing Text in Scene Images by Boundary Clustering, Stroke
Segmentation, and String Fragment Classification , IEEE Transactions on Image
Processing,Volume: 21 , Issue: 9,pp.4256 - 4268,2012.
[3] Maron O, Lozano-Perez T. A Framework for Multiple-Instance Learning [C]. Proceedings of
Neural Information Processing Systems, 10: 570-576, 1998.
[4] Aissam Bekkari, et al., SVM Classification of Urban High-Resolution Imagery Using
Composite Kernels and Contour Information, International Journal of Advanced Computer
Science and Applications , vol. 4, no. 7, 2013.
[5] Wu Z, Ke QF, Sun J. 2009. Bundling features for large-scale partial-duplicate web image
search[C]. In Proc. CVPR, 25-32.
[6] Hong Pan ; Yaping Zhu ; Qin, A.K. ; Liangzheng Xia,Mining heterogeneous class-specific
codebook for categorical object detection and classification ,Image Processing (ICIP), 2013
20th IEEE International Conference on,pp. 3132 - 3136,2013..
[7] Liu D, Hua G, Viola P, Chen T. 2008. Integrated Feature Selection and Higher-order Spatial
Feature Extraction for Object Categorization[C]. Proceeding of the 26th IEEE Conference on
Computer Vision and Pattern Recognition, Anchorage, AK.
[8] M.Iwahara, S.C.Mukhopadhyay, S.Yamada and F.P.Dawson, "Development of Passive Fault
Current Limiter in Parallel Biasing Mode", IEEE Transactions on Magnetics, Vol. 35, No. 5,
pp 3523-3525, September 1999.
[9] Zheng YT, Zhao M, Neo SY, Chua TS, Tian Q. 2008. Visual Synset: towards a Higher-level
Visual Representation[C]. In Proc.CVPR, Achorage, Alaska, U.S.
[10] Yuan YS, Wu Y, Yang M. 2007. Discovery of Collocation Patterns: from Visual Words to
Visual Phrases[C]. Proc. of the 25th IEEE Conference on Computer Vision and Pattern
Recognition, 1-8.
[11] Li FF, Perona P. 2005. A Bayesian Hierarchical Model for Learning Natural Scene
Categories [C]. Proceeding of the 23rd IEEE Computer Society Conference on Computer
Vision and Pattern Recognition, San Diego, CA, USA, 524-531.
[12] Lazebnik S, Schmid C, Ponce J. 2006.Beyond Bags of Features: Spatial Pyramid Matching
forRecognition Natural Scene Categories [C]. Proceeding of the 24th IEEE Computer Society
Conference on Computer Vision and Pattern Recognition, New York, USA, 2: 2169 – 2178。
[13] Lowe D G. 2004. Distinctive Image Features form Scale-invariant Keypoints [J].
International Journal of Computer Vision, 60(2): 91 – 110.
[14] G. Sen Gupta, S.C. Mukhopadhyay, Michael Sutherland and Serge Demidenko, Wireless
Sensor Network for Selective Activity Monitoring in a home for the Elderly, Proceedings of
2007 IEEE IMTC conference, Warsaw, Poland, (6 pages).
[15] Zhang Q, Goldman S A. 2001. EM-DD: an improved multiple-instance learning technique.
Advances in Neural Information Processing Systems, Cambridge, CA: MIT Press, 1073-1080.
[16] Huang X,Chen SC,Shy M, et.al.2002. User concept pattern discovery using relevance
feedback and multiple-instance learning for content-based image retrieval [C].MDM/KDD
2002 Workshop Edmonton, 100-108.
[17] Fergus R, Li Feifei, Perona P, Zisserman A. 2005. Learning Object Categories from
Google’sImage Search [C]. Proceeding of the 10th International Conference on Computer
Vision (ICCV), 1816 - 1823.
[18] Blei D, Ng A, Jordan M. 2003. Latent Dirichlet Allocation [J]. Journal of Machine Learning
Research. 3: 993-1022.
[19] Subhas Chandra Mukhopadhyay, and Chien-Hung Liu, “Designing an Integrated Curriculum
Platform for Engineering Education: A Hybrid Magnetic Bearing System”, International
Journal on Technology and Engineering Education, Vol.7, No.1, pp. 17-31. July 2010.
[20] Cao LL, Li FF. 2007. Spatially Coherent Latent Topic Model for Concurrent Object
Segmentation and Classification[C]. Proceeding of the 11th IEEE International Conference
on Computer Vision. Rio de Janeiro, Brazil, 1080 – 1087.
[21] Dalal N, Triggs B. 2005. Histograms of oriented gradients for human detection Computer [C].
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern
Recognition, 1(1): 886-893.
[22] S. C. Mukhopadhyay, G. Sen Gupta and S. Demidenko, “Intelligent Method of Teaching
Eletectromagnetics Theory” Measurement Under Virtual Environment”, International Journal
on Smart Sensing and Intelligent Systems, Vol. 1, No. 2, June 2008, pp. 443-458.
[23] Kadir T, Brady M. 2001. Scale, Saliency and Image Description [J]. International Journal of
Computer Vision, 45(2): 83-105.
[24] N. K. Suryadevara, S. C. Mukhopadhyay. R.K. Rayudu and Y. M. Huang, Sensor Data
Fusion to determine Wellness of an Elderly in Intelligent Home Monitoring Environment,
Proceedings of IEEE I2MTC 2012 conference, IEEE Catalog number CFP12MT-CDR, ISBN
978-1-4577-1771-0, May 13-16, 2012, Graz, Austria, pp. 947-952.
[25] Yanmin LUO, Peizhong LIU and Minghong LIAO, AN ARTIFICIAL IMMUNE
NETWORK CLUSTERING ALGORITHM FOR MANGROVES REMOTE SENSING,
International Journal on Smart Sensing and Intelligent Systems, VOL. 7, NO. 1, pp. 116 –
134, 2014
[26] Daode Zhang et al., RESEARCH ON CHIPS’ DEFECT EXTRACTION BASED ON
IMAGE-MATCHING, International Journal on Smart Sensing and Intelligent Systems, VOL.
7, NO. 1, pp.321 – 336, 2014.
[27] Sean Dieter Tebje Kelly, Nagender Kumar Suryadevara, and S. C. Mukhopadhyay, "Towards
the Implementation of IoT for Environmental Condition Monitoring in Homes" IEEE
SENSORS JOURNAL, VOL. 13, NO. 10, OCTOBER 2013, pp. 3846-3853.

EXTRA FILES

COMMENTS