MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Face Images
Date
2019-01-01
Author
Cugu, Ilke
Sener, Eren
Akbaş, Emre
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
This paper aims to create extremely small and fast convolutional neural networks (CNNs) for facial expression recognition (FER) from frontal face images. To this end, we employed the popular knowledge distillation (KD) method and identified two major shortcomings in its use: 1) a fine-grained grid search is needed to tune the temperature hyperparameter, and 2) to find the optimal size-accuracy balance, one needs to search for the final network size (or the compression rate). On the other hand, KD proved useful for model compression in the FER problem, and we discovered that its effect becomes more significant as model size decreases. In addition, we hypothesized that the translation invariance achieved using max-pooling layers would not be useful for the FER problem, as expressions are sensitive to small, pixelwise changes around the eyes and the mouth. However, we found an intriguing improvement in generalization when max-pooling is used. We conducted experiments on two widely used FER datasets, CK+ and Oulu-CASIA. Our smallest model (MicroExpNet), obtained using knowledge distillation, is less than 1 MB in size and runs at 1851 frames per second on an Intel i7 CPU. Despite being less accurate than the state of the art, MicroExpNet still provides significant insights for designing a microarchitecture for the FER problem.
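To make the role of the temperature hyperparameter concrete, the following is a minimal sketch of a generic, Hinton-style knowledge distillation loss. It is not taken from the paper and may differ from the authors' formulation; the function names, the weighting factor `alpha`, the default temperature, and the example logits are all illustrative assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives a flatter distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, true_label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a hard-label term and a soft-target term.

    `alpha` and `temperature` are hypothetical defaults, not values from the paper.
    """
    # Hard term: cross-entropy of the student (at temperature 1) with the true label.
    hard_probs = softmax(student_logits, 1.0)
    hard_loss = -math.log(hard_probs[true_label])

    # Soft term: cross-entropy between the softened teacher and student outputs.
    teacher_soft = softmax(teacher_logits, temperature)
    student_soft = softmax(student_logits, temperature)
    soft_loss = -sum(t * math.log(s) for t, s in zip(teacher_soft, student_soft))

    # The soft term is scaled by T^2 so its gradient magnitude stays comparable
    # across temperatures (the common convention, assumed here).
    return alpha * hard_loss + (1.0 - alpha) * temperature ** 2 * soft_loss


if __name__ == "__main__":
    teacher = [3.0, 0.5, 0.2]   # made-up teacher logits for three classes
    student = [2.0, 1.0, 0.1]   # made-up student logits
    # A coarse sweep over temperatures, standing in for the grid search
    # that the abstract describes as a shortcoming of KD.
    for temp in (1.0, 2.0, 4.0, 8.0):
        print(temp, distillation_loss(student, teacher, true_label=0, temperature=temp))
```

In this sketch the temperature only changes how soft the teacher and student distributions are, which is why its best value has to be found empirically for each student size.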
URI
https://hdl.handle.net/11511/35901
DOI
https://doi.org/10.1109/ipta.2019.8936114
Collections
Department of Computer Engineering, Conference / Seminar
Suggestions
Face classification with support vector machine
Kepenekci, B; Akar, Gözde (2004-04-30)
A new approach to feature-based frontal face recognition with Gabor wavelets and support vector machines is presented in this paper. The feature points are automatically extracted using the local characteristics of each individual face. A kernel that computes the similarity between two feature vectors is used to map the face features to a space with higher dimension. To find the identity of a test face, the possible labels of each feature vector of that face are found with support vector machines, then the ...
CEREBRA: A 3-D Visualization Tool for Brain Network Extracted from fMRI Data
Nasır, Barış; Yarman Vural, Fatoş Tunay (2016-08-20)
In this paper, we introduce a new tool, CEREBRA, to visualize the 3D network of the human brain extracted from fMRI data. The tool aims to analyze brain connectivity by representing the selected voxels as the nodes of the network. The edge weights among the voxels are estimated by considering the relationships among the voxel time series. The tool enables researchers to observe the active brain regions and the interactions among them by using graph theoretic measures, such as the edge weight and n...
A NOVEL BOVW MIMICKING END-TO-END TRAINABLE CNN CLASSIFICATION FRAMEWORK USING OPTIMAL TRANSPORT THEORY
Gürbüz, Yeti Ziya (2019-01-01)
An end-to-end trainable convolutional neural network (CNN) framework which mimics bag of visual words (BoVW) is proposed for image classification. To this end, a new paradigm for histogram-like image representation is introduced and optimal transport (OT) distance is utilized for the similarity assessment. Any patch of an image is considered as a unique visual word and the image is represented as the uniform histogram of the visual words with the histogram bins associated to embedding vectors according to t...
Occluded face recognition based on Gabor wavelets
Kepenekci, B; Tek, FB; Akar, Gözde (2002-09-25)
A new feature-based approach to frontal face recognition with Gabor wavelets is presented in this paper. The feature points are automatically extracted using the local characteristics of each individual face in order to decrease the effect of occluded features. There is no training as in neural network approaches, thus a single frontal face for each individual is enough as a reference. Experimental results show that the proposed method achieves a recognition ratio of over 95%.
Tubularity Tracking Based Automatic Road Detection from Sattelite Images
Gürbüz, Yeti Ziya; Alatan, Abdullah Aydın (2014-04-25)
In this paper, a novel approach based on tubularity tracking and graph cuts for road detection from satellite images is presented. The most important feature of the proposed method is its local peak detection filter. Unlike the tubularity-based methods for detecting roads or road-like curvilinear structures presented in the literature, the proposed method samples local peaks from the tubularity image by tracking the peak points based on Bayesian filtering in order to construct graphs and introduces no significant comput...
Citation Formats
IEEE
I. Cugu, E. Sener, and E. Akbaş, “MicroExpNet: An Extremely Small and Fast Model For Expression Recognition From Face Images,” 2019, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/35901.