Comparing Humans and Deep Neural Networks on Face Recognition Under Various Distance and Rotation Viewing Conditions

dc.contributor.author Fux, Michal
dc.contributor.author Arslan , Şuayb Şefik
dc.contributor.author Jang, Hojin
dc.contributor.author Boix, Xavier
dc.contributor.author Cooper, Avi
dc.contributor.author Groth, Matt J
dc.contributor.author Sinha, Pawan
dc.date.accessioned 2023-11-21T12:39:54Z
dc.date.available 2023-11-21T12:39:54Z
dc.date.issued 2023
dc.description.abstract Humans possess impressive skills for recognizing faces even when the viewing conditions are challenging, such as long ranges, non-frontal regard, variable lighting, and atmospheric turbulence. We sought to characterize the effects of such viewing conditions on the face recognition performance of humans, and compared the results to those of DNNs. In an online verification task study, we used a 100 identity face database, with images captured at five different distances (2m, 5m, 300m, 650m and 1000m) three pitch values (00 - straight ahead, +/- 30 degrees) and three levels of yaw (00, 45, and 90 degrees). Participants were presented with 175 trials (5 distances x 7 yaw and pitch combinations, with 5 repetitions). Each trial included a query image, from a certain combination of range x yaw x pitch, and five options, all frontal short range (2m) faces. One was of the same identity as the query, and the rest were the most similar identities, chosen according to a DNN-derived similarity matrix. Participants ranked the top three most similar target images to the query image. The collected data reveal the functional relationship between human performance and multiple viewing parameters. Nine state-of-the-art pre-trained DNNs were tested for their face recognition performance on precisely the same stimulus set. Strikingly, DNN performance was significantly diminished by variations in ranges and rotated viewpoints. Even the best-performing network reported below 65% accuracy at the closest distance with a profile view of faces, with results dropping to near chance for longer ranges. The confusion matrices of DNNs were generally consistent across the networks, indicating systematic errors induced by viewing parameters. Taken together, these data not only help characterize human performance as a function of key ecologically important viewing parameters, but also enable a direct comparison of humans and DNNs in this parameter regime
dc.identifier.citation Fux, M., Arslan, S. S., Jang, H., Boix, X., Cooper, A., Groth, M. J., & Sinha, P. (2023). Comparing Humans and Deep Neural Networks on face recognition under various distance and rotation viewing conditions. Journal of Vision, 23(9), 5916-5916.
dc.identifier.doi 10.1167/jov.23.9.5916
dc.identifier.issn 1534-7362
dc.identifier.uri https://doi.org/10.1167/jov.23.9.5916
dc.identifier.uri https://hdl.handle.net/20.500.11779/2133
dc.language.iso en
dc.publisher Journal of Vision
dc.relation.ispartof Journal of Vision
dc.rights info:eu-repo/semantics/openAccess
dc.title Comparing Humans and Deep Neural Networks on Face Recognition Under Various Distance and Rotation Viewing Conditions
dc.type Article
dspace.entity.type Publication
gdc.author.id Şuayb Şefik Arslan / 0000-0003-3779-0731
gdc.author.institutional Arslan, Şuayb Şefik
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.description.department Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
gdc.description.issue 9
gdc.description.publicationcategory Makale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.scopusquality Q3
gdc.description.startpage 5916
gdc.description.volume 23
gdc.description.wosquality Q2
gdc.identifier.openalex W4386244680
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 2.0
gdc.oaire.influence 2.652194E-9
gdc.oaire.isgreen true
gdc.oaire.popularity 4.387362E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0302 clinical medicine
gdc.oaire.sciencefields 05 social sciences
gdc.oaire.sciencefields 0501 psychology and cognitive sciences
gdc.openalex.collaboration International
gdc.openalex.fwci 0.79131826
gdc.openalex.normalizedpercentile 0.64
gdc.opencitations.count 0
gdc.publishedmonth Ağustos
gdc.relation.journal Vision Sciences Society Annual Meeting Abstract
gdc.virtual.author Arslan, Şefik Şuayb
gdc.wos.publishedmonth Ağustos
gdc.yokperiod YÖK - 2022-23
relation.isAuthorOfPublication 37152966-5384-4fd7-a0dc-34d1dd8bdc7f
relation.isAuthorOfPublication.latestForDiscovery 37152966-5384-4fd7-a0dc-34d1dd8bdc7f
relation.isOrgUnitOfPublication 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3
relation.isOrgUnitOfPublication 0d54cd31-4133-46d5-b5cc-280b2c077ac3
relation.isOrgUnitOfPublication a6e60d5c-b0c7-474a-b49b-284dc710c078
relation.isOrgUnitOfPublication.latestForDiscovery 05ffa8cd-2a88-4676-8d3b-fc30eba0b7f3

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: