Mostrar el registro sencillo del ítem

dc.contributor.author
Zacchigna, Federico Giordano  
dc.contributor.author
Lew, Sergio Eduardo  
dc.contributor.author
Lutenberg, Ariel  
dc.date.available
2024-06-03T13:57:42Z  
dc.date.issued
2024-05  
dc.identifier.citation
Zacchigna, Federico Giordano; Lew, Sergio Eduardo; Lutenberg, Ariel; Flexible Quantization for Efficient Convolutional Neural Networks; MDPI; Electronics; 13; 10; 5-2024; 1-16  
dc.identifier.issn
2079-9292  
dc.identifier.uri
http://hdl.handle.net/11336/236859  
dc.description.abstract
This work focuses on the efficient quantization of convolutional neural networks (CNNs). Specifically, we introduce a method called non-uniform uniform quantization (NUUQ), a novel quantization methodology that combines the benefits of non-uniform quantization, such as high compression levels, with the advantages of uniform quantization, which enables an efficient implementation in fixed-point hardware. NUUQ is based on decoupling the quantization levels from the number of bits. This decoupling allows for a trade-off between the spatial and temporal complexity of the implementation, which can be leveraged to further reduce the spatial complexity of the CNN, without a significant performance loss. Additionally, we explore different quantization configurations and address typical use cases. The NUUQ algorithm demonstrates the capability to achieve compression levels equivalent to 2 bits without an accuracy loss and even levels equivalent to ∼1.58 bits, but with a loss in performance of only ∼0.6%.  
dc.format
application/pdf  
dc.language.iso
eng  
dc.publisher
MDPI  
dc.rights
info:eu-repo/semantics/openAccess  
dc.rights.uri
https://creativecommons.org/licenses/by/2.5/ar/  
dc.subject
CNN  
dc.subject
quantization  
dc.subject
uniform  
dc.subject
non-uniform  
dc.subject
mixed-precision  
dc.subject
FPGA  
dc.subject
ASIC  
dc.subject
edge devices  
dc.subject
embedded systems  
dc.subject.classification
Otras Ingeniería Eléctrica, Ingeniería Electrónica e Ingeniería de la Información  
dc.subject.classification
Ingeniería Eléctrica, Ingeniería Electrónica e Ingeniería de la Información  
dc.subject.classification
INGENIERÍAS Y TECNOLOGÍAS  
dc.title
Flexible Quantization for Efficient Convolutional Neural Networks  
dc.type
info:eu-repo/semantics/article  
dc.type
info:ar-repo/semantics/artículo  
dc.type
info:eu-repo/semantics/publishedVersion  
dc.date.updated
2024-06-03T13:39:13Z  
dc.journal.volume
13  
dc.journal.number
10  
dc.journal.pagination
1-16  
dc.journal.pais
Suiza  
dc.description.fil
Fil: Zacchigna, Federico Giordano. Universidad de Buenos Aires. Facultad de Ingeniería. Departamento de Electronica; Argentina  
dc.description.fil
Fil: Lew, Sergio Eduardo. Universidad de Buenos Aires. Facultad de Ingeniería. Departamento de Electronica; Argentina  
dc.description.fil
Fil: Lutenberg, Ariel. Universidad de Buenos Aires. Facultad de Ingeniería. Departamento de Electronica; Argentina. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina  
dc.journal.title
Electronics  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/url/https://www.mdpi.com/2079-9292/13/10/1923  
dc.relation.alternativeid
info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.3390/electronics13101923