Quantization Archives - EITCA Academy

In TPU v1, quantify the effect of FP32→int8 with per-channel vs per-tensor quantization and histogram vs MSE calibration on performance/watt, E2E latency, and accuracy, considering HBM, MXU tiling, and rescaling overhead.

Thursday, 04 December 2025 by JOSE ALFONSIN PENA

The effect of quantization approaches—specifically FP32 to int8 with per-channel versus per-tensor schemes and histogram versus mean squared error (MSE) calibration—on Google TPU v1 performance and accuracy is multifaceted. The interplay among quantization granularity, calibration techniques, hardware tiling, memory bandwidth, and overheads such as rescaling must be comprehensively analyzed to understand their influence on performance

Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Tensor Processing Units - history and hardware

Tagged under: Accuracy, Artificial Intelligence, Calibration, HBM, Int8, Latency, MXU, Performance, Quantization, TPU

What impact does post-training quantization have when converting a TensorFlow object detection model to TensorFlow Lite in terms of accuracy and performance on iOS devices?

Thursday, 30 October 2025 by JOSE ALFONSIN PENA

Post-training quantization is a widely adopted technique used to optimize deep learning models—such as those built with TensorFlow—for deployment on edge devices, including iOS smartphones and tablets. When converting a TensorFlow object detection model to TensorFlow Lite, quantization offers significant benefits in terms of both model size and inference speed, but it also introduces certain

Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Google tools for Machine Learning, TensorFlow object detection on iOS

Tagged under: Artificial Intelligence, IOS, Object Detection, Quantization, TensorFlow, TensorFlow Lite

How to install JAX on Hailo 8?

Saturday, 20 September 2025 by Michał Otoka

Installing JAX on the Hailo-8 platform requires a comprehensive understanding of both the JAX framework and the Hailo-8 hardware/software stack. The Hailo-8 is a specialized AI accelerator designed for edge devices, optimized for running deep learning inference tasks with high efficiency and low power consumption. JAX, developed by Google, is a Python library for high-performance

Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Google Cloud AI Platform, Introduction to JAX

Tagged under: Artificial Intelligence, Edge AI, Hailo-8, JAX, Model Deployment, ONNX, Quantization, TensorFlow

When working with quantization technique, is it possible to select in software the level of quantization to compare different scenarios precision/speed?

Wednesday, 21 February 2024 by Arcadio Martín

When working with quantization techniques in the context of Tensor Processing Units (TPUs), it is essential to understand how quantization is implemented and whether it can be adjusted at the software level for different scenarios involving precision and speed trade-offs. Quantization is a important optimization technique used in machine learning to reduce the computational and

Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Tensor Processing Units - history and hardware

Tagged under: Artificial Intelligence, Machine Learning, Optimization, Quantization, TensorFlow, TPU

What are the boundary conditions imposed on the wave function of the particle in a box, and how do they affect the quantization of the wave vector?

Sunday, 06 August 2023 by EITCA Academy

In the field of Quantum Information, specifically in the study of the Particle in a Box system, the wave function of the particle is subject to certain boundary conditions. These boundary conditions play a important role in determining the quantization of the wave vector. The Particle in a Box system is a simplified model used

Published in Quantum Information, EITC/QI/QIF Quantum Information Fundamentals, Instroduction to implementing qubits, Particle in a box, Examination review

Tagged under: Boundary Conditions, Particle In A Box, Quantization, Quantum Information, Quantum Mechanics, Wave Function

How does TensorFlow Lite enable the efficient execution of machine learning models on resource-constrained platforms?

Saturday, 05 August 2023 by EITCA Academy

TensorFlow Lite is a framework that enables the efficient execution of machine learning models on resource-constrained platforms. It addresses the challenge of deploying machine learning models on devices with limited computational power and memory, such as mobile phones, embedded systems, and IoT devices. By optimizing the models for these platforms, TensorFlow Lite allows for real-time

Published in Artificial Intelligence, EITC/AI/TFF TensorFlow Fundamentals, Programming TensorFlow, Introduction to TensorFlow coding, Examination review

Tagged under: Artificial Intelligence, Hardware Acceleration, Machine Learning, Model Compression, Model Optimization, Quantization, Resource-constrained Platforms, TensorFlow Lite

Explain the technique of quantization and its role in reducing the precision of the TPU V1.

Wednesday, 02 August 2023 by EITCA Academy

Quantization is a technique used in the field of machine learning to reduce the precision of numerical values, particularly in the context of Tensor Processing Units (TPUs). TPUs are specialized hardware developed by Google to accelerate machine learning workloads. They are designed to perform matrix operations efficiently and at high speed, making them ideal for

Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Tensor Processing Units - history and hardware, Examination review

Tagged under: Artificial Intelligence, Machine Learning, Precision, Quantization, Tensor Processing Units, TPU V1

We care about your privacy

EITCI uses cookies and similar technologies to keep this site secure, remember your choices, provide personalized experience, measure the traffic, serve more relevant content and certification programmes. You can accept all cookies or customize your preferences. Cookies are variables used to store website specific information on your device to facilitate processing of data for personalized website visit, such as login to your account, accessing the programmes, placing enrolment orders in chosen programmes and improving your EITC certification journey. You can change or withdraw your consent at any time by clicking the Consent Preferences button at the left-bottom of your screen. We respect your choices and are committed to providing you with a transparent and secure browsing experience, which may be limited when cookies aren't accepted. For more details refer to the Privacy Policy

EITCA Academy

In TPU v1, quantify the effect of FP32→int8 with per-channel vs per-tensor quantization and histogram vs MSE calibration on performance/watt, E2E latency, and accuracy, considering HBM, MXU tiling, and rescaling overhead.

What impact does post-training quantization have when converting a TensorFlow object detection model to TensorFlow Lite in terms of accuracy and performance on iOS devices?

How to install JAX on Hailo 8?

When working with quantization technique, is it possible to select in software the level of quantization to compare different scenarios precision/speed?

What are the boundary conditions imposed on the wave function of the particle in a box, and how do they affect the quantization of the wave vector?

How does TensorFlow Lite enable the efficient execution of machine learning models on resource-constrained platforms?

Explain the technique of quantization and its role in reducing the precision of the TPU V1.

EITCA Academy is a part of the European IT Certification framework

We care about your privacy

Necessary

Functional

Preferences

External media and social features

Analytics

Marketing and conversions

EITCA Academy

LOG IN TO YOUR ACCOUNT

FORGOT YOUR PASSWORD?

CREATE AN ACCOUNT

We care about your privacy