An NPU delivers 45 TOPS whereas the Google TPU v2 delivers 420 teraflops. Why and how do these chips differ from each other?
The comparison between Neural Processing Units (NPUs) and Tensor Processing Units (TPUs), particularly focusing on an NPU rated at 45 TOPS (tera operations per second) and the Google TPU v2 with 420 teraflops (TFLOPS), highlights fundamental architectural and operational differences between these classes of specialized hardware accelerators. Understanding these differences requires a thorough exploration of their…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Diving into the TPU v2 and v3
What is the difference between TPU and NPU?
The distinction between Tensor Processing Units (TPUs) and Neural Processing Units (NPUs) lies in their historical development, architectural design, target applications, and ecosystem integration within the domain of machine learning hardware acceleration. Both types of processors are purpose-built to handle the computational demands of artificial neural networks, yet each occupies a unique niche in the…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Tensor Processing Units - history and hardware
What is PyTorch?
PyTorch is an open-source deep learning framework developed primarily by Facebook’s AI Research lab (FAIR). It provides a flexible and dynamic computational graph architecture, making it highly suitable for research and production in the field of machine learning, particularly for artificial intelligence (AI) applications. PyTorch has gained widespread adoption among academic researchers and industry practitioners…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, PyTorch on GCP
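PyTorch's defining trait mentioned above is its dynamic ("define-by-run") computational graph: the graph is built as ordinary Python code executes, rather than compiled ahead of time. The following is a toy pure-Python sketch of that idea for scalar values only; it is not PyTorch code, and the `Value` class is an illustrative stand-in for what `torch.Tensor` with autograd does at scale.

```python
# Toy illustration of a dynamic ("define-by-run") computational graph,
# the core idea behind PyTorch's autograd. This is NOT PyTorch code --
# just a minimal scalar sketch of the concept.

class Value:
    """A scalar node that records how it was produced, so gradients
    can flow backward through the graph built at run time."""
    def __init__(self, data, parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None

    def __mul__(self, other):
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def __add__(self, other):
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def backward(self):
        # Topologically order the recorded graph, then apply the chain rule.
        order, seen = [], set()
        def visit(v):
            if v not in seen:
                seen.add(v)
                for p in v._parents:
                    visit(p)
                order.append(v)
        visit(self)
        self.grad = 1.0
        for v in reversed(order):
            v._backward()

# The graph for y = w * x + b is built as the Python expressions run.
w, x, b = Value(3.0), Value(2.0), Value(1.0)
y = w * x + b
y.backward()
print(y.data, w.grad, x.grad)  # 7.0 2.0 3.0
```

Because the graph is rebuilt on every forward pass, ordinary Python control flow (loops, conditionals) can change the graph from step to step, which is what makes the define-by-run style attractive for research.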
After the leap of TPU v3, does the future point to exascale with heterogeneous pods, new precisions beyond bfloat16, and co-optimized architectures with non-volatile memory for multimodal LLMs?
The development of Tensor Processing Units (TPUs) by Google has significantly accelerated the field of large-scale machine learning, particularly for deep learning models that underpin advances in language, vision, and multimodal artificial intelligence. The leap from TPU v2 to TPU v3 marked a substantial increase in computational throughput, memory bandwidth, and system architecture efficiency, positioning…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Diving into the TPU v2 and v3
In TPU v1, quantify the effect of FP32→int8 with per-channel vs per-tensor quantization and histogram vs MSE calibration on performance/watt, E2E latency, and accuracy, considering HBM, MXU tiling, and rescaling overhead.
The effect of quantization approaches—specifically FP32 to int8 with per-channel versus per-tensor schemes and histogram versus mean squared error (MSE) calibration—on Google TPU v1 performance and accuracy is multifaceted. The interplay among quantization granularity, calibration techniques, hardware tiling, memory bandwidth, and overheads such as rescaling must be comprehensively analyzed to understand their influence on performance…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Tensor Processing Units - history and hardware
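The per-channel versus per-tensor distinction in the question can be made concrete with a small NumPy sketch. This is not TPU v1 internals, just an illustration of symmetric int8 quantization on a weight matrix whose output channels have very different dynamic ranges (the case where per-channel scales pay off; the trade-off is one scale factor, and hence one rescaling, per channel instead of one per tensor).

```python
import numpy as np

# Illustrative sketch (not TPU v1 internals): symmetric int8 quantization,
# comparing per-tensor vs per-channel scale factors.
rng = np.random.default_rng(0)

# Two output channels (rows) with very different dynamic ranges.
w = np.stack([rng.normal(0, 1.0, 64), rng.normal(0, 0.01, 64)])

def quantize_dequantize(w, scale):
    """Quantize to int8 with the given scale, then dequantize so the
    reconstruction error can be measured against the FP32 original."""
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q.astype(np.float32) * scale

# Per-tensor: a single scale for the whole matrix.
per_tensor_scale = np.abs(w).max() / 127.0
err_tensor = np.mean(np.abs(w - quantize_dequantize(w, per_tensor_scale)))

# Per-channel: one scale per output channel (row).
per_channel_scale = np.abs(w).max(axis=1, keepdims=True) / 127.0
err_channel = np.mean(np.abs(w - quantize_dequantize(w, per_channel_scale)))

# The small-range channel is crushed by the shared per-tensor scale,
# but preserved by its own per-channel scale.
print(err_channel < err_tensor)  # True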
What specific vulnerabilities does the bag-of-words model present against adversarial attacks or data manipulation, and what practical countermeasures do you recommend implementing?
The bag-of-words (BoW) model is a foundational technique in natural language processing (NLP) that represents text as an unordered collection of words, disregarding grammar, word order, and, typically, word structure. Each document is converted into a vector based on word occurrence, often using either raw counts or term frequency-inverse document frequency (TF-IDF) values. Despite its…
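The order-insensitivity described above is exactly the property that manipulation attacks exploit: documents with opposite meanings can map to identical vectors. A minimal count-vector sketch (plain Python, no NLP library assumed) makes this concrete:

```python
from collections import Counter

# Minimal bag-of-words sketch: documents become unordered word-count
# vectors, so word order -- and with it, much of the meaning -- is lost.
def bow(text, vocab):
    counts = Counter(text.lower().split())
    return [counts[w] for w in vocab]

docs = ["the dog bit the man", "the man bit the dog"]
vocab = sorted(set(" ".join(docs).lower().split()))

v1, v2 = (bow(d, vocab) for d in docs)
print(v1 == v2)  # True -- opposite meanings, identical BoW vectors
```

An adversary can likewise reorder tokens, or pad a document with benign high-frequency words, without the classifier's input vector reflecting the semantic change.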
How can an activation atlas reveal hidden biases in CNNs by analyzing activations from multiple layers in complex images?
An Activation Atlas serves as a comprehensive visual tool that facilitates an in-depth understanding of the internal representations learned by convolutional neural networks (CNNs). By aggregating and clustering activation patterns from multiple layers in response to a diverse range of input images, the Activation Atlas provides a structured map that highlights how the network processes,…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, Understanding image models and predictions using an Activation Atlas
How is it ensured that the value of epsilon in TensorFlow Privacy complies with regulations like the GDPR without compromising the utility of the model?
Ensuring that the privacy parameter epsilon (ε) in TensorFlow Privacy adheres to regulatory frameworks such as the General Data Protection Regulation (GDPR) while maintaining model utility involves a multifaceted approach, combining rigorous privacy accounting, principled choices in differential privacy (DP) configuration, and careful consideration of data utility trade-offs. This process encompasses a detailed understanding of…
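The privacy-accounting idea behind checking ε against a pre-agreed budget can be sketched with basic sequential composition, under which the ε values of successive data releases simply add. Real accountants (e.g. the moments/RDP accountant used with DP-SGD) give much tighter bounds; the `within_budget` helper below is a hypothetical illustration of the budgeting logic only, not a TensorFlow Privacy API.

```python
# Sketch of naive sequential composition for privacy accounting: under
# basic composition, epsilons add across mechanisms, so the accountant
# checks the running total against a pre-agreed budget. This helper is
# illustrative only, not part of TensorFlow Privacy.
def within_budget(per_release_epsilons, budget):
    total = sum(per_release_epsilons)
    return total, total <= budget

total, ok = within_budget([0.5, 0.3, 0.4], budget=1.0)
print(round(total, 2), ok)  # 1.2 False -- the budget is already exceeded
```

The regulatory question is then which total ε a data-protection assessment treats as an acceptable anonymization guarantee, a threshold the GDPR itself does not fix numerically.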
Does using TensorFlow Privacy take more time to train a model than TensorFlow without privacy?
The use of TensorFlow Privacy, which provides differential privacy mechanisms for machine learning models, introduces additional computational overhead compared to standard TensorFlow model training. This increase in computational time is a direct result of the extra mathematical operations required to achieve differential privacy guarantees during the training process. Differential Privacy (DP) is a rigorous mathematical…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, TensorFlow privacy
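The extra operations responsible for that overhead in DP-SGD are per-example gradient clipping and Gaussian noise addition. The NumPy sketch below shows those two steps in isolation; it is not the TensorFlow Privacy API, and the parameter names (`clip_norm`, `noise_multiplier`) are chosen here for illustration.

```python
import numpy as np

# Sketch (not the TensorFlow Privacy API) of the two extra steps DP-SGD
# adds to every training step: per-example gradient clipping and Gaussian
# noise. Processing gradients one example at a time, instead of as one
# fused batched operation, is a main source of the added training time.
rng = np.random.default_rng(0)

def dp_sgd_gradient(per_example_grads, clip_norm=1.0, noise_multiplier=1.1):
    clipped = []
    for g in per_example_grads:            # one pass per example: overhead
        norm = np.linalg.norm(g)
        clipped.append(g * min(1.0, clip_norm / max(norm, 1e-12)))
    summed = np.sum(clipped, axis=0)
    # Noise scaled to the clipping norm masks any single example's effect.
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=summed.shape)
    return (summed + noise) / len(per_example_grads)

grads = rng.normal(size=(32, 10))          # 32 examples, 10 parameters
g = dp_sgd_gradient(grads)
print(g.shape)  # (10,)
```

Vectorized per-example gradients (e.g. via `tf.vectorized_map`) reduce but do not eliminate this overhead, which is why DP training is consistently slower than its non-private counterpart.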
Is AutoML Tables free?
AutoML Tables is a managed machine learning service provided by Google Cloud that enables users to build and deploy machine learning models on structured (tabular) data without requiring extensive expertise in machine learning or coding. It automates the process of data preprocessing, feature engineering, model selection, hyperparameter tuning, and model deployment, making it accessible for…
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Expertise in Machine Learning, AutoML Tables

