Is it correct that the initial dataset can be split into three main subsets: the training set, the validation set (to fine-tune parameters), and the testing set (to check performance on unseen data)?
Sunday, 26 November 2023
by Aleksandar Babic
It is indeed correct that the initial dataset in machine learning can be divided into three main subsets: the training set, the validation set, and the testing set. These subsets serve specific purposes in the machine learning workflow and play an important role in developing and evaluating models. The training set is the largest subset
- Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, First steps in Machine Learning, The 7 steps of machine learning
Tagged under:
Artificial Intelligence, Dataset Splitting, Machine Learning, Testing Set, Training Set, Validation Set
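As an illustration of the three-way split described in the entry above, here is a minimal sketch in plain Python. The function name `split_dataset`, the 70/15/15 proportions, and the fixed seed are assumptions chosen for the example; they do not come from the original answer, and real projects often use a library helper instead.

```python
import random


def split_dataset(examples, val_fraction=0.15, test_fraction=0.15, seed=42):
    """Shuffle examples and split them into (train, validation, test) lists.

    The fractions and seed are illustrative defaults, not recommendations.
    """
    if val_fraction < 0 or test_fraction < 0 or val_fraction + test_fraction >= 1:
        raise ValueError("fractions must be non-negative and sum to less than 1")

    # Copy first so the caller's data is left untouched, then shuffle
    # with a seeded generator so the split is reproducible.
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)

    n_val = round(len(shuffled) * val_fraction)
    n_test = round(len(shuffled) * test_fraction)

    # Held-out test data comes first, then validation; the rest is training.
    test = shuffled[:n_test]
    validation = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, validation, test


train, validation, test = split_dataset(range(100))
print(len(train), len(validation), len(test))  # 70 15 15
```

The training portion is kept as the largest subset, the validation portion is meant for tuning parameters, and the test portion should only be used once at the end to estimate performance on unseen data.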
How should the input data be formatted for AI Platform Training with built-in algorithms?
Wednesday, 02 August 2023
by EITCA Academy
To properly format input data for AI Platform Training with built-in algorithms, it is essential to follow specific guidelines to ensure accurate and efficient model training. AI Platform provides a variety of built-in algorithms, such as XGBoost, DNN, and Linear Learner, each with its own requirements for data formatting. In this answer, we will discuss

