×
1 Choose EITC/EITCA Certificates
2 Learn and take online exams
3 Get your IT skills certified

Confirm your IT skills and competencies under the European IT Certification framework from anywhere in the world fully online.

EITCA Academy

Digital skills attestation standard by the European IT Certification Institute aiming to support Digital Society development

LOG IN TO YOUR ACCOUNT

CREATE AN ACCOUNT FORGOT YOUR PASSWORD?

FORGOT YOUR PASSWORD?

AAH, WAIT, I REMEMBER NOW!

CREATE AN ACCOUNT

ALREADY HAVE AN ACCOUNT?
EUROPEAN INFORMATION TECHNOLOGIES CERTIFICATION ACADEMY - ATTESTING YOUR PROFESSIONAL DIGITAL SKILLS
  • SIGN UP
  • LOGIN
  • INFO

EITCA Academy

EITCA Academy

The European Information Technologies Certification Institute - EITCI ASBL

Certification Provider

EITCI Institute ASBL

Brussels, European Union

Governing European IT Certification (EITC) framework in support of the IT professionalism and Digital Society

  • CERTIFICATES
    • EITCA ACADEMIES
      • EITCA ACADEMIES CATALOGUE<
      • EITCA/CG COMPUTER GRAPHICS
      • EITCA/IS INFORMATION SECURITY
      • EITCA/BI BUSINESS INFORMATION
      • EITCA/KC KEY COMPETENCIES
      • EITCA/EG E-GOVERNMENT
      • EITCA/WD WEB DEVELOPMENT
      • EITCA/AI ARTIFICIAL INTELLIGENCE
    • EITC CERTIFICATES
      • EITC CERTIFICATES CATALOGUE<
      • COMPUTER GRAPHICS CERTIFICATES
      • WEB DESIGN CERTIFICATES
      • 3D DESIGN CERTIFICATES
      • OFFICE IT CERTIFICATES
      • BITCOIN BLOCKCHAIN CERTIFICATE
      • WORDPRESS CERTIFICATE
      • CLOUD PLATFORM CERTIFICATENEW
    • EITC CERTIFICATES
      • INTERNET CERTIFICATES
      • CRYPTOGRAPHY CERTIFICATES
      • BUSINESS IT CERTIFICATES
      • TELEWORK CERTIFICATES
      • PROGRAMMING CERTIFICATES
      • DIGITAL PORTRAIT CERTIFICATE
      • WEB DEVELOPMENT CERTIFICATES
      • DEEP LEARNING CERTIFICATESNEW
    • CERTIFICATES FOR
      • EU PUBLIC ADMINISTRATION
      • TEACHERS AND EDUCATORS
      • IT SECURITY PROFESSIONALS
      • GRAPHICS DESIGNERS & ARTISTS
      • BUSINESSMEN AND MANAGERS
      • BLOCKCHAIN DEVELOPERS
      • WEB DEVELOPERS
      • CLOUD AI EXPERTSNEW
  • FEATURED
  • SUBSIDY
  • HOW IT WORKS
  •   IT ID
  • ABOUT
  • CONTACT
  • MY ORDER
    Your current order is empty.
EITCIINSTITUTE
CERTIFIED

How do we populate dictionaries for the train and test sets?

by EITCA Academy / Monday, 07 August 2023 / Published in Artificial Intelligence, EITC/AI/MLP Machine Learning with Python, Programming machine learning, Applying own K nearest neighbors algorithm, Examination review

To populate dictionaries for the train and test sets in the context of applying one's own K nearest neighbors (KNN) algorithm in machine learning using Python, we need to follow a systematic approach. This process involves converting our data into a suitable format that can be used by the KNN algorithm.

First, let's understand the basic concept of dictionaries in Python. A dictionary is an unordered collection of key-value pairs, where each key is unique. In the context of machine learning, dictionaries are commonly used to represent datasets, where the keys correspond to the features or attributes, and the values represent the corresponding data points.

To populate dictionaries for the train and test sets, we need to perform the following steps:

1. Data Preparation: Start by collecting and preparing the data for our machine learning task. This typically involves cleaning the data, handling missing values, and transforming the data into a suitable format. Ensure that the data is properly labeled or categorized, as this is essential for supervised learning tasks.

2. Splitting the Dataset: Next, we need to split our dataset into two parts: the train set and the test set. The train set will be used to train our KNN algorithm, while the test set will be used to evaluate its performance. This split helps us assess how well our algorithm generalizes to unseen data.

3. Feature Extraction: Once the dataset is split, we need to extract the relevant features from the data and assign them as keys in our dictionaries. Features can be numerical or categorical, depending on the nature of our data. For example, if we are working with a dataset of images, we may extract features such as color histograms or texture descriptors.

4. Assigning Values: After extracting the features, we need to assign the corresponding values to each key in our dictionaries. These values represent the actual data points or instances in our dataset. Each instance should be associated with its corresponding feature values.

5. Train Set Dictionary: Create a dictionary to represent the train set. The keys of this dictionary will be the features, and the values will be lists or arrays containing the corresponding feature values for each instance in the train set. For example, if we have a dataset with two features (age and income) and three instances, the train set dictionary may look like this:

train_set = {'age': [25, 30, 35], 'income': [50000, 60000, 70000]}

6. Test Set Dictionary: Similarly, create a dictionary to represent the test set. The keys of this dictionary will be the same features as in the train set, and the values will be lists or arrays containing the corresponding feature values for each instance in the test set. For example, if we have a test set with two instances, the test set dictionary may look like this:

test_set = {'age': [40, 45], 'income': [80000, 90000]}

7. Utilizing the Dictionaries: Once the dictionaries for the train and test sets are populated, we can use them as inputs to our own KNN algorithm. The algorithm will utilize the feature values from the train set to make predictions or classifications for the instances in the test set.

By following these steps, we can effectively populate dictionaries for the train and test sets in the context of applying our own KNN algorithm in machine learning using Python. These dictionaries serve as the foundation for training and evaluating our algorithm's performance.

To populate dictionaries for the train and test sets, we need to prepare and split the dataset, extract the relevant features, assign the feature values to the corresponding keys in the dictionaries, and utilize these dictionaries in our own KNN algorithm.

Other recent questions and answers regarding Applying own K nearest neighbors algorithm:

  • How do we calculate the accuracy of our own K nearest neighbors algorithm?
  • What is the significance of the last element in each list representing the class in the train and test sets?
  • What is the purpose of shuffling the dataset before splitting it into training and test sets?
  • Why is it important to clean the dataset before applying the K nearest neighbors algorithm?

More questions and answers:

  • Field: Artificial Intelligence
  • Programme: EITC/AI/MLP Machine Learning with Python (go to the certification programme)
  • Lesson: Programming machine learning (go to related lesson)
  • Topic: Applying own K nearest neighbors algorithm (go to related topic)
  • Examination review
Tagged under: Artificial Intelligence, Data Preparation, Dictionaries, KNN Algorithm, Machine Learning, Python
Home » Applying own K nearest neighbors algorithm / Artificial Intelligence / EITC/AI/MLP Machine Learning with Python / Examination review / Programming machine learning » How do we populate dictionaries for the train and test sets?

Certification Center

USER MENU

  • My Account

CERTIFICATE CATEGORY

  • EITC Certification (105)
  • EITCA Certification (9)

What are you looking for?

  • Introduction
  • How it works?
  • EITCA Academies
  • EITCI DSJC Subsidy
  • Full EITC catalogue
  • Your order
  • Featured
  •   IT ID
  • EITCA reviews (Medium publ.)
  • About
  • Contact

EITCA Academy is a part of the European IT Certification framework

The European IT Certification framework has been established in 2008 as a Europe based and vendor independent standard in widely accessible online certification of digital skills and competencies in many areas of professional digital specializations. The EITC framework is governed by the European IT Certification Institute (EITCI), a non-profit certification authority supporting information society growth and bridging the digital skills gap in the EU.

Eligibility for EITCA Academy 80% EITCI DSJC Subsidy support

80% of EITCA Academy fees subsidized in enrolment by

    EITCA Academy Secretary Office

    European IT Certification Institute ASBL
    Brussels, Belgium, European Union

    EITC / EITCA Certification Framework Operator
    Governing European IT Certification Standard
    Access contact form or call +32 25887351

    Follow EITCI on X
    Visit EITCA Academy on Facebook
    Engage with EITCA Academy on LinkedIn
    Check out EITCI and EITCA videos on YouTube

    Funded by the European Union

    Funded by the European Regional Development Fund (ERDF) and the European Social Fund (ESF) in series of projects since 2007, currently governed by the European IT Certification Institute (EITCI) since 2008

    Information Security Policy | DSRRM and GDPR Policy | Data Protection Policy | Record of Processing Activities | HSE Policy | Anti-Corruption Policy | Modern Slavery Policy

    Automatically translate to your language

    Terms and Conditions | Privacy Policy
    EITCA Academy
    • EITCA Academy on social media
    EITCA Academy


    © 2008-2025  European IT Certification Institute
    Brussels, Belgium, European Union

    TOP
    Chat with Support
    Chat with Support
    Questions, doubts, issues? We are here to help you!
    End chat
    Connecting...
    Do you have any questions?
    Do you have any questions?
    :
    :
    :
    Send
    Do you have any questions?
    :
    :
    Start Chat
    The chat session has ended. Thank you!
    Please rate the support you've received.
    Good Bad