×
1 Choose EITC/EITCA Certificates
2 Learn and take online exams
3 Get your IT skills certified

Confirm your IT skills and competencies under the European IT Certification framework from anywhere in the world fully online.

EITCA Academy

Digital skills attestation standard by the European IT Certification Institute aiming to support Digital Society development

LOG IN TO YOUR ACCOUNT

CREATE AN ACCOUNT FORGOT YOUR PASSWORD?

FORGOT YOUR PASSWORD?

AAH, WAIT, I REMEMBER NOW!

CREATE AN ACCOUNT

ALREADY HAVE AN ACCOUNT?
EUROPEAN INFORMATION TECHNOLOGIES CERTIFICATION ACADEMY - ATTESTING YOUR PROFESSIONAL DIGITAL SKILLS
  • SIGN UP
  • LOGIN
  • INFO

EITCA Academy

EITCA Academy

The European Information Technologies Certification Institute - EITCI ASBL

Certification Provider

EITCI Institute ASBL

Brussels, European Union

Governing European IT Certification (EITC) framework in support of the IT professionalism and Digital Society

  • CERTIFICATES
    • EITCA ACADEMIES
      • EITCA ACADEMIES CATALOGUE<
      • EITCA/CG COMPUTER GRAPHICS
      • EITCA/IS INFORMATION SECURITY
      • EITCA/BI BUSINESS INFORMATION
      • EITCA/KC KEY COMPETENCIES
      • EITCA/EG E-GOVERNMENT
      • EITCA/WD WEB DEVELOPMENT
      • EITCA/AI ARTIFICIAL INTELLIGENCE
    • EITC CERTIFICATES
      • EITC CERTIFICATES CATALOGUE<
      • COMPUTER GRAPHICS CERTIFICATES
      • WEB DESIGN CERTIFICATES
      • 3D DESIGN CERTIFICATES
      • OFFICE IT CERTIFICATES
      • BITCOIN BLOCKCHAIN CERTIFICATE
      • WORDPRESS CERTIFICATE
      • CLOUD PLATFORM CERTIFICATENEW
    • EITC CERTIFICATES
      • INTERNET CERTIFICATES
      • CRYPTOGRAPHY CERTIFICATES
      • BUSINESS IT CERTIFICATES
      • TELEWORK CERTIFICATES
      • PROGRAMMING CERTIFICATES
      • DIGITAL PORTRAIT CERTIFICATE
      • WEB DEVELOPMENT CERTIFICATES
      • DEEP LEARNING CERTIFICATESNEW
    • CERTIFICATES FOR
      • EU PUBLIC ADMINISTRATION
      • TEACHERS AND EDUCATORS
      • IT SECURITY PROFESSIONALS
      • GRAPHICS DESIGNERS & ARTISTS
      • BUSINESSMEN AND MANAGERS
      • BLOCKCHAIN DEVELOPERS
      • WEB DEVELOPERS
      • CLOUD AI EXPERTSNEW
  • FEATURED
  • SUBSIDY
  • HOW IT WORKS
  •   IT ID
  • ABOUT
  • CONTACT
  • MY ORDER
    Your current order is empty.
EITCIINSTITUTE
CERTIFIED

Can you use ML to ground on existing knowledge?

by Patrick van Vilsteren / Tuesday, 28 October 2025 / Published in Artificial Intelligence, EITC/AI/GCML Google Cloud Machine Learning, Introduction, What is machine learning

Machine learning (ML) is fundamentally centered on the concept of using data to automatically learn patterns, relationships, or rules without being explicitly programmed for every task. When considering whether ML can be used to "ground on existing knowledge," one is essentially asking whether ML systems can leverage, build upon, or integrate established bodies of knowledge—such as facts, taxonomies, ontologies, rules, or human expertise—into their learning processes and outputs.

This practice not only enhances the interpretability and robustness of ML models but also aligns them more closely with real-world applications where existing knowledge plays a major role. The following explanation explores the mechanisms, benefits, challenges, and examples of grounding ML systems on prior knowledge, especially within the context of contemporary cloud-based ML environments such as Google Cloud Machine Learning.

Understanding 'Grounding on Existing Knowledge' in ML

Grounding, in this context, refers to the process by which an ML model is linked to or utilizes external, pre-existing knowledge in order to inform its learning, predictions, or actions. This knowledge can take various forms, including:

– Structured knowledge (e.g., knowledge graphs, ontologies, relational databases)
– Unstructured knowledge (e.g., raw text, documents, manuals)
– Domain-specific rules (e.g., medical guidelines, scientific laws)
– Annotated datasets with embedded expert knowledge

The ability of ML to ground on existing knowledge is not only a technical consideration but also a philosophical one, touching upon how artificial systems can best imitate or augment human decision-making, which routinely depends on accumulated knowledge.

Mechanisms for Integrating Existing Knowledge in ML

There are several methodologies by which existing knowledge can be incorporated into ML systems:

1. Feature Engineering with Prior Knowledge
– In traditional ML workflows, domain experts often craft features based on their understanding of the problem. For instance, in medical diagnosis, features such as BMI, blood pressure, or genetic markers are selected based on prior clinical knowledge. These engineered features guide the model to focus on data aspects known to be important.

2. Knowledge-based Constraints and Regularization
– ML models can be constrained to respect certain rules or relationships known a priori. For example, in chemical property prediction, a model might be regularized so that it does not predict physically impossible properties (such as negative concentrations), directly encoding scientific laws into the learning process.

3. Use of Knowledge Graphs and Ontologies
– Knowledge graphs encode entities and their relationships, serving as a rich source of structured knowledge. ML models, especially in natural language processing (NLP), can leverage knowledge graphs like Wikidata or medical ontologies (e.g., SNOMED CT) to enrich text understanding, entity recognition, or reasoning tasks. For instance, an NLP model identifying diseases in text can ground its predictions in a medical ontology to improve accuracy and interpretability.

4. Hybrid Systems (Neuro-Symbolic Integration)
– Hybrid approaches combine symbolic reasoning (based on existing knowledge or rules) with data-driven learning. Systems like IBM’s Watson or Google’s Healthcare Natural Language API combine statistical ML with expert-curated knowledge bases, allowing them to answer questions with both empirical evidence and grounded reasoning.

5. Transfer Learning and Pre-training on Knowledge-rich Datasets
– Models can be pre-trained on large corpora that encode human knowledge (e.g., Wikipedia, scientific literature), thereby internalizing broad conceptual understanding before being fine-tuned for specific tasks. This practice has led to the success of large language models such as BERT and GPT.

6. Human-in-the-loop Learning
– In some scenarios, human feedback or correction is continuously integrated, effectively injecting expert knowledge into the learning process. In active learning and reinforcement learning with expert demonstrations, the model grounds its learning on both experiential data and human expertise.

Benefits of Grounding ML on Existing Knowledge

Grounding ML models on existing knowledge delivers multiple advantages:

– Improved Accuracy and Generalization
By constraining or guiding models with prior knowledge, it becomes less likely for them to overfit to spurious patterns in the data, especially in low-data regimes or when encountering previously unseen situations.

– Enhanced Explainability
Models that leverage explicit knowledge structures can provide more interpretable outputs, as their predictions are tied to understandable, human-recognizable concepts and rules.

– Reduced Data Requirements
When prior knowledge is available, models may require less labeled data to achieve satisfactory performance, since the knowledge serves as a form of inductive bias.

– Safety and Robustness
Hard-coding safety rules or scientific constraints ensures that models do not generate outputs that are unsafe, unethical, or physically impossible.

Challenges and Limitations

While grounding ML on existing knowledge is highly beneficial, several challenges persist:

– Knowledge Representation
Translating human or domain knowledge into a form that is compatible with ML algorithms is non-trivial. Ontologies, knowledge graphs, and rule sets must be designed, maintained, and updated.

– Integration Complexity
Combining symbolic and statistical components in a seamless, efficient way is a longstanding challenge. Neuro-symbolic methods are an area of active research.

– Knowledge Incompleteness and Bias
Existing knowledge bases may be incomplete, outdated, or biased. Relying solely on them can propagate errors or blind spots into ML systems.

– Scalability
Some forms of knowledge integration (e.g., reasoning over large knowledge graphs) can introduce computational bottlenecks.

Practical Examples in Google Cloud Machine Learning Context

1. Healthcare Predictive Analytics
– Google Cloud’s Healthcare API allows the integration of structured medical terminologies (like SNOMED CT and LOINC) with patient data. By grounding ML models in these ontologies, predictive models for patient outcomes can better reflect clinical realities and provide outputs that are interpretable by medical professionals.

2. Retail Product Categorization
– When building product search or recommendation engines, Google Cloud ML models can be trained using category taxonomies and product hierarchies. This enables the models to understand relationships such as “smartphone” is a type of “electronics,” improving relevance and consistency.

3. Natural Language Understanding with Pre-trained Models
– Google Cloud Natural Language API uses models pre-trained on large corpora, which encapsulate significant general knowledge about language, entities, and world facts. This grounding enables the system to accurately identify entities, sentiment, and syntax in user queries across domains.

4. Data Labeling with Human-in-the-loop
– Google Cloud Data Labeling Service allows human annotators to embed domain knowledge into datasets, which in turn grounds the models trained on these datasets. This approach is valuable in domains like autonomous driving, where expert human judgment is critical.

Techniques for Grounding in Practice

– Embedding Knowledge Graphs into ML Pipelines
Embedding techniques such as TransE or Node2Vec can turn symbolic knowledge from graphs into dense vector representations, allowing ML models to use this knowledge directly as features.

– Rule-based Post-processing
After ML inference, domain rules or knowledge bases can be used to filter, adjust, or validate predictions. For instance, a fraud detection system might use ML to flag suspicious transactions and then apply expert rules to escalate cases for human review.

– Multi-modal Learning
Some applications combine different data sources—text, images, structured data—each contributing unique forms of knowledge. For example, in agriculture, satellite images (visual knowledge) and weather reports (factual knowledge) can be fused for crop yield prediction.

Educational Value

Understanding how to ground ML in existing knowledge is critical for both practitioners and students of machine learning. It bridges the gap between purely data-driven statistical models and the vast wealth of structured human knowledge, ensuring that machine learning systems are not only powerful but also reliable, interpretable, and applicable to real-world problems.

Students and professionals alike benefit from mastering both the technical means (such as API integration, knowledge graph embeddings, and hybrid architectures) and the conceptual understanding (such as the role of inductive bias, the importance of explainability, and the risks of propagating bias) necessary to effectively ground ML models on existing knowledge.

The process also illustrates how the field of machine learning is moving toward models that are not isolated pattern recognizers but are instead increasingly context-aware, capable of reasoning with, and augmenting human knowledge. This direction holds promise for applications in science, healthcare, law, and any domain where existing knowledge is critical for responsible and effective automation.

Other recent questions and answers regarding What is machine learning:

  • Given that I want to train a model to recognize plastic types correctly, 1. What should be the correct model? 2. How should the data be labeled? 3. How do I ensure the data collected represents a real-world scenario of dirty samples?
  • How is Gen AI linked to ML?
  • How is a neural network built?
  • How can ML be used in construction and during the construction warranty period?
  • How are the algorithms that we can choose created?
  • How is an ML model created?
  • What are the most advanced uses of machine learning in retail?
  • Why is machine learning still weak with streamed data (for example, trading)? Is it because of data (not enough diversity to get the patterns) or too much noise?
  • How do ML algorithms learn to optimize themselves so that they are reliable and accurate when used on new/unseen data?
  • Answer in Slovak to the question "How can I know which type of learning is the best for my situation?

View more questions and answers in What is machine learning

More questions and answers:

  • Field: Artificial Intelligence
  • Programme: EITC/AI/GCML Google Cloud Machine Learning (go to the certification programme)
  • Lesson: Introduction (go to related lesson)
  • Topic: What is machine learning (go to related topic)
Tagged under: Artificial Intelligence, Explainability, Google Cloud, Human-in-the-Loop, Hybrid Systems, Knowledge Graphs, Machine Learning, Ontologies
Home » Artificial Intelligence » EITC/AI/GCML Google Cloud Machine Learning » Introduction » What is machine learning » » Can you use ML to ground on existing knowledge?

Certification Center

USER MENU

  • My Account

CERTIFICATE CATEGORY

  • EITC Certification (105)
  • EITCA Certification (9)

What are you looking for?

  • Introduction
  • How it works?
  • EITCA Academies
  • EITCI DSJC Subsidy
  • Full EITC catalogue
  • Your order
  • Featured
  •   IT ID
  • EITCA reviews (Medium publ.)
  • About
  • Contact

EITCA Academy is a part of the European IT Certification framework

The European IT Certification framework has been established in 2008 as a Europe based and vendor independent standard in widely accessible online certification of digital skills and competencies in many areas of professional digital specializations. The EITC framework is governed by the European IT Certification Institute (EITCI), a non-profit certification authority supporting information society growth and bridging the digital skills gap in the EU.
Eligibility for EITCA Academy 90% EITCI DSJC Subsidy support
90% of EITCA Academy fees subsidized in enrolment

    EITCA Academy Secretary Office

    European IT Certification Institute ASBL
    Brussels, Belgium, European Union

    EITC / EITCA Certification Framework Operator
    Governing European IT Certification Standard
    Access contact form or call +32 25887351

    Follow EITCI on X
    Visit EITCA Academy on Facebook
    Engage with EITCA Academy on LinkedIn
    Check out EITCI and EITCA videos on YouTube

    Funded by the European Union

    Funded by the European Regional Development Fund (ERDF) and the European Social Fund (ESF) in series of projects since 2007, currently governed by the European IT Certification Institute (EITCI) since 2008

    Information Security Policy | DSRRM and GDPR Policy | Data Protection Policy | Record of Processing Activities | HSE Policy | Anti-Corruption Policy | Modern Slavery Policy

    Automatically translate to your language

    Terms and Conditions | Privacy Policy
    EITCA Academy
    • EITCA Academy on social media
    EITCA Academy


    © 2008-2026  European IT Certification Institute
    Brussels, Belgium, European Union

    TOP
    CHAT WITH SUPPORT
    Do you have any questions?
    We will reply here and by email. Your conversation is tracked with a support token.