Getting started with semantic Climate
06 Jun, 2024
by Team
π A Beginnerβs Guide to #semanticClimate
Welcome to the #semanticClimate Internship Program β a hands-on opportunity to contribute to climate knowledge while building valuable technical skills.
This internship allows you to work at the intersection of climate science, data analysis, and semantic technologies , helping transform complex climate reports into structured, accessible knowledge.
π Why Join?
Interns are at the core of our work. By contributing to real-world climate data projects, you will:
Support efforts to make climate knowledge more accessible
Work with real scientific reports (e.g., IPCC)
Gain practical experience in tools, data analysis, and research workflows
Develop transferable skills in NLP, data processing, and web-based systems
β³ Internship Duration
Duration: 12 weeks to 6 months
Mode: Online or Offline (flexible)
π οΈ The Program
As an intern, you will install, test, and work with semanticClimate toolkits designed for analyzing and structuring climate reports.
These tools help convert unstructured documents into meaningful, machine-readable knowledge.
The tools are the following:
π Getting Started
Hereβs a list of general tasks to begin your internship :
Set up the development environment
Install required tools and dependencies
Run initial tests on sample climate documents
Explore tool functionalities and outputs
Document your observations and progress
π± What Youβll Gain
Experience working with real climate data
Skills in text mining, NLP, and knowledge extraction
Exposure to semantic web and knowledge graph concepts
Hands-on experience with collaborative and research workflows
π§ Internship Workflow
π’ PHASE 0: ONBOARDING
Set up your basic development environment:
Install python and pip
Install git and get an ID on GitHub
install pytest
be able to use a text editor (Notepad++, TextMate, or many others NOT Word)
have an IDE (VSCode , PyCharm , etc.). Go for the freely accessible community editions.
learn the basics of Markdown
π§ͺ PHASE 1: UNIT TESTING (amilib)
Learn to run tests on a real project.
Cloning amilib repository from github and running pytest from commandline/terminal
Open your terminal
Create a working directory:
mkdir myproject
cd myproject
Clone the repository
git clone https://github.com/petermr/amilib.git
Navigate into the folder:
cd amilib
Check available branches:
git branch -a
Switch to the required branch:
git checkout [branch-name]
Pull latest updates:
git pull
Run all the test:
pytest
Report any issues on the GitHub repository issue section .
π PHASE 2: PROGRESSING
Apply the same workflow used in Phase 1 to other repositories
Clone β Setup β Run tests β Analyze results
Explore different tools and understand their functionalities
Document errors, observations, and learning outcomes
Gradually work independently with minimal guidance
π§ PHASE 3: LEARNING GIT COMMANDS
Essential Git Commands
Command
Description
git clone <url>
Copy a remote repository to your local machine. The repository URL can be an HTTP(s) URL or an SSH URL.
git status
Check current changes and file states
git add <file>
Adds the specified file to the staging area, making it ready to be committed.
git commit -m "message"
Save changes with a message
git push origin <branch>
Uploads local branch commits to the remote repository branch.
git pull origin <branch>
Fetches and integrates changes from the remote repository to your local working directory.
git branch
List or create branches
git checkout <branch>
Switches to the specified branch.
git stash
Temporarily save changes
π Learn how to create a new branch
π Learn Regular Expressions (Regex):
π PHASE 4: CLIMATE REPORT SELECTION
Go to either IPCC official website or see the structure of the IPCC Reports from HERE
Navigate through the different working groups.
Watch the presentation video for the overview of IPCC/AR6 report.
Select a chapter from the IPCC/AR6 report that you find interesting.
Download and convert it to semantic form using amilib and amiclimate tool.
Create a dictionary specific to the selected chapter and annotating the chapter with dictionary terms.
How to use graphviz tool to make flowcharts, diagram etc.
Read about the projects done by Interns
Resources for climate knowledge
HIGH-PRIORITY:
See the resource page of #semanticClimate website.
Attend daily Zoom meetings.
Report problems to the specific github repositories as an issue with full description.
Update daily progress sheet (what I DID, what I WILL do, What's BLOCKING me ): CLICK HERE
Share weekly reports on every Friday.
Join our Slack workspace thatβs where we coordinate with team members and others.
Update your unavailability for meeting in #coordination channel of Slack.
Email to semanticclimate@gmail.com for any query.
← Back