Location: Canberra, ACT, AU Brisbane, QLD, AU Melbourne, VIC, AU
Company: CSIRO
The Opportunity
- Do you want to apply your data skills to be part of science and research at CSIRO?
- Are you passionate about open-source software and open data?
- Would you like to work on international collaborations?
The Atlas of Living Australia (ALA) is Australia’s largest national biodiversity data source funded by the National Collaborative Research Infrastructure Strategy (NCRIS) and hosted by CSIRO. The ALA is the Australian node of the Global Biodiversity Information Facility (GBIF). Our digital infrastructure is developed in-house to support research activities, government decision-making and community events.
As part of this program the ALA wishes to appoint a Data Analyst to work on data acquisition, transformation, loading, integration and quality assurance for a 1.5 year opportunity based at CSIRO in Canberra. Our team is technically oriented and uses multiple technologies and platforms to explore and manipulate large datasets into a standardised format, which we then ingest through our processing pipeline.
Your duties will include:
As the successful candidate you will develop new and support existing automated jobs to harvest data from a series of data providers including national and international data repositories, ensuring data currency and quality is consistent with expectations.
- Report to the EcoCommons Program Manager and Technical Lead to build and manage both automated and manual data loading processes and specifically focus on better integration of ALA-provided data into EcoCommons
- Architect a framework for the data lifecycle: from ingestion to processing to search to outputs in scientific workflows and analysis pipeline
- Assist in providing advice on engineering a pipeline for data ingestion and processing (automate as much as possible) to ensure dataset updates + additions are sustainable by the dev team
- Create guidelines for data management (incl. metadata, updates, criteria for inclusion.)
- Map datasets to required data standards (e.g. Darwin Core, Darwin Event Core, Humboldt Core).
- Implement, deploy, schedule, and maintain data load processes.
- Implement quality assurance and verification on datasets to ensure loaded records meet expectations
Location: Canberra, Melbourne, or Brisbane preferred
Salary: AU$102,724 to AU$111,165 pa (pro-rata for part-time) + up to 15.4% superannuation
Tenure: Specified term of 2 years
Reference: 78562
To be considered you will need:
Essential
- Strong knowledge of scripting languages in a command line environment – Python or R
- Experience in both delivering and consuming REST services
- Strong (extract, transform, load) ETL skills with large datasets with a focus on efficiency and scale
- Experience with a variety of open-source relational and non-relational databases
- Source code management using git, svn, or Bitbucket
- Effective stakeholder engagement and technical liaison skills
Desirable
- Experience with geospatial data systems and development
- Experience in processing species data (occupancy and/or abundance) as well as data on environmental variables (e.g. rainfall, temperature, soil characteristics etc.)
- Background or strong interest in biodiversity/ecology/taxonomy
- Enthusiasm and knowledge of open data standards, procedures and policy
- Experience with Darwin Core standard
- Experience with Apache Airflow
CSIRO is an Equal Opportunity employer working hard to recruit world-class talent that represents the diversity across our society.
For full details about this role please review the Position Description
Eligibility
To be eligible to work in CSIRO you must be an Australian Citizen, Permanent Resident or either hold, or be able to obtain, a valid working visa.
The successful applicant will be required to obtain and provide a National Police Check or equivalent. Additional integrity checks may be required for specific roles which require security clearance for working with children, Australian Government cybersecurity requirements or other identified security roles.
Flexible Working Arrangements
We work flexibly at CSIRO, offering a range of options for how, when and where you work.
Diversity and Inclusion
We are working hard to recruit people representing the diversity across our society, and ensure that all our people feel supported to do their best work and feel empowered to let their ideas flourish.
About CSIRO
At CSIRO Australia’s national science agency, we solve the greatest challenges through innovative science and technology. We put the safety and wellbeing of our people above all else and earn trust everywhere because we only deal in facts. We collaborate widely and generously and deliver solutions with real impact.
Join us and start creating tomorrow today!
How to Apply
Please apply on-line and provide a cover letter and CV that best demonstrate your motivation and ability to meet the requirements of this role.
Applications Close
7th November 2021, 11:00pm AEST