VOCSTATS Data Architecture
RTOpacks Knowledge Base — Data Layer Reference
Last updated: March 2026
Author: UCCA
Access: Internal reference — not for public distribution
Overview
RTOpacks integrates five VOCSTATS collections provided by the National Centre for Vocational Education Research (NCVER) under registered data access (account: RignoldT). All data is accessed via the SuperWEB2 Open Data REST API.
API base: https://vocstats.ncver.edu.au/webapi/rest/v1/
Authentication: APIKey header (stored in Cloudflare Worker secret VOCSTATS_API_KEY)
Data governance: NCVER data access terms. Attribution required on all public display. Perturbation applied to all cell values for confidentiality.
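As a minimal sketch of how a request to the API could be assembled from the base URL and the APIKey header described above: the helper name `build_request`, the `Accept` header, and the placeholder key are assumptions for illustration, and the `/schema` path is used only because it is the one endpoint mentioned in this document. Nothing is sent over the network.

```python
from typing import Dict, Tuple

API_BASE = "https://vocstats.ncver.edu.au/webapi/rest/v1/"


def build_request(endpoint: str, api_key: str) -> Tuple[str, Dict[str, str]]:
    """Return the URL and headers for a GET request (hypothetical helper)."""
    if not api_key:
        # In production the key would come from the VOCSTATS_API_KEY secret.
        raise ValueError("VOCSTATS_API_KEY is empty")
    url = API_BASE + endpoint.lstrip("/")
    headers = {
        "APIKey": api_key,             # header name as stated in this document
        "Accept": "application/json",  # assumption: JSON responses are wanted
    }
    return url, headers


url, headers = build_request("/schema", "example-key")
```

Keeping URL and header construction in one place means the key is read from the secret store once and never hard-coded in individual refresh jobs.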
The Five Collections
1. Total VET students and courses (TVA)
What it is: The definitive national dataset of all VET activity delivered by Australian RTOs. Includes TAFE institutes, universities, community education providers, enterprise providers, private providers, schools, and Australian RTOs delivering overseas. Excludes non-nationally recognised training and credit transfer.
Tables available:
- TVA program enrolments 2015–2024 — enrolments by qualification × year
- TVA program completions 2015–2024 — completions by qualification × year
- TVA subject enrolments 2015–2024 — unit/subject level enrolments
rtopacks-db tables:
- vocstats_enrolments — qual_code, collection_year, enrolments
- vocstats_completions — qual_code, collection_year, completions
Join key: qual_code → qualifications.qual_code
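The join described above could look roughly like the following, shown against an in-memory SQLite database purely for illustration. The engine behind rtopacks-db, the `title` column, the qualification code `QUAL001`, and all figures are assumptions, not taken from the real schema or data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE qualifications (qual_code TEXT PRIMARY KEY, title TEXT);
CREATE TABLE vocstats_enrolments (qual_code TEXT, collection_year INTEGER, enrolments INTEGER);
CREATE TABLE vocstats_completions (qual_code TEXT, collection_year INTEGER, completions INTEGER);
-- Made-up rows for illustration only.
INSERT INTO qualifications VALUES ('QUAL001', 'Placeholder qualification');
INSERT INTO vocstats_enrolments VALUES ('QUAL001', 2023, 900), ('QUAL001', 2024, 1000);
INSERT INTO vocstats_completions VALUES ('QUAL001', 2024, 250);
""")

# LEFT JOIN keeps enrolment years that have no matching completions row.
QUERY = """
SELECT q.qual_code, e.collection_year, e.enrolments, c.completions
FROM qualifications AS q
JOIN vocstats_enrolments AS e ON e.qual_code = q.qual_code
LEFT JOIN vocstats_completions AS c
       ON c.qual_code = e.qual_code AND c.collection_year = e.collection_year
WHERE q.qual_code = ?
ORDER BY e.collection_year
"""
rows = conn.execute(QUERY, ("QUAL001",)).fetchall()
```

Joining completions on both `qual_code` and `collection_year` avoids pairing one year's completions with another year's enrolments.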
Release cadence: Annual. NCVER publishes updated TVA data mid-year for the previous calendar year. Data back-cast to 2015.
Important note: VOCSTATS no longer provides a Total VET students database (de-duplicated student counts). Student counts require the DataBuilder product or a customised data request. What we hold is program and subject enrolment counts, which are activity records rather than unique student counts: one student enrolled in two programs is counted twice.
Qual page display:
- Most recent year enrolment count: "X enrolments nationally (2024)"
- 5-year sparkline: 2020–2024 trend
- Completion rate: completions ÷ enrolments × 100
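The two derived display values can be sketched as below. The function names, the one-decimal rounding, and the choice to return `None` for missing years or zero enrolments are assumptions. Note that this is a same-year ratio of two activity counts, not a cohort completion rate.

```python
from typing import Dict, List, Optional


def completion_rate(completions: int, enrolments: int) -> Optional[float]:
    """completions / enrolments * 100, or None when there are no enrolments."""
    if enrolments <= 0:
        return None
    return round(completions / enrolments * 100, 1)


def sparkline_series(by_year: Dict[int, int], latest_year: int,
                     span: int = 5) -> List[Optional[int]]:
    """Enrolments for the `span` years ending at latest_year; None marks a gap."""
    first_year = latest_year - span + 1
    return [by_year.get(year) for year in range(first_year, latest_year + 1)]
```

For example, `sparkline_series(enrolments_by_year, 2024)` covers 2020 to 2024, matching the 5-year window above, and leaves gaps visible rather than plotting them as zero.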
2. Government-funded students and courses
What it is: Subset of TVA covering only Commonwealth and state/territory government-funded training. Excludes all fee-for-service activity. Useful for understanding the funded training market specifically.
Tables available: Not yet catalogued; to be explored via /schema
rtopacks-db table: vocstats_govt_enrolments (future)
Release cadence: Quarterly (Jan–Mar, Jan–Jun, Jan–Sep, Jan–Dec) plus annual. Note: the Q1 2025 publication was delayed due to data quality issues; quarterly releases will resume with Jan–Jun 2025.
Qual page display: "X government-funded enrolments (2024)" — useful signal for RTOs assessing funding viability of a qualification.
3. Student outcomes
What it is: Annual survey of VET graduates and module completers conducted 6 months after completing training. Measures employment outcomes, further study, overall satisfaction, and reason for training.
Tables available: allusers
rtopacks-db table: vocstats_outcomes — qual_code (or foe_code/aqf_level), survey_year, outcome_type, outcome_pct
Important caveat: Student outcomes data may only be available at Field of Education (FOE) or AQF level, not individual qualification code level. If so, join to quals via FOE code and display as "X% of graduates in this field found employment" with FOE attribution.
Join key: qual_code or foe_code or aqf_level
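One way to handle the three possible join keys is a most-specific-first fallback, sketched below. The lookup structure, the placeholder codes, and the percentages are all made up for illustration; the real shape of vocstats_outcomes is still to be confirmed.

```python
from typing import Optional, Tuple

# Hypothetical stand-in for vocstats_outcomes rows, keyed by (level, code).
OUTCOMES = {
    ("foe", "FOE_X"): 78.5,   # made-up figure
    ("aqf", "AQF_X"): 74.0,   # made-up figure
}


def employment_outcome(qual_code: str, foe_code: str,
                       aqf_level: str) -> Optional[Tuple[str, float]]:
    """Return (level, pct) from the most specific level that has data."""
    for level, code in (("qual", qual_code), ("foe", foe_code), ("aqf", aqf_level)):
        pct = OUTCOMES.get((level, code))
        if pct is not None:
            return level, pct
    return None


def outcome_label(level: str, pct: float) -> str:
    """Word the display string so the attribution level is explicit."""
    scope = {"qual": "this qualification", "foe": "this field",
             "aqf": "this AQF level"}[level]
    return f"{pct:g}% of graduates in {scope} found employment"
```

Returning the level alongside the value keeps the page from implying qualification-level precision when only FOE or AQF data exists.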
Release cadence: Annual. Survey conducted mid-year, results published late in the same year.
Qual page display:
- "X% employed after training (survey year)"
- "X% went on to further study"
- "X% satisfied overall"

These complement the VNDA data already displayed: VNDA covers employment/income outcomes, while Student Outcomes covers satisfaction and reasons for training.
4. Apprentices and trainees
What it is: Quarterly data on Australian apprenticeship and traineeship training contracts. Covers commencements, completions, cancellations, and in-training counts by occupation (ANZSCO), industry (ANZSIC), state, age, gender, contract type.
Important note on in-training figures: In-training counts cannot be summed across quarters. Each figure is a snapshot of active contracts at the end of its quarter. Use commencements or completions for trend analysis.
rtopacks-db table: vocstats_apprentices — anzsco_code, collection_year, collection_quarter, state, contract_type, commencements, completions, in_training
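The difference between flow measures and the in-training snapshot can be illustrated with a small annual roll-up over the columns listed above. This is a sketch: the function name and row shape (a list of dicts) are assumptions, and the rows are assumed to be pre-filtered to a single anzsco_code, state, and contract_type.

```python
from typing import Dict, List


def annual_summary(rows: List[dict]) -> Dict[int, dict]:
    """Sum commencements and completions per year; keep only the
    last available quarter's in_training figure for each year."""
    ordered = sorted(rows, key=lambda r: (r["collection_year"], r["collection_quarter"]))
    summary: Dict[int, dict] = {}
    for row in ordered:
        year = summary.setdefault(
            row["collection_year"],
            {"commencements": 0, "completions": 0, "in_training": None},
        )
        year["commencements"] += row["commencements"]  # flow: safe to sum
        year["completions"] += row["completions"]      # flow: safe to sum
        year["in_training"] = row["in_training"]       # snapshot: latest quarter wins
    return summary
```

Because rows are sorted by quarter before the loop, the in_training value left in place at the end is the end-of-year snapshot (or the latest quarter published so far).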
Join key: anzsco_code → qualifications.anzsco_code
Release cadence: Quarterly. Each quarter published approximately 3 months after the reference period.
COVID caveat: Data from 2020 onwards is significantly disrupted. All display should note: "Data from 2020 reflects COVID-19 disruption to apprenticeship and traineeship activity."
Qual page display: "X apprenticeships/traineeships commenced in this occupation (2024)" — particularly relevant for trades and technical qualifications (CPC, MEM, UEE, AUR etc.)
5. VET in Schools (VIS) ← Strategic priority
What it is: Data on secondary school students (Years 10–12) enrolled in nationally recognised VET qualifications as part of their senior secondary certificate. One of the most data-rich and commercially underserved datasets in the entire VET ecosystem.
Data depth: Back to 2006 — 18+ years of longitudinal data. No commercial platform currently surfaces this at qualification level.
Tables available:
- ViS program enrolments 2006–2024
- ViS program completions 2006–2024
- ViS students 2006–2024
- ViS subject enrolments 2006–2023
rtopacks-db table: vocstats_vis_enrolments — qual_code, collection_year, state, school_sector, enrolments, completions
Join key: qual_code → qualifications.qual_code
Release cadence: Annual. Published mid-year for previous calendar year.
High-VIS training packages: SIT (hospitality), CHC (community services/early childhood), ICT (technology), CPC (construction), HLT (health), AUR (automotive). These will have the most data.
Qual page display:
- Distinct "Delivered in secondary schools" badge on qualifying quals
- "X secondary school students enrolled nationally (2024)"
- State breakdown available for deep dive
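The display items above could be derived from vocstats_vis_enrolments rows along these lines. The function name and the badge rule (any VIS enrolments in the chosen year) are assumptions, since the criterion for a "qualifying" qual is not defined here. Because cell values are perturbed, a national figure built by summing state cells may differ slightly from a published national total.

```python
from collections import Counter
from typing import List


def vis_summary(rows: List[dict], qual_code: str, year: int) -> dict:
    """Aggregate VIS enrolments across school sectors into state and national totals."""
    by_state: Counter = Counter()
    for row in rows:
        if row["qual_code"] == qual_code and row["collection_year"] == year:
            by_state[row["state"]] += row["enrolments"]
    national = sum(by_state.values())
    return {
        "badge": national > 0,  # assumed rule for the "Delivered in secondary schools" badge
        "national": national,
        "by_state": dict(by_state),
    }
```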
Strategic value: RTOs delivering VIS programmes have zero intelligence tooling. This is a genuine market gap. Surface it prominently — it signals sector depth that no competitor can match.
Data Freshness & Refresh Schedule
| Collection | DB Tables | Release cadence | Refresh trigger |
|---|---|---|---|
| TVA enrolments | vocstats_enrolments | Annual (mid-year) | Cron: 1 July |
| TVA completions | vocstats_completions | Annual (mid-year) | Cron: 1 July |
| Govt-funded | vocstats_govt_enrolments | Quarterly + annual | Cron: quarterly |
| Student outcomes | vocstats_outcomes | Annual (late year) | Cron: 1 November |
| Apprentices & trainees | vocstats_apprentices | Quarterly | Cron: quarterly |
| VET in Schools | vocstats_vis_enrolments | Annual (mid-year) | Cron: 1 July |
All refresh Workers send a completion notification to admin@ucca.com.au.
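The schedule in the table can be expressed as a simple lookup of which tables are due on a given date. This is a sketch only: the table says "Cron: quarterly" without naming dates, so the quarterly dates below (first day of January, April, July, and October) are an assumption, as is the function name.

```python
from datetime import date
from typing import List

ANNUAL_MID_YEAR = [(7, 1)]    # "Cron: 1 July"
ANNUAL_LATE_YEAR = [(11, 1)]  # "Cron: 1 November"
QUARTERLY = [(1, 1), (4, 1), (7, 1), (10, 1)]  # assumption: exact dates not specified

# (month, day) pairs on which each table's refresh fires.
REFRESH_DATES = {
    "vocstats_enrolments": ANNUAL_MID_YEAR,
    "vocstats_completions": ANNUAL_MID_YEAR,
    "vocstats_govt_enrolments": QUARTERLY,
    "vocstats_outcomes": ANNUAL_LATE_YEAR,
    "vocstats_apprentices": QUARTERLY,
    "vocstats_vis_enrolments": ANNUAL_MID_YEAR,
}


def tables_due(on: date) -> List[str]:
    """Return the tables whose refresh is scheduled for the given date."""
    return sorted(
        table for table, dates in REFRESH_DATES.items()
        if (on.month, on.day) in dates
    )
```

Under these assumed dates, 1 July is the heaviest day: three annual refreshes and both quarterly refreshes coincide.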
Display Attribution (mandatory on all public surfaces)
All VOCSTATS data displayed on rtopacks.com.au must include:
Source: NCVER VOCSTATS [Collection name] · Figures are perturbed statistical estimates · [Year]
The attribution must link to ncver.edu.au. This pattern is already established for VNDA data.
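A small formatter keeps the attribution wording identical across surfaces. The function name is hypothetical; the string follows the template above, and rendering the link to ncver.edu.au is left to the page template.

```python
def attribution(collection: str, year: int) -> str:
    """Build the mandatory attribution line for a VOCSTATS collection."""
    if not collection.strip():
        raise ValueError("collection name is required for attribution")
    return (
        f"Source: NCVER VOCSTATS {collection.strip()} "
        f"· Figures are perturbed statistical estimates · {year}"
    )


line = attribution("Total VET students and courses", 2024)
```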
What VOCSTATS is NOT
- Not a real-time feed — data is published annually or quarterly with significant lag
- Not individual student records — aggregated counts only
- Not a mirror of training.gov.au — it is a statistical reporting system, not a regulatory register
- Not a unique UCCA dataset — it is publicly available to any registered user. UCCA's value is in the enrichment, not the source data.