-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some records being assigned entries in CR_SUBJECT #121
Comments
@ibrahimjaved12 can you take a look at this? I looked at a couple of courses (e.g. 11.255 and 15.760a) and couldn't find any commonalities. |
@pdpinch It looks like there are two
The Issue and fixCurrently, we are relying on the first So the courses themselves are fine, it's that their ocw topics are being ignored. Can you please confirm this, so I can go ahead to creating a PR for this that uses Some additional analysis@pmusser I also performed an analysis of the OCW topics that might be useful. In flattened structure, we have 335 OCW topics. And we have 328 OCW topics to OER Subject in the mapping file. These are the OCW Topics that are missing OER Subject: African American Studies (15 published courses) However, these are 2nd-level subtopics or 3rd-level specialties in the unflattened hierarchy (OCW topics hierarchy follows the structure: topic --> subtopic --> speciality). Which means, their OER subject would also be populated by their parent category. So in essence, they should be populated, but regardless I wanted to share this in case we would like to have more specific subjects for them. OCW Topics Hierarchy for Missing Topics:
Analysis FileFor a complete list of OCW courses and their mappings, please see this analysis file: All OCW Topics Hierarchy fileIn any case, this is the list of the 52 courses that do not have a |
@ibrahimjaved12 If I recall correctly, the initial mapping for this app was written by a member of the OCW team, using ocw_topics_hierarchy.txt and the "Subject Area" values from OER Commons Search. Sometime later, the MIT Learn site established a new (simpler) topic hierarchy, which replaced the values in The original OCW topics should be available in the MIT Learn API via I'm not sure which topic hierarchy to use for OER mapping. The current plan is to keep both. Given that we already have a mapping from OCW Topics to OER Commons Subject Areas, the simpler solution is to complete that mapping. Can you identify which values have been overlooked, and I can refer them to the OCW team? |
I created a new test CSV using values from However, there are missing values in the CSV for these two courses:
Since both
Please note the current logic:
Please let me know how you'd like to proceed. |
Expected Behavior
When ocw_oer_export.csv generates, all entries should have a value in the CR_SUBJECT field.
Current Behavior
On the most recent export, 52 rows do not have a value in CR_SUBJECT.
Steps to Reproduce
Possible Solution
I had a casual look through the records to see if there was some common thread (e.g. publish time or obvious subject that isn't mapping) but nothing jumped out at me. If someone can give me an export either as a csv or even a comment attached to this issue containing the course name, URL, and OCW subject headings for the entries that haven't been mapped, I can give mappings for them.
The text was updated successfully, but these errors were encountered: