You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please let me know if you need a reprex, but I notice that when extracting tables using docx_summary, any hyphens in words are removed, e.g. "Inspector-General" becomes "InspectorGeneral".
This occurred when extracting tables from the Australian Government Budget at this link.
Thanks to @elipousson for pointing out that this is probably due to docx_summary not the {officerExtras} package.
The text was updated successfully, but these errors were encountered:
trekonom
added a commit
to trekonom/officer
that referenced
this issue
May 18, 2024
The issue is that the hyphens in your table are non breaking hyphens which are added as a <noBreakHyphen> element. As a result these hyphens get dropped in docx_summary as it only extracts text. I just added a PR with a possible fix. See #575 .
This old thread has been automatically locked. If you think you have found something related to this, please open a new issue and link to this old issue if necessary.
Please let me know if you need a reprex, but I notice that when extracting tables using docx_summary, any hyphens in words are removed, e.g. "Inspector-General" becomes "InspectorGeneral".
This occurred when extracting tables from the Australian Government Budget at this link.
Thanks to @elipousson for pointing out that this is probably due to docx_summary not the {officerExtras} package.
The text was updated successfully, but these errors were encountered: