Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(datagen): Enhance JSON Dump in SelfInstructPipeline for Non-ASCII Character Readability #1612

Merged
merged 1 commit into from
Feb 15, 2025

Conversation

coolbeevip
Copy link
Collaborator

Description

Improve the SelfInstructPipeline by setting the ensure_ascii parameter to False in the json.dump function. This change allows non-ASCII characters to be written directly, significantly enhancing the human readability of the output JSON file.

Fixes #1611

Checklist

Go over all the following points, and put an x in all the boxes that apply.

  • I have read the CONTRIBUTION guide (required)
  • I have linked this PR to an issue using the Development section on the right sidebar or by adding Fixes #issue-number in the PR description (required)
  • I have checked if any dependencies need to be added or updated in pyproject.toml and poetry.lock
  • I have updated the tests accordingly (required for a bug fix or a new feature)
  • I have updated the documentation if needed:
  • I have added examples if this is a new feature

If you are unsure about any of these, don't hesitate to ask. We are here to help!

Copy link
Member

@Wendong-Fan Wendong-Fan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @coolbeevip !

@Wendong-Fan Wendong-Fan added the enhancement New feature or request label Feb 15, 2025
@Wendong-Fan Wendong-Fan added this to the Sprint 23 milestone Feb 15, 2025
@Wendong-Fan Wendong-Fan merged commit f640fb7 into camel-ai:master Feb 15, 2025
1 of 6 checks passed
apokryphosx pushed a commit that referenced this pull request Feb 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: No status
Development

Successfully merging this pull request may close these issues.

[Feature Request] Improve JSON Dump in SelfInstructPipeline for Non-ASCII Character Readability
2 participants