feat add speech reply #95

Open
yuluo1007 wants to merge 30 commits into agentscope-ai:main from yuluo1007:client_speech_lsd

Conversation

@yuluo1007
Collaborator

Description

[Please describe the background, purpose, changes made, and how to test this PR]

Checklist

Please check the following items before code is ready to be reviewed.

  • Code has been formatted with npm run format command in the root directory
  • Docstrings/comments are in appropriate style (TSDoc/JSDoc for TypeScript/JavaScript, Google style for Python)
  • Related documentation has been updated (e.g. links, examples, etc.)
  • Code is ready for review

@yuluo1007 yuluo1007 requested a review from zhijianma December 30, 2025 03:29

const handlePlaySpeech = useCallback(
    (replyId: string) => {
        // 停止其他正在播放的语音
Member

Remove the Chinese comments.

@yuluo1007 yuluo1007 changed the title from "feat speech answer" to "feat add speech reply" Jan 13, 2026
@zhijianma zhijianma requested a review from a team January 23, 2026 03:05
@zhijianma zhijianma requested a review from a team January 23, 2026 03:05
@zhijianma
Member

/gemini review

@zhijianma zhijianma removed the request for review from a team January 23, 2026 03:06

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces a significant new feature for speech playback of replies, including streaming audio, playback controls (play/pause, speed, volume), and auto-play functionality. The implementation is extensive, touching both client and server-side code. The client-side implementation in RunRoomContext.tsx is particularly complex, handling audio decoding and playback for both streaming and full-replay scenarios. While the overall approach is solid, I've identified a few issues, including a critical type error and some confusing or incomplete logic in the audio state management that should be addressed. The backend changes to support this feature are straightforward and look correct.
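
To make the audio state management concern concrete, here is a minimal sketch of the "stop whatever else is playing before starting a reply" rule that handlePlaySpeech appears to implement. It is not the code from this PR: the names PlaybackStatus, SpeechStates, playSpeech and pauseSpeech are hypothetical, and the state is modeled as a plain map so the transitions can be checked without any browser audio APIs.

```typescript
// Hypothetical per-reply playback state; these names are assumptions for
// illustration and are not taken from RunRoomContext.tsx.
type PlaybackStatus = "idle" | "playing" | "paused";
type SpeechStates = Map<string, PlaybackStatus>;

// Start (or resume) one reply and stop every other reply that is currently
// playing, so at most one speech is audible at a time.
// Returns a new map; the input map is left untouched.
function playSpeech(states: SpeechStates, replyId: string): SpeechStates {
    const next: SpeechStates = new Map();
    states.forEach((status, id) => {
        // Paused and idle replies keep their status; only playing ones stop.
        next.set(id, status === "playing" ? "idle" : status);
    });
    next.set(replyId, "playing");
    return next;
}

// Pause a reply only if it is actually playing; otherwise change nothing.
function pauseSpeech(states: SpeechStates, replyId: string): SpeechStates {
    const next: SpeechStates = new Map(states);
    if (next.get(replyId) === "playing") {
        next.set(replyId, "paused");
    }
    return next;
}
```

Keeping the transitions as pure functions like this is one possible design choice: the callback that owns the actual audio nodes only has to apply the difference between the old and the new map, which makes incomplete state handling easier to spot in review.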

@yuluo1007
Collaborator Author

/gemini review

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces speech reply functionality, allowing for audio playback of messages. It includes a new SpeechBar component, integrates speech state management into the RunRoomContext, and adds UI controls for playback rate, volume, and autoplay. The changes also involve updating data models and API endpoints to support speech data. Overall, the implementation is comprehensive, but there are a couple of areas that could be improved for correctness and accuracy.
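
As an illustration of the playback rate, volume, and autoplay controls mentioned in this review, the following sketch normalizes user-supplied settings before they would be applied to an audio element. It is written under assumptions: PlaybackSettings, normalizeSettings and the 0.5 to 2.0 rate range are hypothetical and not taken from the PR. The 0 to 1 volume range matches what HTMLMediaElement.volume accepts.

```typescript
// Hypothetical settings shape; field names are assumptions, not the PR's.
interface PlaybackSettings {
    playbackRate: number;
    volume: number;
    autoPlay: boolean;
}

// Assumed limits for the speed control; the PR may use different ones.
const MIN_RATE = 0.5;
const MAX_RATE = 2.0;

// Clamp a value into [min, max]; fall back when it is missing or not finite.
function clampOr(
    value: number | undefined,
    fallback: number,
    min: number,
    max: number,
): number {
    if (value === undefined || !Number.isFinite(value)) {
        return fallback;
    }
    return Math.min(max, Math.max(min, value));
}

// Fill in defaults and clamp out-of-range values.
function normalizeSettings(input: Partial<PlaybackSettings>): PlaybackSettings {
    return {
        playbackRate: clampOr(input.playbackRate, 1, MIN_RATE, MAX_RATE),
        // HTMLMediaElement.volume only accepts values between 0 and 1.
        volume: clampOr(input.volume, 1, 0, 1),
        autoPlay: input.autoPlay ?? false,
    };
}
```

For example, normalizeSettings({ volume: 1.7 }) would yield a volume of 1 with the default rate of 1 and autoplay off, so a stale or malformed stored preference cannot push an out-of-range value into the player.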

3 participants