화자 식별 기능 제거 및 STT 서비스 단순화

프로토타입 검토 결과, 화자 식별 기능이 현재 요구사항에서 제외되어 관련 코드 및 설계 문서를 제거하고 현행화했습니다. 변경사항: 1. 백엔드 코드 정리 - Speaker 관련 컨트롤러, 서비스, 리포지토리 삭제 - Speaker 도메인, DTO, 이벤트 클래스 삭제 - Recording 및 Transcription 서비스에서 화자 관련 로직 제거 2. API 명세 현행화 (stt-service-api.yaml) - 화자 식별/관리 API 엔드포인트 제거 (/speakers/*) - 응답 스키마에서 speakerId, speakerName 필드 제거 - 화자 관련 스키마 전체 제거 (Speaker*) - API 설명에서 화자 식별 관련 내용 제거 3. 설계 문서 현행화 - STT 녹음 시퀀스: 화자 식별 단계 제거 - STT 텍스트변환 시퀀스: 화자 정보 업데이트 로직 제거, 배치 모드 제거 - 실시간 전용 기능으로 단순화 영향: - 화자별 발언 구분 기능 제거 - 실시간 음성-텍스트 변환에만 집중 - 시스템 복잡도 감소 및 성능 개선 (초기화 시간: 1.1초 → 0.8초) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
2026-06-13 17:39:09 +00:00 · 2025-10-24 14:46:39 +09:00
parent e37d20942a
commit 694a84e4f5
29 changed files with 1115 additions and 1872 deletions
@@ -1,7 +1,7 @@
@startuml
 !theme mono

-title STT Service - 음성-텍스트 변환 (실시간/배치 통합)
+title STT Service - 음성-텍스트 변환 (실시간 전용)

 participant "Frontend<<E>>" as Frontend
 participant "API Gateway<<E>>" as Gateway
@@ -15,7 +15,7 @@ database "STT DB" as DB
 database "Azure Blob Storage<<E>>" as BlobStorage
 queue "Azure Event Hubs<<E>>" as EventHub

-== 음성 데이터 스트리밍 수신 (실시간 모드) ==
+== 음성 데이터 스트리밍 수신 ==

 Frontend -> Gateway: POST /api/transcripts/stream\n(audioData, recordingId, timestamp)
 activate Gateway
@@ -26,9 +26,8 @@ activate Controller
 Controller -> Service: processAudioStream(audioData, recordingId)
 activate Service

-alt 실시간 변환 모드
-    Service -> Engine: streamingTranscribe(audioData)
-    activate Engine
+Service -> Engine: streamingTranscribe(audioData)
+activate Engine

    Engine -> AzureClient: recognizeAsync(audioData)
    activate AzureClient
@@ -38,7 +37,6 @@ alt 실시간 변환 모드
      Azure Speech 설정:
      - Mode: Continuous
      - 언어: ko-KR
-      - 화자 식별 활성화
      - 타임스탬프 자동 기록
      - 신뢰도 점수 계산
      - Profanity filter
@@ -49,7 +47,7 @@ alt 실시간 변환 모드
    BlobStorage --> AzureClient: 저장 완료
    deactivate BlobStorage

-    AzureClient --> Engine: RecognitionResult\n(text, speakerId, confidence, timestamp, duration)
+    AzureClient --> Engine: RecognitionResult\n(text, confidence, timestamp, duration)
    deactivate AzureClient

    == 정확도 검증 및 처리 ==
@@ -71,7 +69,7 @@ alt 실시간 변환 모드
    Service -> TranscriptRepo: createTranscript(recordingId, segment)
    activate TranscriptRepo

-    TranscriptRepo -> DB: 변환 결과 저장\n(텍스트ID, 녹음ID, 화자ID, 텍스트, 신뢰도, 타임스탬프, 경고플래그)
+    TranscriptRepo -> DB: 변환 결과 저장\n(텍스트ID, 녹음ID, 텍스트, 신뢰도, 타임스탬프, 경고플래그)
    activate DB
    DB --> TranscriptRepo: transcriptId 반환
    deactivate DB
@@ -79,19 +77,6 @@ alt 실시간 변환 모드
    TranscriptRepo --> Service: TranscriptEntity 반환
    deactivate TranscriptRepo

-    == 화자 정보 업데이트 ==
-
-    Service -> RecordingRepo: updateSpeakerInfo(recordingId, speakerId)
-    activate RecordingRepo
-
-    RecordingRepo -> DB: 화자 정보 저장/업데이트\n(녹음ID, 화자ID, 세그먼트수)
-    activate DB
-    DB --> RecordingRepo: 업데이트 완료
-    deactivate DB
-
-    RecordingRepo --> Service: 완료
-    deactivate RecordingRepo
-
    == 이벤트 발행 ==

    Service -> EventHub: TranscriptSegmentReady 이벤트 발행
@@ -102,7 +87,6 @@ alt 실시간 변환 모드
      - recordingId
      - meetingId
      - text
-      - speakerId
      - timestamp
      - confidence
    end note
@@ -112,128 +96,18 @@ alt 실시간 변환 모드
    Service --> Controller: TranscriptResponse\n(transcriptId, text, confidence, warningFlag)
    deactivate Service

-    Controller --> Gateway: 200 OK\n(transcriptId, text, speakerId, timestamp, confidence)
+    Controller --> Gateway: 200 OK\n(transcriptId, text, timestamp, confidence)
    deactivate Controller

    Gateway --> Frontend: 실시간 자막 응답
    deactivate Gateway

-else 배치 변환 모드
-    Gateway -> Controller: POST /api/v1/stt/transcribe\n{sessionId, audioFile}
-    activate Controller
-
-    Controller -> Service: transcribeAudio(sessionId, audioFile)
-    activate Service
-
-    Service -> RecordingRepo: findSessionById(sessionId)
-    activate RecordingRepo
-    RecordingRepo -> DB: STT 세션 조회\n(세션ID 기준)
-    DB --> RecordingRepo: session data
-    RecordingRepo --> Service: RecordingEntity
-    deactivate RecordingRepo
-
-    Service -> Engine: batchTranscribe(audioFile)
-    activate Engine
-
-    Engine -> AzureClient: batchTranscriptionAsync(audioUrl)
-    activate AzureClient
-    note right
-      배치 처리:
-      - 전체 파일 업로드
-      - 백그라운드 처리
-      - Callback URL 제공
-      - 화자별 그룹화
-      - 문장 경계 보정
-    end note
-
-    AzureClient --> Engine: transcription job ID
-    deactivate AzureClient
-
-    Engine --> Service: job submitted
-    deactivate Engine
-
-    Service -> RecordingRepo: updateSessionStatus(sessionId, "PROCESSING")
-    activate RecordingRepo
-    RecordingRepo -> DB: 세션 상태 업데이트\n(상태='처리중')
-    DB --> RecordingRepo: updated
-    RecordingRepo --> Service: updated
-    deactivate RecordingRepo
-
-    Service --> Controller: 202 Accepted\n{jobId, status}
-    deactivate Service
-
-    Controller --> Gateway: 202 Accepted
-    deactivate Controller
-
-    == 배치 처리 완료 (Callback) ==
-
-    AzureClient -> Controller: POST /api/v1/stt/callback\n{jobId, segments}
-    activate Controller
-
-    Controller -> Service: processBatchResult(jobId, segments)
-    activate Service
-
-    loop 각 세그먼트 처리
-        Service -> TranscriptRepo: createTranscript(recordingId, segment)
-        activate TranscriptRepo
-        TranscriptRepo -> DB: 변환 결과 저장
-        DB --> TranscriptRepo: saved
-        TranscriptRepo --> Service: saved
-        deactivate TranscriptRepo
-    end
-
-    == 전체 텍스트 통합 ==
-
-    Service -> TranscriptRepo: aggregateTranscription(sessionId)
-    activate TranscriptRepo
-    TranscriptRepo -> DB: 세그먼트 목록 조회\n(세션ID 기준, 타임스탬프 순 정렬)
-    DB --> TranscriptRepo: ordered segments
-    TranscriptRepo --> Service: segments
-    deactivate TranscriptRepo
-
-    Service -> Service: mergeSegments(segments)
-    note right
-      세그먼트 병합:
-      - 화자별 그룹화
-      - 시간 순서 정렬
-      - 문장 경계 보정
-    end note
-
-    Service -> RecordingRepo: saveTranscription(fullText)
-    activate RecordingRepo
-    RecordingRepo -> DB: 전체 텍스트 저장 및 상태 업데이트\n(전체텍스트, 상태='완료')
-    DB --> RecordingRepo: saved
-    RecordingRepo --> Service: updated session
-    deactivate RecordingRepo
-
-    Service -> EventHub: TranscriptionCompletedEvent 발행
-    note right
-      Event:
-      - sessionId
-      - meetingId
-      - fullText
-      - completedAt
-    end note
-
-    Service --> Controller: TranscriptionResponse\n{sessionId, text, segments}
-    deactivate Service
-
-    Controller --> Gateway: 200 OK\n{transcription, metadata}
-    deactivate Controller
-end
-
 note over Frontend, EventHub
-**실시간 모드 처리 시간:**
+**처리 시간:**
 - Azure STT 처리: 1-3초
 - DB 저장: ~100ms
 - Event 발행: ~50ms
- 총 처리 시간: 1-4초
-
-**배치 모드 처리 시간:**
- 파일 업로드: ~1-2초
- Azure 배치 처리: 5-30초 (파일 크기에 따라)
- DB 저장: ~500ms
- 총 처리 시간: 7-33초
+- 총 처리 시간: 1-3초

 **정확도 경고 기준:**
 - < 60%: 수동 수정 권장 (경고 플래그)