Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
subtitles [2020/09/21 23:48] – andi | subtitles [2025/05/26 01:08] (current) – andi | ||
---|---|---|---|
Line 1: | Line 1: | ||
= Subtitles / Untertitel | = Subtitles / Untertitel | ||
+ | [[software: | ||
+ | |||
+ | - die klassischen Workflows über die Django-App auf c3subtitles.de ([[https:// | ||
+ | - per Whisper/ | ||
+ | |||
+ | |||
+ | Vereinfacht gesagt geht es dabei immer um zwei Tasks: | ||
+ | |||
+ | - das Publizieren der SRT/VTT Datei | ||
+ | - das Anlegen der Metadaten (aka " | ||
+ | |||
+ | |||
+ | == Publishing Gateway/API | ||
+ | |||
+ | Um sich nicht mit den Eigenheiten der c3voc Infrastrktur herumschlagen zu müssen wurde 2023/2024 eine REST API eingerichtet die es erlaubt Zusatzdaten wie Slides, Subtitles etc. rein per REST zu publizieren, | ||
+ | |||
+ | - https:// | ||
+ | - https:// | ||
+ | |||
+ | Die dazu notwendigen API Keys sind aktuell noch statisch konfiguriert, | ||
+ | |||
+ | |||
+ | == Classic workflows | ||
+ | |||
+ | Components | ||
+ | |||
+ | - [[https:// | ||
+ | * is run every 10 minutes via systemd timer [[intern: | ||
+ | |||
+ | - [subtitleStatus](https:// | ||
+ | * Dashboard mit Übersicht des Transkriptionsstatus pro Konferenz und Vortrag | ||
+ | * Workflow-Manager | ||
+ | * … | ||
+ | * schiebt fertige Untertitel-Dateien (SRT) per rsync auf mirror.selfnet.de | ||
- | === Architecture overview in context of media.ccc.de | ||
{{:: | {{:: | ||
- | <callout type=" | ||
- | * Sync-Skript von subtitlesStatus -> vocotoweb: https:// | ||
- | * SystemD Timer auf releasing | ||
- | * https:// | ||
- | * Dashboard mit Übersicht des Transkriptionsstatus pro Konferenz und Vortrag | ||
- | * Workflow-Manager | ||
- | * … | ||
- | * schiebt fertige Untertitel-Dateien (SRT) per rsync auf mirror.selfnet.de | ||
- | == Communication | + | === CSV-Export from C3Subtitles: |
- | * Web: [[https:// | + | https:// |
- | * Twitter: www.twitter.com/c3subtitles | + | |
- | * Mailinglist: | + | |
- | * Mailinglist: subtitles -at- lists.ccc.de | + | |
- | * IRC: [[irc://irc.hackint.org: | + | |
- | * Etherpad-Domain: | + | |
- | * Jabber: c3subtitles -!-at-!- jabber.ccc.de | + | |
- | * Videos on amara.org : [[http:// | + | |
- | * E-Mail: subtitles -!-at-!- c3voc.de | + | |
- | == What is our goal? | + | Example: |
- | Better and more barrierfree access to the live talks and streams and to the videos | + | ^ `GUID` ^ `complete` ^ `media_language` ^ `srt_language` ^ `last_changed_on_amara` ^ `revision` ^ `url` ^ `touched` ^ `amara_key` ^ `amara_language` ^ `state` ^ `amara_subtitle_url` ^ |
+ | | db11e86c-ecf8-40c0-b8d8-2f6798507146 | False | eng | en | 1-01-01T00: | ||
+ | | 221560a2-7470-4e90-9190-99a2bef53238 | False | deu | de | 1-01-01T00: | ||
+ | | 6beabddc-2dd6-43d2-9936-618d41d42cde | True | deu | de | 1-01-01T00: | ||
+ | | 7bdf7688-8620-4170-93bf-3c2adfd30030 | False | deu | de | 1-01-01T00: | ||
+ | | 52ce1398-fa9b-4bd3-aa9e-6a49a764ac2c | True | deu | de | 1-01-01T00: | ||
+ | | 1cff41a8-455e-42a6-ab08-d6cb166e7d3b | False | deu | de | 1-01-01T00: | ||
- | Nice side effect: finished subtitles are pretty easy to translate in any other language, [[http:// | ||
+ | To download the raw (draft) subtitles from Amara, append use `https:// | ||
- | == How can I help? | ||
- | * If you visit the congress and are a user of a speech recognition software, please contact us! Also if you are a computer stenography writer or a good touch typist. | + | ==== States |
- | * If you are interested in what we are working on behind the scenes, just contact us! | + | |
- | * Help us creating the subtitles via amara.org - you do not even have to visit the congress to do that! Everybody from at home can do that! | + | |
- | == What are our current projects behind the scenes? | + | For voctoweb (media.ccc.de) only states 7, 8 and 12 are relevant. Subtitle files in all other states should be ignored. |
+ | {{tablelayout?colwidth="" | ||
+ | ^ ID ^ voctoweb | ||
+ | | 1 | ||
+ | | 2 | todo | Transcribed until | should exist | | ||
+ | | 3 | ||
+ | | 4 | ||
+ | | 5 | ||
+ | | 6 | ||
+ | | 7 | draft | Quality control done until | with timestamps, usable as draft | | ||
+ | | 8 | complete | ||
+ | | 9 | ||
+ | | 11 | todo | Translated until | translation, | ||
+ | | 12 | translated | ||
- | * Devoloping software for a user interface to choose which subtitle you want to work on depending on your favorite task | ||
- | * Developing software for subtitles via computer stenography or speech recognition, | ||
- | * Developing a phonetic german steno keyboard layout | ||
- | * Building a steno keyboard | ||
- | * Using an old mechanical stenographer with a micro controller to detect the pressed keys as steno input | ||
- | [[intern:subtitles|Intern]] | + | == Communication |
+ | |||
+ | <callout type=" | ||
+ |