Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
subtitles [2021/02/20 19:12] – andi | subtitles [2025/05/26 01:08] (current) – andi | ||
---|---|---|---|
Line 1: | Line 1: | ||
= Subtitles / Untertitel | = Subtitles / Untertitel | ||
+ | [[software: | ||
+ | |||
+ | - die klassischen Workflows über die Django-App auf c3subtitles.de ([[https:// | ||
+ | - per Whisper/ | ||
+ | |||
+ | |||
+ | Vereinfacht gesagt geht es dabei immer um zwei Tasks: | ||
+ | |||
+ | - das Publizieren der SRT/VTT Datei | ||
+ | - das Anlegen der Metadaten (aka " | ||
+ | |||
+ | |||
+ | == Publishing Gateway/API | ||
+ | |||
+ | Um sich nicht mit den Eigenheiten der c3voc Infrastrktur herumschlagen zu müssen wurde 2023/2024 eine REST API eingerichtet die es erlaubt Zusatzdaten wie Slides, Subtitles etc. rein per REST zu publizieren, | ||
+ | |||
+ | - https:// | ||
+ | - https:// | ||
+ | |||
+ | Die dazu notwendigen API Keys sind aktuell noch statisch konfiguriert, | ||
+ | |||
+ | |||
+ | == Classic workflows | ||
+ | |||
+ | Components | ||
+ | |||
+ | - [[https:// | ||
+ | * is run every 10 minutes via systemd timer [[intern: | ||
+ | |||
+ | - [subtitleStatus](https:// | ||
+ | * Dashboard mit Übersicht des Transkriptionsstatus pro Konferenz und Vortrag | ||
+ | * Workflow-Manager | ||
+ | * … | ||
+ | * schiebt fertige Untertitel-Dateien (SRT) per rsync auf mirror.selfnet.de | ||
- | === Architecture overview in context of media.ccc.de | ||
{{:: | {{:: | ||
- | * Sync-Skript von subtitlesStatus -> vocotoweb: https:// | ||
- | * systemd Timer auf [[intern: | ||
- | * https:// | ||
- | * Dashboard mit Übersicht des Transkriptionsstatus pro Konferenz und Vortrag | ||
- | * Workflow-Manager | ||
- | * … | ||
- | * schiebt fertige Untertitel-Dateien (SRT) per rsync auf mirror.selfnet.de | ||
Line 36: | Line 62: | ||
For voctoweb (media.ccc.de) only states 7, 8 and 12 are relevant. Subtitle files in all other states should be ignored. | For voctoweb (media.ccc.de) only states 7, 8 and 12 are relevant. Subtitle files in all other states should be ignored. | ||
- | + | {{tablelayout? | |
- | ^ ID ^ c3subtitles ^ additional information ^ | + | ^ ID ^ voctoweb |
- | | 1 | Nothing available yet | irrelevant should not exist | | + | | 1 | | Nothing available yet | irrelevant should not exist |
- | | 2 | Transcribed until | should exist | | + | | 2 | todo |
- | | 3 | Transcript finished | might exist - still no timestamps | | + | | 3 | | Transcript finished |
- | | 4 | Please do not touch, work in progress | Autotiming in process no timestamps | | + | | 4 | | Please do not touch, work in progress |
- | | 5 | Synced until | rare case of syncing by hand | | + | | 5 | | Synced until |
- | | 6 | Syncing finished | with timestamps, usable as draft | | + | | 6 | | Syncing finished |
- | | 7 | Quality control done until | with timestamps, usable as draft | | + | | 7 | draft | Quality control done until |
- | | 8 | Job completed | finished, obviously with timestamps and usable | | + | | 8 | complete |
- | | 9 | Unknown | should not exist | | + | | 9 | | Unknown |
- | | 11 | Translated until | translation, | + | | 11 | todo |
- | | 12 | Translation is finished | finished, obviously with timestamps and usable | | + | | 12 | translated |
Line 55: | Line 81: | ||
<callout type=" | <callout type=" | ||
- | |||
- | * Web: [[https:// | ||
- | * Twitter: twitter.com/ | ||
- | * Mailinglist: | ||
- | * Mailinglist: | ||
- | * IRC: [[irc:// | ||
- | * Etherpad-Domain: | ||
- | * Jabber: c3subtitles -!-at-!- jabber.ccc.de | ||
- | * Videos on amara.org : [[http:// | ||
- | * E-Mail: subtitles -!-at-!- c3voc.de | ||
- | |||
- | |||
- | |||
- | == What is our goal? | ||
- | |||
- | Better and more barrierfree access to the live talks and streams and to the videos afterwards via subtitles. Especially for non-natives of the spoken languages and for deaf and hard of hearing listeners. | ||
- | |||
- | Nice side effect: finished subtitles are pretty easy to translate in any other language, [[http:// | ||
- | |||
- | |||
- | == How can I help? | ||
- | |||
- | * If you visit the congress and are a user of a speech recognition software, please contact us! Also if you are a computer stenography writer or a good touch typist. | ||
- | * If you are interested in what we are working on behind the scenes, just contact us! | ||
- | * Help us creating the subtitles via amara.org - you do not even have to visit the congress to do that! Everybody from at home can do that! | ||
- | |||
- | |||
- | == What are our current projects behind the scenes? | ||
- | |||
- | * Devoloping software for a user interface to choose which subtitle you want to work on depending on your favorite task | ||
- | * Developing software for subtitles via computer stenography or speech recognition, | ||
- | * Developing a phonetic german steno keyboard layout | ||
- | * Building a steno keyboard | ||
- | * Using an old mechanical stenographer with a micro controller to detect the pressed keys as steno input | ||
- | |||
- | [[intern: |