Differences

This shows you the differences between two versions of the page.

--- subtitles [2021/02/20 19:12] – andi
+++ subtitles [2025/05/26 01:08] (current) – andi
@@ Line 1: / Line 1: @@
 = Subtitles / Untertitel
+[[software:voctoweb]] hat mehr aktuell zwei Varianten über die publizierte Vorträge Untertitel bekommen können:
+- die klassischen Workflows über die Django-App auf c3subtitles.de ([[https://github.com/c3subtitles/subtitleStatus|subtitleStatus]])
+- per Whisper/[[https://github.com/bugbakery/transcribee|Transcribee]] über die [[https://publishing.c3voc.de/docs|Publishing API]]
+Vereinfacht gesagt geht es dabei immer um zwei Tasks:
+- das Publizieren der SRT/VTT Datei
+- das Anlegen der Metadaten (aka "Subtitle Recording") in voctoweb, damit die Untertitel auch im Player angezeigt werden
+== Publishing Gateway/API
+Um sich nicht mit den Eigenheiten der c3voc Infrastrktur herumschlagen zu müssen wurde 2023/2024 eine REST API eingerichtet die es erlaubt Zusatzdaten wie Slides, Subtitles etc. rein per REST zu publizieren, ohne das SFTP o.ä. verwendet werden muss:
+- https://publishing.c3voc.de/docs
+- https://github.com/voc/publishing-gw
+Die dazu notwendigen API Keys sind aktuell noch statisch konfiguriert, sollen aber perspektivisch an https://sso.c3voc.de/ angebunden werden.
+== Classic workflows
+Components
+- [[https://github.com/voc/scripts/blob/master/subtitles/sync_media_recordings.py|sync_media_recordings.py]] uses CSV export from `subtitlesStatus` to publish subtiles in [[vocotoweb]] via REST API:
+   * is run every 10 minutes via systemd timer [[intern:server:releasing.c3voc.de]]`:/etc/systemd/system/publish-subtitles.timer`
+- [subtitleStatus](https://github.com/c3subtitles/subtitleStatus) (Django-App @ c3subtitles.de)
+   * Dashboard mit Übersicht des Transkriptionsstatus pro Konferenz und Vortrag
+   * Workflow-Manager
+   * …
+   * schiebt fertige Untertitel-Dateien (SRT) per rsync auf mirror.selfnet.de
-=== Architecture overview in context of media.ccc.de
 {{::docs:architecture-overview-subtitles.png?600|}}
-* Sync-Skript von subtitlesStatus -> vocotoweb: https://github.com/voc/scripts/blob/master/subtitles/sync_media_recordings.py
-  * systemd Timer auf [[intern:server:releasing.c3voc.de]]
-* https://github.com/c3subtitles/subtitleStatus
-  * Dashboard mit Übersicht des Transkriptionsstatus pro Konferenz und Vortrag
-  * Workflow-Manager
-  * …
-  * schiebt fertige Untertitel-Dateien (SRT) per rsync auf mirror.selfnet.de
@@ Line 36: / Line 62: @@
 For voctoweb (media.ccc.de) only states 7, 8 and 12 are relevant. Subtitle files in all other states should be ignored.
+{{tablelayout?colwidth=""}}
-^ ID ^ c3subtitles ^ additional information ^
+^ ID  ^ voctoweb    ^ c3subtitles                            ^ additional information                          ^
-| 1 | Nothing available yet | irrelevant should not exist |
+| 1   |             | Nothing available yet                  | irrelevant should not exist                     |
-| 2 | Transcribed until | should exist |
+| 2   | todo        | Transcribed until                      | should exist                                    |
-| 3 | Transcript finished | might exist - still no timestamps |
+| 3   |             | Transcript finished                    | might exist - still no timestamps               |
-| 4 | Please do not touch, work in progress | Autotiming in process no timestamps |
+| 4   |             | Please do not touch, work in progress  | Autotiming in process no timestamps             |
-| 5 | Synced until | rare case of syncing by hand |
+| 5   |             | Synced until                           | rare case of syncing by hand                    |
-| 6 | Syncing finished | with timestamps, usable as draft |
+| 6   |             | Syncing finished                       | with timestamps, usable as draft                |
-| 7 | Quality control done until | with timestamps, usable as draft |
+| 7   | draft       | Quality control done until             | with timestamps, usable as draft                |
-| 8 | Job completed | finished, obviously with timestamps and usable |
+| 8   | complete    | Job completed                          | finished, obviously with timestamps and usable  |
-| 9 | Unknown | should not exist |
+| 9   |             | Unknown                                | should not exist                                |
-| 11 | Translated until | translation, not usable as draft |
+| 11  | todo        | Translated until                       | translation, not usable as draft                |
-| 12 | Translation is finished | finished, obviously with timestamps and usable |
+| 12  | translated  | Translation is finished                | finished, obviously with timestamps and usable  |
@@ Line 55: / Line 81: @@
 <callout type="warning">Achtung: Der Rest dieser Seite wird ggf. nicht mehr aktiv gepflegt. Das Team hat inzwischen eigene Infrastruktur: https://wiki.c3subtitles.de</callout>
-   * Web: [[https://c3subtitles.de/]]
-* Twitter: twitter.com/c3subtitles
-* Mailinglist: subtitles-angels -at- lists.selfnet.de
-* Mailinglist: subtitles -at- lists.ccc.de
-* IRC: [[irc://irc.hackint.org:9999/#subtitles|#subtitles]] auf hackint. **Requires SSL**. - but also the #voc channel
-* Etherpad-Domain: [[https://subtitles.pads.ccc.de]]
-* Jabber: c3subtitles -!-at-!- jabber.ccc.de
-* Videos on amara.org : [[http://www.amara.org/en/profiles/videos/c3subtitles/|c3subtitles videos on amara.org]]
-* E-Mail: subtitles -!-at-!- c3voc.de
-== What is our goal?
-Better and more barrierfree access to the live talks and streams and to the videos afterwards via subtitles. Especially for non-natives of the spoken languages and for deaf and hard of hearing listeners.
-Nice side effect: finished subtitles are pretty easy to translate in any other language, [[http://www.amara.org/en/profiles/videos/c3subtitles/|amara.org]] also provides a very easy usable interface for that purpose.
-== How can I help?
-  * If you visit the congress and are a user of a speech recognition software, please contact us! Also if you are a computer stenography writer or a good touch typist.
-  * If you are interested in what we are working on behind the scenes, just contact us!
-  * Help us creating the subtitles via amara.org - you do not even have to visit the congress to do that! Everybody from at home can do that!
-== What are our current projects behind the scenes?
-  * Devoloping software for a user interface to choose which subtitle you want to work on depending on your favorite task
-  * Developing software for subtitles via computer stenography or speech recognition, visible live in the talk via webstream and later as start for the precise version to work on in [[http://www.amara.org/de/profiles/profile/162037/|amara.org]]
-  * Developing a phonetic german steno keyboard layout
-  * Building a steno keyboard
-  * Using an old mechanical stenographer with a micro controller to detect the pressed keys as steno input
-[[intern:subtitles|Intern]]