Subtitles / Untertitel
Architecture overview in the context of media.ccc.de
Note: The rest of this page may no longer be actively maintained. The team now runs its own infrastructure: https://wiki.c3subtitles.de
- Sync script from subtitlesStatus → voctoweb: https://github.com/voc/scripts/blob/master/subtitles/sync_media_recordings.py
- systemd timer on releasing.c3voc.de
-
- Dashboard with an overview of the transcription status per conference and talk
- Workflow manager
- …
- pushes finished subtitle files (SRT) to mirror.selfnet.de via rsync
CSV export from C3Subtitles:
https://c3subtitles.de/media_export/2020-12-30T0:00:00.99Z
Example:
GUID;complete;media_language;srt_language;last_changed_on_amara;revision;url;touched;amara_key;amara_language;state;amara_subtitle_url
db11e86c-ecf8-40c0-b8d8-2f6798507146;False;eng;en;1-01-01T00:00:00Z;1;https://mirror.selfnet.de/c3subtitles/events/rc3/rc3-mcr-11546-eng-deu-Measuring_radioactivity_using_low-cost_silicon_sensors.en.srt;2021-01-03T12:30:45Z;P4gHpqpuJJIA;en;2;https://amara.org/api/videos/P4gHpqpuJJIA/languages/en/subtitles/
221560a2-7470-4e90-9190-99a2bef53238;False;deu;de;1-01-01T00:00:00Z;1;https://mirror.selfnet.de/c3subtitles/events/rc3/rc3-mcr-11574-deu-Globalisierung_Digitalisierung_und_die_Wachstumsfrage.de.srt;2021-01-06T10:37:48Z;dSkmesksKqIe;de;2;https://amara.org/api/videos/dSkmesksKqIe/languages/de/subtitles/
6beabddc-2dd6-43d2-9936-618d41d42cde;True;deu;de;1-01-01T00:00:00Z;5;https://mirror.selfnet.de/c3subtitles/congress/35c3/35c3-9744-deu-eng-Inside_the_Fake_Science_Factories.de.srt;2021-01-01T23:14:02Z;vsf2PBryeqW9;de;8;https://amara.org/api/videos/vsf2PBryeqW9/languages/de/subtitles/
7bdf7688-8620-4170-93bf-3c2adfd30030;False;deu;de;1-01-01T00:00:00Z;1;https://mirror.selfnet.de/c3subtitles/congress/36c3/36c3-10983-deu-eng-Its_alive_-_Nach_den_Protesten_gegen_die_Polizeigesetze_ist_vor_den_Protesten_gegen_die_autoritaere_Wende.de.srt;2020-12-30T10:58:21Z;NWHl9bK0MXF8;de;2;https://amara.org/api/videos/NWHl9bK0MXF8/languages/de/subtitles/
52ce1398-fa9b-4bd3-aa9e-6a49a764ac2c;True;deu;de;1-01-01T00:00:00Z;7;https://mirror.selfnet.de/c3subtitles/congress/35c3/35c3-9343-deu-eng-Court_in_the_Akten.de.srt;2021-01-04T22:17:06Z;3mDTDpROSDxZ;de;8;https://amara.org/api/videos/3mDTDpROSDxZ/languages/de/subtitles/
1cff41a8-455e-42a6-ab08-d6cb166e7d3b;False;deu;de;1-01-01T00:00:00Z;1;https://mirror.selfnet.de/c3subtitles/congress/35c3/35c3-10036-deu-eng-Mondnacht.de.srt;2021-01-04T22:12:21Z;gAbEiRm8Mocc;de;2;https://amara.org/api/videos/gAbEiRm8Mocc/languages/de/subtitles/
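
A consumer of this export could read it with a few lines of Python. The following is only a minimal sketch, assuming the export delivers one record per line with `;` as the separator and the header shown in the example; the function name `parse_export` and the type conversions are illustrative and not taken from the actual sync script.

```python
import csv
import io

# Header and two records copied from the example export above.
# Assumption: the real export has one record per line.
SAMPLE = """GUID;complete;media_language;srt_language;last_changed_on_amara;revision;url;touched;amara_key;amara_language;state;amara_subtitle_url
db11e86c-ecf8-40c0-b8d8-2f6798507146;False;eng;en;1-01-01T00:00:00Z;1;https://mirror.selfnet.de/c3subtitles/events/rc3/rc3-mcr-11546-eng-deu-Measuring_radioactivity_using_low-cost_silicon_sensors.en.srt;2021-01-03T12:30:45Z;P4gHpqpuJJIA;en;2;https://amara.org/api/videos/P4gHpqpuJJIA/languages/en/subtitles/
6beabddc-2dd6-43d2-9936-618d41d42cde;True;deu;de;1-01-01T00:00:00Z;5;https://mirror.selfnet.de/c3subtitles/congress/35c3/35c3-9744-deu-eng-Inside_the_Fake_Science_Factories.de.srt;2021-01-01T23:14:02Z;vsf2PBryeqW9;de;8;https://amara.org/api/videos/vsf2PBryeqW9/languages/de/subtitles/
"""


def parse_export(text):
    """Parse the semicolon-separated export into a list of dicts.

    'complete' becomes a bool, 'revision' and 'state' become ints;
    all other columns stay strings.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter=";")
    rows = []
    for row in reader:
        row["complete"] = row["complete"] == "True"
        row["revision"] = int(row["revision"])
        row["state"] = int(row["state"])
        rows.append(row)
    return rows


rows = parse_export(SAMPLE)

# SRT URLs of all subtitles that are marked as complete.
completed = [r["url"] for r in rows if r["complete"]]
print(completed)
```

The timestamps are deliberately left as strings here, because values such as `1-01-01T00:00:00Z` in `last_changed_on_amara` may not parse with a strict date parser.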
States
ID | c3subtitles | additional information |
---|---|---|
1 | Nothing available yet | irrelevant, should not exist |
2 | Transcribed until | should exist |
3 | Transcript finished | might exist, still no timestamps |
4 | Please do not touch, work in progress | auto-timing in progress, no timestamps |
5 | Synced until | rare case of syncing by hand |
6 | Syncing finished | with timestamps, usable as draft |
7 | Quality control done until | with timestamps, usable as draft |
8 | Job completed | finished, obviously with timestamps and usable |
9 | Unknown | should not exist |
11 | Translated until | translation, not usable as draft |
12 | Translation is finished | finished, obviously with timestamps and usable |
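
The table can be turned into a small lookup for the `state` column of the CSV export. This is a sketch only: the labels are copied from the table, but the grouping into "finished", "draft" and "not usable" is one reading of the "additional information" column, and the names `STATES`, `DRAFT_STATES`, `FINISHED_STATES` and `usability` are hypothetical.

```python
# State IDs and labels copied from the table above (there is no ID 10).
STATES = {
    1: "Nothing available yet",
    2: "Transcribed until",
    3: "Transcript finished",
    4: "Please do not touch, work in progress",
    5: "Synced until",
    6: "Syncing finished",
    7: "Quality control done until",
    8: "Job completed",
    9: "Unknown",
    11: "Translated until",
    12: "Translation is finished",
}

# Assumption: grouping derived from the "additional information" column.
DRAFT_STATES = {6, 7}      # with timestamps, usable as draft
FINISHED_STATES = {8, 12}  # finished, with timestamps and usable


def usability(state_id):
    """Classify a state ID as 'finished', 'draft' or 'not usable'."""
    if state_id in FINISHED_STATES:
        return "finished"
    if state_id in DRAFT_STATES:
        return "draft"
    # Everything else (including partially synced state 5) is treated
    # as not usable in this sketch.
    return "not usable"


for state_id in (2, 6, 8):
    print(state_id, STATES[state_id], "->", usability(state_id))
```

State 5 (syncing by hand) is treated as not usable here because the table does not say whether a partially synced file is fit for publishing; a real consumer may decide differently.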
Communication
- Twitter: www.twitter.com/c3subtitles
- Mailing list: subtitles-angels -at- lists.selfnet.de
- Mailing list: subtitles -at- lists.ccc.de
- IRC: #subtitles on hackint (requires SSL); the #voc channel works as well
- Etherpad-Domain: https://subtitles.pads.ccc.de
- Jabber: c3subtitles -!-at-!- jabber.ccc.de
- Videos: c3subtitles videos on amara.org
- E-Mail: subtitles -!-at-!- c3voc.de
What is our goal?
Our goal is better and more barrier-free access to the live talks and streams, and to the recordings afterwards, through subtitles. This especially helps non-native speakers of the spoken languages and deaf or hard-of-hearing viewers.
A nice side effect: finished subtitles are easy to translate into any other language, and amara.org provides an easy-to-use interface for that.
How can I help?
- If you are attending the congress and use speech recognition software, please contact us! The same goes if you are a computer stenographer or a good touch typist.
- If you are interested in what we are working on behind the scenes, just contact us!
- Help us create the subtitles via amara.org. You do not even have to attend the congress; anyone can do this from home!
What are our current projects behind the scenes?
- Developing software for a user interface that lets you choose which subtitle to work on, depending on your favorite task
- Developing software for subtitles produced via computer stenography or speech recognition, shown live during the talk via webstream and later used as the starting point for the precise version edited on amara.org
- Developing a phonetic German steno keyboard layout
- Building a steno keyboard
- Using an old mechanical stenotype machine with a microcontroller that detects the pressed keys as steno input