Interactive Labeling of Scan Segmentations

Fast timer icon

Thesis

Location icon

Karlsruhe

Interactive Labeling of Scan Segmentations

Kollegiengebäude am Kronenplatz (05.20), das ISSD befindet sich in der Triangel im 4. und 5. OG
Calendar
Immediately searched (unlimited)
Clock
30–40 h per week
Dollar
No salary specified
Remote work Icon
Remote work not possible
Bachelor's or Master's Thesis with the goal to design and develop an interactive labelling system for segmentation of advertisements from scanned newspaper archives. WHO CAN APPLY? Only enrolled students from KIT (Karlsruher Institut für Technologie) with course of studies Wirtschaftsinformatik, Wirtschaftsingenieurwesen, Informationswirtschaft, or Technische Volkswirtschaftslehre.
Responsibilities Icon

Problem

As the digitization of the worlds libraries and print archives continues steadily, the demand for automated processing of such documents grows. Hereby, resarchers and practicioners would like to digitally process such documents with tools from computer vision (CV) and optical character recognition (OCR). Further they would like to search and filter for certain document meta-data. However, all of this presumes the availablity of such extracted features and meta-data. As state-of-the-art machine learning (ML) classifiers still do not reach desired accuracy levels, especially on old documents or those from fringe contexts, manual labeling effort is required.

Occupational fields

Machine Learning Engineer (m/w/d)

Data Science & Artificial Intelligence

Tech

Advantages Icon

Requirements

We expect the student to be familiar with web development. The system should be devloped with a modern web application frontend framework (e.g. Vue with Vuetify) or be forked from an existing open source segmentation system. Further we expect the model to be trained based on standard Python frameworks. Experience in this regard is required as well.

Agile Working

Regular Feedback Meetings

User Icon

Goals

For the scope of this thesis, we limit the context to segmenting advertisements from scanned pages of newspapers and magazines. This poses an interesting use-case for, for instance, advertising researchers. Associated colleagues at the University of Mannheim (UniMA) have already manually created a labeled set of 9000 segmented pages of the US magazine "The Economist", ranging from the 1840s to today. We expect a thesis student to develop an interactive labeling system in order to support the extension of this segmentation traing data-set to many more pages. Interactive labeling hereby strives to combine automatic steps (e.g. the trained model) with incremental user input. The work-packages entail:
  • analyzing the state-of-the-art of such segmentation tools
  • exchange with the researchers at UniMA that created the training data-set regarding requirements and system evaluation
  • development of an interactive labeling system as part of a design science research process
    • train a ML classifier based on the existing training data
    • (potentially) include more training data from free data-sets
    • develop an interactive labeling tool that integrates the ML classifier with manual segmentation
    • include novel interaction paradigms with the existing ML classifier into the tool (manually reviewing those instances in which the model was uncertain, retraining the model based on new user input, ...)
  • writing a thesis document according to research group requirements & participation in our thesis colloquium
Design science research is a well established methodology in the information systems field, which deals with the scientific view on artifacts, such as the labeling system that should be developed during this thesis. Hereby so called design knowledge can be derived from the development process and the finished artifact. 

All levels welcome (no experience required)

Languages

English

German

Skill set

JavaScript

Python

Hypertext Markup Language

Cascading Style Sheets

Company Icon

About ISSD - KIT

The research group “Information Systems & Service Design” (ISSD) headed by Prof. Mädche focuses in research, education, and innovation on designing interactive intelligent systems. The research belongs to the Institute of Information Systems and Marketing (IISM) and is embedded into the Information Systems & Engineering group. ISSD is also part of the Karlsruhe Service Research Institute (KSRI). The research group is positioned at the intersection of Information Systems (german: Wirtschaftsinformatik) and Human-Computer Interaction (HCI). Our mission is to create impactful scientific knowledge for designing interactive intelligent systems that enable humans to perform activities more efficiently, effectively, and meaningful. We believe that delivering cutting-edge knowledge and inspiring education, as well as an ongoing dialog with the public need to go hand in hand to maximize the impact of our work in organizations and society. The group is organized in three research departments: Digital Experience & Participation, Intelligent Enterprise Systems, and Digital Service Design & Innovation. Current topics of research are Human-AI Interaction, Cognitive Interaction Technologies, Physiological Computing Systems, Interactive Business Intelligence & Analytics Systems, and Interactive Systems Engineering.

Foundation year icon
Founded in 1825
Employee icon
500-999 employees
Company sectors icon
Bildung
Company size icon
Global Player

By loading the map, cookies are set as specified in our data privacy. Learn more.

More information about the company
Frequently asked questions

Frequently asked questions

Arrow

Who or what is Campusjäger by Workwise?

Campusjäger is part of Workwise - a job platform that supports you throughout your entire career. We take care of recruiting for various companies and accompany you through the entire application process. Via Campusjäger by Workwise you can find jobs for students and graduates. You can manage your applications in your Workwise profile. Learn more about the connection between Workwise and Campusjäger.

Arrow

Is the job I see still available?

For jobs that are still open, you can click the 'Apply now' button. If this is not possible, the job has already been filled or temporarily deactivated.
Arrow

Which documents do I need for my application?

That depends entirely on the job you are applying for. In many cases it is sufficient to upload your PDF resume or fill out your Workwise profile.

Arrow

Where can I upload my records or documents?

You can upload your application documents in your Workwise profile. These can only be viewed by companies you are applying to.

Arrow

Where can I find more information about the company?

You can find more information in the company profile of ISSD - KIT.

Arrow

Can I process my application afterwards?

Yes, this is possible. In your application overview you can view your information and make changes. If you have already been invited to an interview, editing is no longer possible. However, you can still add general information and upload additional documents in your profile.

Arrow

How do I get news about my application?

In your application overview at Workwise you have an overview of the application progress at any time. Additionally, we send you emails about the most important status changes.

Arrow

Can I send several applications at once?

The number of your applications is not limited. An overview of your applications can be found at Workwise.

Arrow

Can other companies see where else I have applied?

No, companies can only see the applications they have received.
Arrow

Can I also contact the company's contact person directly?

Personal contact is possible via chat as soon as you have been invited for an interview. Before that, you will receive all important status changes by e-mail. If you have any questions, you can contact your personal Candidate Manager:in from Workwise.

Arrow

I don't think I meet all the requirements. Can I still apply?

Even if you don't meet all the requirements, you can make up for missing knowledge with additional skills. Use the application's questions to address your motivation and show the company why you are still a good fit for the job. If you don't meet many or all of the requirements, the application will not be successful.
Arrow

What do I have to consider if I am not from Germany?

Please make sure to provide all necessary documents within your Workwise profile. It should include an EU work-permit (if you have no EU citizenship) and a CV at least. Depending on the position you are applying to, you could also be asked for a certificate of enrollment, a transcript of records or a language certificate. We would also recommend to inform yourself thoroughly in advance about visa regulations. Therefore you can use the official visa navigator from the Federal Foreign Office.

Arrow

What do I have to consider if German is not my mother tongue?

Please take into account the job’s language requirements and make sure the requirements match your skills. In the job search you can use the language filter to find jobs without German language requirements. It is also helpful to provide language certificates. This section in our help center may support you during the application process.

Our job offer Interactive Labeling of Scan Segmentations sounds promising? We're looking forward to your application.

A similar job for you

Find similar jobs