Textmining in Corporate Reporting (5 ECTS, in English)

Textmining in Corporate Reporting – Financial and Sustainability (5 ECTS, in English)

Ein Alternative für Studierende, die sich für Datenanalysen und Maschinelles Lernen (Künstlicher Intelligenz) interessieren, aber die von der Einstiegshürde einer Programmiersprache abgeschreckt werden, ist die Nutzung von „RapidMiner Studio“. RapidMiner ist ein führendes Tool für „Data Science and Machine Learning“, das wegen seiner graphischen Benutzeroberfläche sowohl leicht zu bedienen als auch leistungsfähig ist.

Es wird daher bei uns häufig für Abschlussarbeiten genutzt. RapidMiner wird an der WiSo auch in der Wirtschaftsinformatik eingesetzt. Es ermöglicht den leichten Einstieg in Data Science Methoden, ohne Kenntnisse einer Programmiersprache wie Python oder R vorauszusetzen. EY, PwC, Deloitte, Warth & Klein u.a. nutzen RapidMiner. Die erlernten Methoden können aber später auch in anderen Tools genutzt werden.

RapidMiner umfasst Befehle für Lineare Regression, Logistische Regression, Entscheidungsbäume, Naïve Bayes Klassifikator, (einfache) Neuronale Netze, Deep Learning Neuronale Netze, Support Vector Machines, Diskriminanzanalyse, Clustering, Assoziationsregeln usw. (Video zur Vorhersage von Überlebenden der Titanic mit Verfahren des Maschinellen Lernens.)

Erweiterungen erlauben auch die Analyse von Texten (was mit Excel praktisch nicht geht):

Companies publish a large number of texts, most of which are now in digital form (as files) and are usually available via the Internet. These texts include legally required disclosures such as annual financial statements, sustainability reports, ad hoc disclosures, stock exchange listing prospectuses, analysts‘ conferences, etc., but also „voluntarily“ produced texts such as press releases, image brochures, product descriptions, etc.

Classical financial reporting is already complemented by sustainability reporting (CSR, ESG). However it will be soon compulsory for 15.000 (instead von 500) companies alone in Germany! CSR reporting is partly in numbers, but to a large degree in texts.

The texts constitute qualitative data. Text mining (other names: text analytics, natural language processing) includes algorithms that are used to automatically uncover knowledge in unstructured or semi structured text data. It is therefore a special application of data mining or artificial intelligence. Tasks include the classification of texts, the measurement of language tendencies such as sentiment, the extraction of specific content, the measurement of text similarities or the discovery of conspicuous outliers.

Rapidminer must be installed on your own computer. Previous knowledge in programming is NOT required! Installation instructions: https://docs.rapidminer.com/latest/studio/installation/

The course follows the „inverted classroom“ model consisting of a teaching part and an exercise part:

In the teaching part, tutorial videos are used to explain conceptual basics and then to demonstrate the implementation in Rapidminer, in particular by means of concrete demo examples. The demo data can be downloaded, so that the participants can follow all steps themselves.
In the exercise part, the participants apply the learned concepts to new cases and submit the solutions.
The evaluation with 5 ECTS is based on the completion of a larger assigment (submission of a homework with the code, an explanation of the code and a discussion of results).

Prof. Dr. Klaus Henselmann / Dr. Andreas Seebeck

Limited number of students
5 ECTS

Termine und Anmeldung zu Veranstaltungen

Aktuelle Termine finden Sie in der Übersicht aller unserer Veranstaltungen dieses Semesters im Master FACT auf StudOn:

https://www.studon.fau.de/cat92998.html

Dort finden Sie auch, wie man sich zur Veranstaltung anmeldet. Sie müssen dort auf StudOn sich zum entsprechenden Kurs anmelden und erhalten so Zugriff auf die weiteren Unterlagen