Dr. Gëzim Sejdiu left SDA. The profile below reflects the status at the point of his departure and is no longer updated.
PhD Student
Computer Science Institute
University of Bonn
Profiles: Homepage, LinkedIn, Google Scholar, GitHub, Research Gate, Twitter
Room 1.068
Endenicher Allee 19a, 53115 Bonn
University of Bonn, Computer Science
sejdiu@cs.uni-bonn.de
Short CV
Gëzim Sejdiu is a PhD Student & Research Associate at the University of Bonn. Gëzim’s research interest are in the area of Semantic Web, Big Data and Machine Learning. He is also interested in the area of distributed computing systems (Apache Spark, Apache Flink).
Research Interests
- Big Data
- Data Mining and Data Analysis
- Semantic Web and Semantic Search
- Machine Learning
- Distributed Computing
Projects
- Big Data Europe – Integrating Big Data, Software & Communities for Addressing Europe’s Societal Challenges
- SANSA – Open source platform for distributed data processing for RDF large-scale datasets
- BigDataOcean – Exploiting Ocean’s of Data for Maritime Applications
- Boost4.0 – Big Data Value Spaces for COmpetitiveness of European COnnected Smart FacTories 4.0
Teaching
- Courses
- Lab “Distributed Big Data Analytics” – (MA-INF 4223)
The goal is to provide experience and technical skills related to Big data processing tools like Flink and Spark, in addition, to make them acquainted with the functional programming style prevalent in concurrent and parallel programming for Big data.
(SoSe2019, WiSe2018/19, SoSe2018, WiSe2017/18, SoSe2017)
- Lab “Distributed Big Data Analytics” – (MA-INF 4223)
- Supervision
- Emetis Niazmand, since 2019; Master Thesis (co-supervision with Prof. Dr. Jens Lehmann)
- Gresa Halimi (University of Prishtina), since 2019; Master Thesis (co-supervision with Prof. Dr. Lule Ahmedi)
- David Ibhaluobe, since 2019; Master Thesis (co-supervision with Dr. Damien Graux and Prof. Dr. Jens Lehmann)
- Moumen Elteir, since 2018; Master Thesis (co-supervision with Prof. Dr. Jens Lehmann)
- Pardeep Naik, 2018 – 2019; Master Thesis: “An efficient recommendation system for RDF partitioners over large-scale RDF datasets” (co-supervision with Dr. Ioanna Lytra and Prof. Dr. Jens Lehmann)
- Mohammad Ghasemi, 2018 – 2019; Master Thesis: “An efficient semantic-based Entity-Resolution over Big RDF data with SANSA framework” (co-supervision with Prof. Dr. Jens Lehmann)
- Abakar Bouba, 2018 – 2019; Master Thesis: “RDF Data Compression Techniques in a Highly Distributed Context” (co-supervision with Dr. Damien Graux and Prof. Dr. Jens Lehmann)
- Gulnar Khalilova, 2018; Master Thesis (co-supervision with Dr. Anisa Rula and Prof. Dr. Jens Lehmann)
- Wang Zhe, 2018; Master Thesis: “Efficient In-memory Graph Partitioning Algorithms and Query Engine for RDF Data” (co-supervision with Dr. Ioanna Lytra and Prof. Dr. Jens Lehmann)
- Kunal Jha, 2018; Master Thesis: “Rule Mining on Distributed RDF Data” (co-supervision with Dr. Hajira Jabeen, Tommaso Soru, Michael Roeder and Prof. Dr. Jens Lehmann)
- Mohamad Denno, 2017 – 2018; Master Thesis: “Scalable deep learning technique for sensitive data exposure detection” (co-supervision with Dr. Hajira Jabeen and Prof. Dr. Jens Lehmann)
- Ali Denno, 2017 – 2018; Master Thesis: “Scalable Knowledge Graph Exploration for Sentiment Classification” (co-supervision with Dr. Hajira Jabeen and Prof. Dr. Jens Lehmann)
- Imran Khan, 2017 – 2018; Master Thesis: “Efficient and Scalable in-memory Semantic Partitioning for RDF Data” (co-supervision with Dr. Ioanna Lytra and Prof. Dr. Jens Lehmann)
- Nayef Roqaya, 2017 – 2018; Master Thesis: “Distributed Data Parsing and Vandalism Detection on Large Knowledge Graphs using Apache Spark and Hadoop Ecosystem” (co-supervision with Dr. Hajira Jabeen and Prof. Dr. Jens Lehmann)
- Rohan Asmat, since 2017; Web Development.
- Julius Kaufmann, since 2017; DevOps.
- Adrian Bajraktari, June – September 2018; Web Development.
Awards and Nominations
- Best demonstration award at International Semantic Web Conference 2017.
I. Ermilov, J. Lehmann, G. Sejdiu, L. Bühmann, P. Westphal, C. Stadler, S. Bin, N. Chakraborty, H. Petzka, M. Saleem, A. N. Ngonga, and H. Jabeen, “The Tale of Sansa Spark” in Proceedings of 16th International Semantic Web Conference, Poster & Demos, 2017. (Project Website, GitHub, Slides, Screencasts)
Presentations
- Towards A Scalable Semantic-based Distributed Approach for SPARQL query evaluation @SEMANTiCS 2019, 9-12.09.2019 (slides)
- DistLODStats: Distributed Computation of RDF Dataset Statistics @ISWC 2018, 8-12.10.2018 (slides)
- The Tale of SANSA Spark :: SANSA-Notebooks: Developer friendly access to SANSA @ISWC 2017, 21-25.10.2017 (slides, demo)
- Distributed Knowledge Graph Processing in SANSA @HPI Future SOC – Lab Day (Spring 2017), 25.04.2017 (video).
- A demo of Apache Flink with Docker on the BDE platform @2nd BDE Technical Webinar, 20.10.2016 (slides, video)
Workshops & Tutorials
- Workshops
- 1st Workshop on Large Scale RDF Analytics (LASCAR-19)
Half-Day Workshop at 16th European Semantic Web Conference 2019 (ESWC2019).
2nd – 6th June 2019, Portorož, Slovenia
- 1st Workshop on Large Scale RDF Analytics (LASCAR-19)
- Tutorials
- SANSA’s Leap of Faith: Scalable RDF and Heterogeneous Data Lakes
Half-Day Tutorial at 16th European Semantic Web Conference 2019 (ESWC2019).
2nd – 6th June 2019, Portorož, Slovenia
- SANSA’s Leap of Faith: Scalable RDF and Heterogeneous Data Lakes
Publications
2022
Efficient semantic summary graphs for querying large knowledge graphs Journal Article
In: Int. J. Inf. Manag. Data Insights, vol. 2, no. 1, pp. 100082, 2022.
2020
Efficient Distributed In-Memory Processing of RDF Datasets PhD Thesis
University of Bonn, Germany, 2020.
MINDS: A Translator to Embed Mathematical Expressions Inside SPARQL Queries Proceedings Article
In: Semantic Systems. In the Era of Knowledge Graphs - 16th International Conference on Semantic Systems, SEMANTiCS 2020, Amsterdam, The Netherlands, September 7-10, 2020, Proceedings, pp. 104–117, Springer, 2020.
DISE: A Distributed in-Memory SPARQL Processing Engine over Tensor Data Proceedings Article
In: IEEE 14th International Conference on Semantic Computing, ICSC 2020, San Diego, CA, USA, February 3-5, 2020, pp. 400–407, IEEE, 2020.
Scalable Knowledge Graph Processing Using SANSA Book Section
In: Knowledge Graphs and Big Data Processing, vol. 12072, pp. 105–121, Springer, 2020.
2019
Clustering Pipelines of Large RDF POI Data Proceedings Article
In: The Semantic Web: ESWC 2019 Satellite Events - ESWC 2019 Satellite Events, Portorov z, Slovenia, June 2-6, 2019, Revised Selected Papers, pp. 24–27, Springer, 2019.
Towards a Scalable Semantic-Based Distributed Approach for SPARQL Query Evaluation Proceedings Article
In: Semantic Systems. The Power of AI and Knowledge Graphs - 15th International Conference, SEMANTiCS 2019, Karlsruhe, Germany, September 9-12, 2019, Proceedings, pp. 295–309, Springer, 2019.
Querying Large-scale RDF Datasets Using the SANSA Framework Proceedings Article
In: Proceedings of the ISWC 2019 Satellite Tracks (Posters & Demonstrations, Industry, and Outrageous Ideas) co-located with 18th International Semantic Web Conference (ISWC 2019), Auckland, New Zealand, October 26-30, 2019, pp. 285–288, CEUR-WS.org, 2019.
Sparklify: A Scalable Software Component for Efficient Evaluation of SPARQL Queries over Distributed RDF Datasets Proceedings Article
In: The Semantic Web - ISWC 2019 - 18th International Semantic Web Conference, Auckland, New Zealand, October 26-30, 2019, Proceedings, Part II, pp. 293–308, Springer, 2019.
The Hubs and Authorities Transaction Network Analysis using the SANSA framework Proceedings Article
In: Proceedings of the Posters and Demo Track of the 15th International Conference on Semantic Systems co-located with 15th International Conference on Semantic Systems (SEMANTiCS 2019), Karlsruhe, Germany, September 9th - to - 12th, 2019, CEUR-WS.org, 2019.
Jekyll RDF: Template-Based Linked Data Publication with Minimized Effort and Maximum Scalability Proceedings Article
In: Web Engineering - 19th International Conference, ICWE 2019, Daejeon, South Korea, June 11-14, 2019, Proceedings, pp. 331–346, Springer, 2019.
A Scalable Framework for Quality Assessment of RDF Datasets Proceedings Article
In: The Semantic Web - ISWC 2019 - 18th International Semantic Web Conference, Auckland, New Zealand, October 26-30, 2019, Proceedings, Part II, pp. 261–276, Springer, 2019.
2018
Divided We Stand Out! Forging Cohorts fOr Numeric Outlier Detection in Large Scale Knowledge Graphs (CONOD) Proceedings Article
In: Knowledge Engineering and Knowledge Management - 21st International Conference, EKAW 2018, Nancy, France, November 12-16, 2018, Proceedings, pp. 534–548, Springer, 2018.
Profiting from Kitties on Ethereum: Leveraging Blockchain RDF with SANSA Proceedings Article
In: Proceedings of the Posters and Demos Track of the 14th International Conference on Semantic Systems co-located with the 14th International Conference on Semantic Systems (SEMANTiCS 2018), Vienna, Austria, September 10-13, 2018, CEUR-WS.org, 2018.
DistLODStats: Distributed Computation of RDF Dataset Statistics Proceedings Article
In: The Semantic Web - ISWC 2018 - 17th International Semantic Web Conference, Monterey, CA, USA, October 8-12, 2018, Proceedings, Part II, pp. 206–222, Springer, 2018.
STATisfy Me: What Are My Stats? Proceedings Article
In: Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, USA, October 8th - to - 12th, 2018, CEUR-WS.org, 2018.
2017
The BigDataEurope Platform - Supporting the Variety Dimension of Big Data Proceedings Article
In: Web Engineering - 17th International Conference, ICWE 2017, Rome, Italy, June 5-8, 2017, Proceedings, pp. 41–59, Springer, 2017.
Managing Lifecycle of Big Data Applications Proceedings Article
In: Knowledge Engineering and Semantic Web - 8th International Conference, KESW 2017, Szczecin, Poland, November 8-10, 2017, Proceedings, pp. 263–276, Springer, 2017.
The Tale of Sansa Spark Proceedings Article
In: Proceedings of the ISWC 2017 Posters & Demonstrations and Industry Tracks co-located with 16th International Semantic Web Conference (ISWC 2017), Vienna, Austria, October 23rd - to - 25th, 2017, CEUR-WS.org, 2017.
Distributed Semantic Analytics Using the SANSA Stack Proceedings Article
In: The Semantic Web - ISWC 2017 - 16th International Semantic Web Conference, Vienna, Austria, October 21-25, 2017, Proceedings, Part II, pp. 147–155, Springer, 2017.
2014
Semantic Ranking of Web Pages : The Wikipedia Case Study Masters Thesis
Faculty of Electrical and Computer Engineering, University of Prishtina, Kosova, 2014.
Ranking Authors on the Web: A Semantic AuthorRank Book Section
In: Social Networks: Analysis and Case Studies, pp. 19–40, Springer, 2014.