BRIGHTCODE – Michał Żarnecki Portfolio

Hi, I'm Michał Żarnecki — Programmer, Machine Learning Specialist, and Educator. I specialize in building innovative systems and solutions at the intersection of artificial intelligence, machine learning, and data-driven technologies. With a strong foundation in Python and PHP, my work focuses on delivering impactful results and web based systems in areas such as data mining, big data, and natural language processing. On this website you can check some of my projects and recent activity.

Category: events

Fot. Adam Stepien / ministerstwoportretu.pl

Python Summit 2024 at the Copernicus Science Center in Warsaw

Posted on 11 December 2024  in events

During my talk I shared the journey of building a system that maps unstructured company descriptions to official industry codes which is challenging because of 1800 possible classification categories and multiclass classification.

In presentation I described how over the years, we’ve evolved our solutions:
– From random forest classifier
– To zero-shot + classification dedicated Large Language Model (LLM)
– And finally, LLMs + Retrieval Augmented Generation (RAG)

, , , ,

[Top]

Category: events

Data Science Summit 2024 at PGE National Stadium in Warsaw

Posted on 21 November 2024  in events

Lecture: Classifying unstructured texts into 1800 categories!
Problem: In this presentation, I will examine the development of a text classifier created by the team at CompanyHouse AG to address the challenge of classifying unstructured texts that describe companies’ activities into the official German industry codes, WZ 2008. Over the years, we have experimented with various techniques to manage classification across a vast number of categories (1,800 in total). I will discuss the strategies we employed to tackle this complexity and demonstrate the evolution of our model from a random forest classifier to an innovative solution based on large language models and retrieval-augmented generation (RAG) techniques.

Methodology: Our approach includes a range of methodologies: multiclass classification, retrieval-augmented generation (RAG), random forest classifiers, similarity algorithms, embedding techniques, and the use of vector databases.

Conclusions: Integrating additional knowledge into models using retrieval-augmented generation combined with similarity algorithms and techniques such as chain-of-thought reasoning can effectively address complex multiclass classification problems. This approach achieves high evaluation scores and outperforms pre-trained classifiers.

, , , , , ,

[Top]

Category: events

Michał Żarnecki on PHPers Summit 2024

PHPers Summit Conference 21.06.2024 Poznań

Posted on 21 June 2024  in events

Lecture: Retrieval Augmented Generation – replace complex logic with a text model
Abstract: Have you ever encountered code with so many conditions and processing scenarios that it was almost impossible to maintain and extend it? What if we replaced it with an automatically generated, self-improving algorithm? In recent years, machine learning as a field of artificial intelligence has become an effective tool for creating systems and applications. The dynamic development of models based on artificial neural networks means that the programming of complex business rules and services based on the summarization and extraction of information can be successfully replaced with machine learning models. In this presentation, you will see a case study illustrating the process of building a simple application based on RAG (Retrieval Augmented Generation) in PHP using a large text model (LLM) to effectively find precise answers in a database of unstructured text data.

[Top]

Category: events

PHPCon Conference 17-18.11.2023 Zawiercie

Posted on 18 November 2023  in events

ML with PHP – replace complex business logic with machine learning models

Abstract: Have you ever encountered code with so many conditions and processing paths that it was almost impossible to maintain and extend? What if we replaced it with an automatically generated, self-improving algorithm? In recent years, machine learning as a field of artificial intelligence has become an effective tool for creating systems and applications. With the development of artificial neural networks, programming complex business rules and services based on prediction and classification can be replaced by pre-trained machine learning models. In this presentation, you will see a case study illustrating the potential of PHP in integrating machine learning. We will walk through the process of creating a classifier and placing it in a PHP-based project.

, , , , ,

[Top]

Category: events

PHPers Summit Conference 27.05.2023 Poznań

Posted on 27 May 2023  in events

Lecture: Exploring the viability of PHP for implementing artificial neural networks: A case study on autonomous vehicle control with CNN model

Abstract: In recent years, machine learning has become an essential tool for developing intelligent systems. With the rise of artificial neural networks, programming languages such as Python and R have become the go-to options for machine learning implementation. But is PHP a viable alternative? In this presentation, we will explore the potential of PHP for implementing artificial neural networks by examining its limitations compared to other popular languages. We will also demonstrate the application of machine learning in PHP through a case study where we trained a convolutional neural network model to control a prototype of an autonomous vehicle using Raspberry Pi and Nvidia’s “DAVE 2” CNN model architecture.

, , ,

[Top]