Hi guys,
I started developing a Data Science project within the company I currently work for. Looking for open source solutions and possible integrations with Qlik I found Pytools. This Server Side Extension provides algorithms for advanced analysis in Qlik Sense, making data science algorithms more accessible for business areas.
The Qlik Extension (SSE) was built using a series of Python algorithms intended to provide a set of functions that can be used as expressions in Qlik Sense. Because the project is open source, customization and creation of new algorithms is open to everyone as needed.
Along with this project, I am applying the concept of Data Literacy with the focus of teaching business areas about the importance of reading and writing data. This way, company employees can make more confident, data-driven decisions. Improving analytical, statistical and analytical skills has been one of the biggest challenges so far.
This release includes the following implementations::
About the Setup process, development and presentation.
Algorithms and its expressions:
Clustering
This algorithm uses the following expression
PyTools.Cluster([ID],$(vMetrica)& ';' & $(vNMetrica2), 'scaler=quantile,min_cluster_size=3,min_samples=2')
(Image does not represent a real scenario due to data confidentiality)
Linear correlation
Correlation algorithm uses the following expression
Pytools.Pearson($(vMetrica1), $(vNMetrica2))
Dashboard - Clustering
Set up a clustering dashboard using HDBSCAN and its parameters.
(Image without data due to data confidentiality)
Recommendations
Doubts about development or material need I can share a PDF File with some instructions.
I post this as a outreach to the Community and find other data scientists who want to use this SSE or are interested in sharing experiences with this tool.
All SSE development was done by Nabeel Oz. On the GitHub link you can find the project's base setup in English https://github.com/nabeel-oz/qlik-py-tools
Remember, using this project as a base is a great way to start a Data Science project. With great base algorithms, you can customize to your needs and work with Data Literacy education within the enterprise environment without a large upfront investment.
Best regards and Qlik for the win.
Nabel,
I am receiving this error message when I attempt to install pytools in multi-node environment. Please see below for the error message. I have installed the pre-requisites software mentioned in your document.
'activate' is not recognized as an internal or external command,
operable program or batch file.
The system cannot find the path specified.
python: can't open file '__main__.py': [Errno 2] No such file or directory
Press any key to continue . . .
Hello! Thanks for this post. I'm starting also a new data science projet in the company I'm working and after some research I found this extension a good way to do data science with Qlik. I'm currently trying to implement my own fonctions within the pytools extension but I'm having a hard time finding how to develop new functions. I already have a python file with some forecasting that I want to use, but I'm having trouble getting this script in the right "shape" so it can fit the extension and I can use it directly doing Pytools.MyFonction if you know what I mean. Any ideas ?