The Team Behind Statistics Globe

Welcome to the About section of Statistics Globe, where we introduce the individuals behind our education and consulting platform. Since Statistics Globe was founded in 2017, it has become a well-known source of expertise in statistics and data science, as well as in the programming languages R and Python.

Our team consists of highly skilled data scientists and technical staff, each bringing their own experience and specialized expertise to our platform. The collaborative spirit in our team ensures the quality and innovation in all our offerings, from free tutorials to specialized online courses, and individual consulting services.

On this page, we want to give you a brief overview about each of the team members behind this platform. Please find our individual profiles below.

Our Main Team

The main Statistics Globe team consists of the people, which you can find below. They are responsible for statistical and programming content, website development, and social media appearances. You can click on each image for more information about each team member.

Joachim Schork

Statistician & Founder of Statistics Globe

Joachim Schork is the founder of Statistics Globe and leads the platform's content on statistics, data science, and programming in R and Python. With a Master's degree in Survey Statistics and several years of working experience in Official Statistics, Joachim specializes in topics such as missing data imputation, statistical matching of different data sources, and sampling design of surveys. He is the primary contact for consulting clients, speaker in Statistics Globe's online courses, and responsible for creating new tutorials. Additionally, he directs the strategic planning of the website, YouTube channel, and social media platforms. Read more about Joachim...

Matthias Gengenbach

IT Services, Web Hosting & Data Security

Matthias Gengenbach is our esteemed IT expert at Statistics Globe, responsible for web hosting, ensuring lightning-fast article loading, and managing all aspects related to data security and HTML/PHP optimization. With a deep understanding of server hardware, WordPress themes, SQL server optimization, and PHP, Matthias is the go-to person for any related inquiries. His proficiency surpasses standard IT tasks, as he also delves into large SQL databases to sharpen his analytical expertise. This blend of infrastructure management and data analysis underscores his comprehensive skill set. Read more about Matthias...

Micha Gengenbach

SEO, Social Media & Video Editing

Micha Gengenbach administrates the online presence of Statistics Globe, managing, developing, and optimizing the website, YouTube, and social media platforms. With a certification in business administration and extensive experience in a sales back-office and as a sales representative, Micha brings a unique blend of marketing savvy and technical skills to the team. He is in charge of video editing for the YouTube channel and increases our visibility on social media networks including LinkedIn, Facebook, Twitter/X, Instagram, and TikTok. Micha's role is pivotal in enhancing our digital appearance and engaging with our global audience. Read more about Micha...

Former Team Members & Guest Contributors

Some former team members no longer produce new content for Statistics Globe. However, you can find their previous contributions on their profile pages. Furthermore, Statistics Globe regularly features tutorials of guest authors.

Below, you can find all the people who have produced content for Statistics Globe in the past. You may click on the images to get further information about these people, as well as a list of their content contributions to Statistics Globe.

Ifeanyi Idiaye

200+ Articles

Cansu Kebabci

50+ Articles; 3 Videos

Ömer Ekiz

50+ Articles

Paula Villasante Soriano

50+ Articles

Anna-Lena Wölwer

50+ Articles

Kirby White

10+ Articles; 9 Videos

Matthias Bäuerlen

10+ Articles

Gottumukkala Sravan Kumar

10+ Articles

Pierre Olivier

4 Articles

Wolf Riepl

1 Article; 1 Video

Albert Rapp

1 Video

Ines Röder

1 Article

Samer Hijjazi

1 Video

Neal Fultz

1 Article

Philip Gigliotti

1 Article

Jim Frost

1 Article

Adrián Álvarez Molina

1 Article

Social Media & Contact

A major goal of the Statistics Globe platform is to exchange with other statisticians, data scientists, programmers, and researchers of any field. Please let us know in the comments or on social media, in case you have any questions or topics you want to discuss!

In case you would like to follow Statistics Globe on social media or if you intend to contact us via email, you can find all the details below.

57 Comments. Leave new

ahmed
August 9, 2021 3:59 pm

how are you sir
i want (ggplot ) for clusters for example bar plots of clusters

thanks

Reply
- Joachim
  August 11, 2021 4:29 pm
  
  Hey Ahmed,
  
  I’m fine and you? 🙂
  
  Thank you for the topic suggestion! Are you looking for something like this? https://statisticsglobe.com/draw-stacked-barplot-in-r
  
  Regards
  
  Joachim
  
  Reply
Shruti
September 1, 2021 5:42 pm

Hi Joachim,
Hope you are well.
Your tutorials are very helpful for understanding R programming for people without any background of the same.
I’m hardcore biologist and wish to do data analyses and statistical analyses using R. However, I’m having trouble in finding reference for arranging my data (data frame) for certain packages (e.g.: vegan, microbiome, phyloseq, etc.)
Can you help with these?

Regards,

Shruti

Reply
- Joachim
  September 2, 2021 6:07 am
  
  Hey Shruti,
  
  I’m fine, and you? 🙂
  
  First of all, thanks a lot for the very kind words! Glad to hear that you like the Statistics Globe tutorials!
  
  Regarding your question: I’m not a biologist, so I have never worked with these packages. However, could you explain how your data should be modified? Maybe we can still find a solution.
  
  Regards
  
  Joachim
  
  Reply
  - Shruti Shah
    September 13, 2021 5:11 pm
    
    Thank you so much for your reply.
    I deal with with environmental samples and I use molecular methods to draw certain inferences based on data that I get (metagenomic) and relating it to various environmental factors such temperature, pH, etc and sometimes intra-comparison within the samples. There are packages in R (Bioconductor actually) like the ones I mentioned which are widely used and a re known to help create and obtain different interpretations. And my major query is that for each of these packages, for R to read the file, the data has to be arranged in a particular way, so how do we know it or is there a way where changes in the script would make file readable
    
    -Regards,
    Shruti
    
    Reply
    - Joachim
      September 14, 2021 6:01 am
      
      The required data format depends on the functions and packages you are using. First, I would have a look at the help documentation of the function you want to use (i.e. write a ? in front of the function name and execute it in the RStudio console). This should give you an idea on how to format your data.
      
      Once you know the required format, you can use the search function on Statistics Globe. I have created many tutorials on data manipulation already. Some tutorials that might be useful to manipulate your data:
      
      https://statisticsglobe.com/reshape-data-frame-from-wide-to-long-format-in-r
      https://statisticsglobe.com/r-order-data-frame-rows-according-to-vector
      https://statisticsglobe.com/add-column-to-data-frame-in-r
      
      If you don’t find a solution, you may post your question to the Statistics Globe Facebook group. In this group, people can ask questions about R programming and statistics. By posting there, others can contribute as well: https://www.facebook.com/groups/statisticsglobe
      
      I hope that helps!
      
      Joachim
      
      Reply
Shruti Shah
September 14, 2021 12:06 pm

Thank you so much for your reply. I would surely try this and see if it works.

-Regards,
Shruti

Reply
- Joachim
  September 14, 2021 3:27 pm
  
  You are very welcome Shruti! I hope you’ll find a solution soon 🙂
  
  Regards
  
  Joachim
  
  Reply
Shruti Shah
September 15, 2021 4:53 pm

Yes, thank you. I hope to find a solution soon too ☺😇

Regards,
Shruti

Reply
Lasith Ranasinghe
November 4, 2021 9:19 am

I just wanted to say thank you for all of your videos, they have been incredibly helpful throughout the project that I’m conducting at the moment. I only started learning R a couple of months ago but I feel like I’ve come a long way thanks to your videos. Whenever I face a problem, I type it into youtube and the first thing that turns up is a clear video by you explaining exactly how to do it! Really appreciate all the hard work you have put into statistics globe, it is the best R resource I’ve used!

Reply
- Joachim
  November 4, 2021 11:06 am
  
  Wow, thank you so much Lasith! This is a wonderful feedback, and I’m very happy to hear that my videos helped to improve your R programming skills!
  
  Good luck with your further learning progress! 🙂
  
  Regards,
  Joachim
  
  Reply
Gottumukkala Sravan Kumar
November 14, 2021 2:48 pm

Hello team, I had one year of experience in TECHNICAL CONTENT WRITING (freelance) and TECHNICAL CONTENT REVIEWER (3 Months experience)
Here’s my articles:

https://auth.geeksforgeeks.org/user/sravankumar8128/articles

I am interested to contribute my knowledge in your platform
On Python,R, PHP etc

Reply
- Joachim
  November 15, 2021 9:02 am
  
  Hi Gottumukkala,
  
  Thank you very much for your comment!
  
  I have emailed you to discuss further details.
  
  Regards,
  Joachim
  
  Reply
Richmond
December 2, 2021 5:41 am

Hello Joachim,
Hope you are doing well. I want to find out if you could do a video on Bayesian mixing models using R Statistics software. Thank you.

Reply
- Joachim
  December 2, 2021 8:18 am
  
  Hey Richmond,
  
  Thank you for the topic suggestion! I’ll put it on my to-do list for future tutorials.
  
  Regards,
  Joachim
  
  Reply
Maitreyee Katre
December 18, 2021 3:35 pm

Hey Joachim,
I chanced about this website today, and I am really amazed how simply the tutorials are explained. They have helped me a lot in my project, I just want to say Thank you for your work.

Reply
- Joachim
  December 20, 2021 7:06 am
  
  Hey Maitreyee,
  
  Thank you very much for this wonderful feedback! It’s great to hear that my tutorials are helpful for your project! 🙂
  
  Regards,
  Joachim
  
  Reply
Anthony
January 11, 2022 12:49 am

Hello i have a r project but little struggling, can i have your help? It s for put two charts with a diffferent x-axis beginning in the same point in the xaxis (0)

Reply
- Joachim
  January 11, 2022 5:58 am
  
  Hey Anthony,
  
  Please have a look here for Base R plots: https://statisticsglobe.com/set-axis-limits-in-r and here for ggplot2 plots: https://statisticsglobe.com/set-axis-limits-in-ggplot2-r-plot
  
  Regards,
  Joachim
  
  Reply
samin
January 11, 2022 6:10 pm

Hi, I have a question about sensitivity analysis in R, i fitted multinomial logit model, we have equations like this:

consider this equation for example,

V1 = ASC1 +a1×Inc+b1×HS+c1×Cld+e1×Dri+ f1×Edu+ g1×Loc+h1×HD+m1×Rnt

for example , V1 is the utility for having one car in the households, we have V0=0 as a base category to compare each category with this reference.

we have also V2, V3, V4 for having 2,3,4 or more cars,

we estimated a1,b1,c1, and so on by the multinomial logit model, now I want to test the change in household income; we want to test both a 25% decrease (reflecting a possible economic downturn) and a 25% increase; and see the effects on the predicted number of cars, I know I should use the equations resulting of the model and predict function in R , but i do not know how?

Best Regards

Reply
- Joachim
  January 12, 2022 6:44 am
  Hey Samin,
  
  Unfortunately, I’m not an expert for sensitivity analysis. However, I have created a quick example on how to use the predict function, so hopefully this is helpful to you:
  data <- data.frame(y = rnorm(100), # Example data x = rnorm(100)) my_mod <- lm(y ~ x, data) # Estimate linear regression model data_new <- data.frame(x = rnorm(50)) # New data frame without y data_new_predictions <- predict(my_mod, data_new) # Predict y for new data frame data_new_predictions # Print predictions for y
  In addition, you may ask your question in the Statistics Globe Facebook group: https://www.facebook.com/groups/statisticsglobe
  
  Regards,
  Joachim
  Reply
Sarah Magd
January 28, 2022 10:03 am

Hello,
I have built a package in R (with the name MSB), but I forgot to run the dependencies of this package first. So, I need to remove this package and reinstall it again.
When I write the CMD commands it gives the following error:

> R CMD REMOVE MSB
Error: unexpected symbol in “R CMD”

Can you please help me with this problem?

Reply
- Joachim
  February 7, 2022 11:46 am
  
  Hey Sarah,
  
  Usually, you should be able to simply use the install.packages function to re-install a package. Have you already tried that?
  
  Regards,
  Joachim
  
  Reply
iqbal hussain
March 5, 2022 7:19 pm

hey sir im a statistics student a researcher and im doing a research on bringing an accuracy in classification in machine learning so i need some help in R can i contact you personally ?

Reply
- Joachim
  March 9, 2022 11:20 am
  
  Hey Iqbal,
  
  Due to time-constraints, I do not offer 1-on-1 consultancy. However, I have recently created a Facebook discussion group where people can ask questions about R programming and statistics. Could you post your question there? This way, others can contribute/read as well: https://www.facebook.com/groups/statisticsglobe
  
  Regards,
  Joachim
  
  Reply
Al exander
May 29, 2022 9:11 pm

Hi Joachim, not a request, not a question – but I just wanted to thank you and the team for the great work! I keep coming back to your website, since it has up-to-date, high-quality, reliable answers to just about any question I have regarding R. Keep up the amazing work!

Reply
- Joachim
  June 6, 2022 4:25 pm
  
  Hey Alex,
  
  Thank you so much for the amazing feedback, it’s really great that you find our content helpful!
  
  Best regards,
  Joachim
  
  Reply
Geetansh Sahni
June 5, 2022 4:25 pm

Hi, Sir, I have two years of experience in contributing 500+ articles on the following platforms
And I am also the reviewer of the same.
I know Python, R, etc

I can contribute python -pandas, NumPy, pyspark , R language, and other python related modules to your website
Here are my past PUBLICATIONS on the following platforms: https://auth.geeksforgeeks.org/user/geetansh044/articles

Reply
- Joachim
  June 7, 2022 8:10 am
  
  Hi Geetansh,
  
  Thank you very much for the offer. I’m currently not looking for any further help, though.
  
  Best regards,
  Joachim
  
  Reply
Navya
July 14, 2022 1:04 am

Hello Joachim,
Thanks for the wonderful. I’m trying to learn R and I have some struggles in solving an economic data set. Would you be able to help me please?
Thanks,
N

Reply
- Joachim
  July 17, 2022 7:31 am
  
  Hey Navya,
  
  Thank you very much for the kind comment, glad you like the tutorials on Statistics Globe!
  
  Sure, what is your specific question?
  
  Regards,
  Joachim
  
  Reply
Cecilia Ramirez
September 30, 2022 9:06 pm

Hello, Joachim.
I am sorry to bother you with this question. I know it’s off topic, but I wanted to try at least. So I have this code in R:

my_pmatch <- function(q, size, lower.tail=TRUE){
if(!(floor(size) == size) | size < 1){
stop("size must be a whole number")}else{
for(i in 1:q){
if(q < 0){
return(0)} else {
return(print(my_pmatch (1,size)))}}}}

I want it to implement the p-series function for a Matching size distribution and return an answer for any numeric q. If you could give me any hints as to what is wrong with it, I would really appreciate it. If don't want to answer me, then no problem, I will understand. Thank you for your attention and have a nice day.

Reply
- Joachim
  October 5, 2022 8:36 am
  
  Hi Cecilia,
  
  Thank you for your kind message. I just had a look at your function, and for me, it looks strange that you call your own function within this function (i.e. this line of code: return(print(my_pmatch(1,size)))).
  
  Could you please explain what you are trying to do in this line?
  
  Regards,
  Joachim
  
  Reply
  - Ana Cecilia Ramirez Licon
    October 5, 2022 4:59 pm
    
    Hello, Joachim.
    Thank you for answering my comment. What I was trying to do with that line is for the code to return a numeric answer, using the p series in a Matching (size) distribution. I though that this line will command R to return a numeric answer, following the conditions above the line. I don’t know if I am explaining myself correctly. Thank you for reading my comments and for your help. Have a great day.
    
    Reply
    - Joachim
      October 6, 2022 6:57 am
      
      Hi again,
      
      Thank you for the clarifications. I’m afraid it is not possible in the way you are trying to do this. The first part of your function looks fine to me, but this last line should be modified.
      
      Unfortunately, I’m not an expert on p series in matching distributions, so I cannot help you in more detail. However, I have recently created a Facebook discussion group where people can ask questions about R programming and statistics. Could you post your question there? This way, others can contribute/read as well: https://www.facebook.com/groups/statisticsglobe
      
      Regards,
      Joachim
      
      Reply
      - Ana Cecilia Ramirez Licon
        October 6, 2022 9:15 pm
        
        Thank you for answering my last comment, and thank you for all your help and for recommending me the Facebook page. I will check it out. Have a great day.
      - Joachim
        October 13, 2022 8:00 pm
        
        You are very welcome Ana, I hope the Facebook group was helpful!
        
        Regards,
        Joachim
ahmed
October 12, 2022 6:39 pm

I get error

data<-read.csv(file.choose(),sep=',')
data = data[ 1:1535,]

my_list2<-list()

for(i in 1:ncol(data)){
my_list2[[i]]<-data [,i]
}

names(my_list2)<-colnames(data)

my_list2

cfg <- compare_clusterings_configs(
types = "partitional",
k = 20L,
controls = list(
partitional = partitional_control(
iter.max = 20L
)
),
distances = pdc_configs(
"distance",
partitional = list(
ACF = list(
window.size = seq(by = 1L),
norm = c("L1")
)
)
)
)

evaluators <- cvi_evaluators("ARI", ground.truth = my_list2)
comparison <- compare_clusterings( my_list2 , types = "partitional",
configs = cfg, seed = 8L,
score.clus = evaluators$score,
pick.clus = evaluators$pick)

# some rows and columns from the results data frame
head(comparison$results$partitional[, c("distance","ARI")])

i get this

(Error in { :
task 1 failed – "'list' object cannot be coerced to type 'integer'")

Reply
- Joachim
  November 14, 2022 12:31 pm
  
  Hey Ahmed,
  
  Please excuse the delayed response. I was on a long holiday, so unfortunately I wasn’t able to reply sooner. Still need help with your code?
  
  Regards,
  Joachim
  
  Reply
Isaac
October 26, 2022 3:17 pm

Hello Joachim,
It is a pleasure to come to this website because I can understand very easily what you explain. And I have gone to other sites to clarify how R works and I don’t see this didactic capacity that you have here. Thank you very much and congratulations

Reply
- Joachim
  November 11, 2022 3:02 pm
  
  Hey Isaac,
  
  Thank you so much for the very kind feedback! It’s really great to hear that you enjoy reading the tutorials on Statistics Globe!
  
  Regards,
  Joachim
  
  Reply
Jack Shaw
October 31, 2022 1:11 pm

Hi Joachim,

Hope you’re well.

I’m looking to develop the following in R.

https://github.com/MatthewSmith430/CompaniesHouse

I’m hoping you can help; can you please email me to discuss in more detail.

Thanks,

Reply
- Joachim
  November 14, 2022 12:46 pm
  
  Hi Jack,
  
  I apologize for the delayed reply. I was on a long vacation, so unfortunately I wasn’t able to get back to you earlier. Do you still need help with your syntax?
  
  Regards,
  Joachim
  
  Reply
yousif
November 4, 2022 4:46 pm

Hi , could you help please
Write three R functions called “Test1.R.Function()”, “Test2.R.Function()”, “Test3.R.Function()”, to perform the simulation study presented in section 4 “4 A Simulation Study” of the paper “Stavros_2004.pdf”. Repeat the simulation study as conducted and presented in section 4 for each test with only N = 1000 repetitions. Compare your results with the simulation results presented in tables:

Table I(a) and Table I(b) for Test 1

Table II(a) and Table II(b) for Test 2

Table III(a) and Table III(b) for Test 3

Reply
- Joachim
  November 14, 2022 12:51 pm
  
  Hi Yousif,
  
  I apologize for the delayed reply. I was on a long holiday, so unfortunately I wasn’t able to get back to you earlier. Do you still need help with your syntax?
  
  Regards,
  Joachim
  
  Reply
Yousif
November 14, 2022 1:08 pm

Yes please

Reply
- Joachim
  November 15, 2022 8:56 am
  
  Hi again,
  
  I don’t want to do your entire assignment for you. Could you please show the code you have tried yourself so far?
  
  Regards,
  Joachim
  
  Reply
Melike
March 9, 2023 8:25 pm

Dear Joachim,

I work on my thesis with R programming. I use catR package for my analises. I have trouble with genDichomatrix function. Because in that function there is a part called content balancing method.

if (!is.null(cbControl)){
props<-cbControl$props
names<-cbControl$names
props<-props/sum(props)
nr<-round(items*props)
f<-function(items,nr){
res<-matrix(0,length(nr),items)
for (i in 1:items){
res[,i]<-rmultinom(1,1,nr)
nr<-nr-res[,i]
}
return(res)}
pr<-f(items,nr)
g<-function(x) (1:length(x))[x==1]
it<-apply(pr,2,g)
groups<-NULL
for (i in 1:items) groups[i]<-names[it[i]]
groups<-as.factor(groups)
RES<-data.frame(a,b,c,d,Group=groups)
}
else RES<-data.frame(a,b,c,d)
return(RES)}

I want to modify this code to modified multinomial model. To do this 'props' part is need to be cumulative. But i dont know how to do is. Can you help me???

Reply
- Cansu (Statistics Globe)
  March 14, 2023 4:41 pm
  
  Hello Melike,
  
  Maybe this tutorial of ours can help with cumulative proportion calculation.
  
  Regards,
  Cansu
  
  Reply
  - Melike
    March 19, 2023 12:15 pm
    
    Cansu Hanım merhaba,
    
    Yanıtınız için çok teşekkür ederim. Yalnız ben yanlış fonksiyon üzerinde uğraşıyormuşum onu farkettim. Şimdi diğer fonksiyon için uğraşıyorum. İzniniz olursa özelden görüşmemiz mümkün mü acaba? Yani mailinizi alabilir miyim? Gerçekten alabilirsem çok sevinirim. Sevgiler….
    
    Reply
    - Cansu (Statistics Globe)
      March 20, 2023 8:41 am
      
      Merhaba Melike,
      
      Yardimci olmak isterdim fakat özel destek sunmuyoruz. Yeni fonksiyonla ilgili sorunuzu da buraya eklerseniz, yardimci olmaya calisirim.
      
      Sevgiler,
      Cansu
      
      Reply
hesam aboulmaali
April 12, 2023 8:48 pm

greeting, I am the ceo of small company, centraspheres. we are in area of human capital management. I am looking for company like yours for technical support, my email is as follow

Reply
- Joachim (Statistics Globe)
  April 19, 2023 7:41 am
  
  Hi Hesam,
  
  Thank you for your request. Currently, we don’t offer paid consultancy. In case you have shorter questions about your code, you may ask them in the Statistics Globe Facebook group here.
  
  I hope this helps!
  
  Joachim
  
  Reply
Anna
April 24, 2023 8:56 am

Hi to the Statistics Globe team,

your tutorials helped me through a lot of issues with my code already. I now want conduct a Multi-Trait-Multi-Method-Analysis in R. I find the mtmm() function of the psy package not very intuitive. Multicon won’t install with my current version of R (4.2.0), however, it states more clearly what data structure is needed for the analysis. Do you know how data has to be organized for mtmm()? Or if it is possible and useful to still use the multicon package nowadays?

Warmly,
Anna

Reply
- Cansu (Statistics Globe)
  April 27, 2023 12:16 pm
  
  Hello Anna,
  
  I haven’t run that analysis personally. But I could also recommend using the lavaan package if you have trouble with other packages. It is a commonly used package for factor analysis.
  
  Regards,
  Cansu
  
  Reply
Ana Carolina Barco Leme
August 20, 2023 12:02 am

Hi Joachim and Statistics Globe team,

I would like to congratulate all of you for your tutorials, it is very easy to understand and super organized, especially for those who are R beginners (like me). I have a question that is not on your tutorials, but I bet you can help me solve this in a simple way. I have been spending some days to figure it out but I couldn’t.
I would like to group and match specific categorical variables (i.e., weekdays, hours of consuming certain foods, and eating occasions – i.e., breakfast, lunch, dinner, and others) by the IDs. It is a large dataset and it has two reports on two different days of consumption, in this scenario, I have to separate the two reports and take the average intake of each energy and nutrient consumed. I bet that the first step is to separate this into two datasets one with one record and another dataset with the second record. The database is in a long format, I think maybe I should convert to a wide format.

The dataset name is HEIFI_2019
Variables are COD_UPA (ID); Weekdays (Sunday to Monday); hours (0h to 23h); eating occasions (breakfast, lunch, dinner, and, other).
Thanks,

Reply
- Cansu (Statistics Globe)
  August 21, 2023 9:13 am
  
  Hello Ana,
  
  Thank you very much for your kind words. So you want to find the average food consumption per weekday and per eating occasion for each report separately? If so, would you like to group the data for each category separately or combined? Do you want to obtain twıo datasets, in the end, one for repot1 and the other for report2? What do you like to do with ID, I also didn’t get that well.
  
  Best,
  Cansu
  
  Reply

The Team Behind Statistics Globe

Our Main Team

Former Team Members & Guest Contributors

Social Media & Contact

57 Comments. Leave new

Leave a Reply Cancel reply