The Team Behind Statistics Globe
Statistics Globe is an education platform that provides free programming tutorials in R and Python as well as theoretical explanations for the field of statistics and data science.
On this page, we want to give you a brief overview about the team behind this platform. This covers the authors of tutorials on Statistics Globe as well as the people behind the scene that are responsible for the appearance of the Statistics Globe website and its social media channels.
Former Team Members & Guest Authors on Statistics Globe
Some former team members no longer produce new content for Statistics Globe. However, you can find their previous contributions on their profile pages. Furthermore, Statistics Globe regularly features tutorials of guest authors.
Below, you can find all authors who have published articles on Statistics Globe in the past. You may click on the images to get further information about these authors as well as a list of their articles on Statistics Globe.
Social Media & Contact
A major goal of the Statistics Globe platform is to exchange with other statisticians, data scientists, programmers, and researchers of any field. Please let us know in the comments or on social media, in case you have any questions or topics you want to discuss!
In case you would like to follow Statistics Globe on social media or if you intend to contact us via email, you can find all details below.
- YouTube
- Facebook – Statistics Globe Page
- Facebook – R Programming Group for Discussions & Questions
- Facebook – Python Programming Group for Discussions & Questions
- LinkedIn – Statistics Globe Page
- LinkedIn – R Programming Group for Discussions & Questions
- LinkedIn – Python Programming Group for Discussions & Questions
57 Comments. Leave new
how are you sir
i want (ggplot ) for clusters for example bar plots of clusters
thanks
Hey Ahmed,
I’m fine and you? 🙂
Thank you for the topic suggestion! Are you looking for something like this? https://statisticsglobe.com/draw-stacked-barplot-in-r
Regards
Joachim
Hi Joachim,
Hope you are well.
Your tutorials are very helpful for understanding R programming for people without any background of the same.
I’m hardcore biologist and wish to do data analyses and statistical analyses using R. However, I’m having trouble in finding reference for arranging my data (data frame) for certain packages (e.g.: vegan, microbiome, phyloseq, etc.)
Can you help with these?
Regards,
Shruti
Hey Shruti,
I’m fine, and you? 🙂
First of all, thanks a lot for the very kind words! Glad to hear that you like the Statistics Globe tutorials!
Regarding your question: I’m not a biologist, so I have never worked with these packages. However, could you explain how your data should be modified? Maybe we can still find a solution.
Regards
Joachim
Thank you so much for your reply.
I deal with with environmental samples and I use molecular methods to draw certain inferences based on data that I get (metagenomic) and relating it to various environmental factors such temperature, pH, etc and sometimes intra-comparison within the samples. There are packages in R (Bioconductor actually) like the ones I mentioned which are widely used and a re known to help create and obtain different interpretations. And my major query is that for each of these packages, for R to read the file, the data has to be arranged in a particular way, so how do we know it or is there a way where changes in the script would make file readable
-Regards,
Shruti
The required data format depends on the functions and packages you are using. First, I would have a look at the help documentation of the function you want to use (i.e. write a ? in front of the function name and execute it in the RStudio console). This should give you an idea on how to format your data.
Once you know the required format, you can use the search function on Statistics Globe. I have created many tutorials on data manipulation already. Some tutorials that might be useful to manipulate your data:
https://statisticsglobe.com/reshape-data-frame-from-wide-to-long-format-in-r
https://statisticsglobe.com/r-order-data-frame-rows-according-to-vector
https://statisticsglobe.com/add-column-to-data-frame-in-r
If you don’t find a solution, you may post your question to the Statistics Globe Facebook group. In this group, people can ask questions about R programming and statistics. By posting there, others can contribute as well: https://www.facebook.com/groups/statisticsglobe
I hope that helps!
Joachim
Thank you so much for your reply. I would surely try this and see if it works.
-Regards,
Shruti
You are very welcome Shruti! I hope you’ll find a solution soon 🙂
Regards
Joachim
Yes, thank you. I hope to find a solution soon too ☺😇
Regards,
Shruti
I just wanted to say thank you for all of your videos, they have been incredibly helpful throughout the project that I’m conducting at the moment. I only started learning R a couple of months ago but I feel like I’ve come a long way thanks to your videos. Whenever I face a problem, I type it into youtube and the first thing that turns up is a clear video by you explaining exactly how to do it! Really appreciate all the hard work you have put into statistics globe, it is the best R resource I’ve used!
Wow, thank you so much Lasith! This is a wonderful feedback, and I’m very happy to hear that my videos helped to improve your R programming skills!
Good luck with your further learning progress! 🙂
Regards,
Joachim
Hello team, I had one year of experience in TECHNICAL CONTENT WRITING (freelance) and TECHNICAL CONTENT REVIEWER (3 Months experience)
Here’s my articles:
https://auth.geeksforgeeks.org/user/sravankumar8128/articles
I am interested to contribute my knowledge in your platform
On Python,R, PHP etc
Hi Gottumukkala,
Thank you very much for your comment!
I have emailed you to discuss further details.
Regards,
Joachim
Hello Joachim,
Hope you are doing well. I want to find out if you could do a video on Bayesian mixing models using R Statistics software. Thank you.
Hey Richmond,
Thank you for the topic suggestion! I’ll put it on my to-do list for future tutorials.
Regards,
Joachim
Hey Joachim,
I chanced about this website today, and I am really amazed how simply the tutorials are explained. They have helped me a lot in my project, I just want to say Thank you for your work.
Hey Maitreyee,
Thank you very much for this wonderful feedback! It’s great to hear that my tutorials are helpful for your project! 🙂
Regards,
Joachim
Hello i have a r project but little struggling, can i have your help? It s for put two charts with a diffferent x-axis beginning in the same point in the xaxis (0)
Hey Anthony,
Please have a look here for Base R plots: https://statisticsglobe.com/set-axis-limits-in-r and here for ggplot2 plots: https://statisticsglobe.com/set-axis-limits-in-ggplot2-r-plot
Regards,
Joachim
Hi, I have a question about sensitivity analysis in R, i fitted multinomial logit model, we have equations like this:
consider this equation for example,
V1 = ASC1 +a1×Inc+b1×HS+c1×Cld+e1×Dri+ f1×Edu+ g1×Loc+h1×HD+m1×Rnt
for example , V1 is the utility for having one car in the households, we have V0=0 as a base category to compare each category with this reference.
we have also V2, V3, V4 for having 2,3,4 or more cars,
we estimated a1,b1,c1, and so on by the multinomial logit model, now I want to test the change in household income; we want to test both a 25% decrease (reflecting a possible economic downturn) and a 25% increase; and see the effects on the predicted number of cars, I know I should use the equations resulting of the model and predict function in R , but i do not know how?
Best Regards
Hey Samin,
Unfortunately, I’m not an expert for sensitivity analysis. However, I have created a quick example on how to use the predict function, so hopefully this is helpful to you:
In addition, you may ask your question in the Statistics Globe Facebook group: https://www.facebook.com/groups/statisticsglobe
Regards,
Joachim
Hello,
I have built a package in R (with the name MSB), but I forgot to run the dependencies of this package first. So, I need to remove this package and reinstall it again.
When I write the CMD commands it gives the following error:
> R CMD REMOVE MSB
Error: unexpected symbol in “R CMD”
Can you please help me with this problem?
Hey Sarah,
Usually, you should be able to simply use the install.packages function to re-install a package. Have you already tried that?
Regards,
Joachim
hey sir im a statistics student a researcher and im doing a research on bringing an accuracy in classification in machine learning so i need some help in R can i contact you personally ?
Hey Iqbal,
Due to time-constraints, I do not offer 1-on-1 consultancy. However, I have recently created a Facebook discussion group where people can ask questions about R programming and statistics. Could you post your question there? This way, others can contribute/read as well: https://www.facebook.com/groups/statisticsglobe
Regards,
Joachim
Hi Joachim, not a request, not a question – but I just wanted to thank you and the team for the great work! I keep coming back to your website, since it has up-to-date, high-quality, reliable answers to just about any question I have regarding R. Keep up the amazing work!
Hey Alex,
Thank you so much for the amazing feedback, it’s really great that you find our content helpful!
Best regards,
Joachim
Hi, Sir, I have two years of experience in contributing 500+ articles on the following platforms
And I am also the reviewer of the same.
I know Python, R, etc
I can contribute python -pandas, NumPy, pyspark , R language, and other python related modules to your website
Here are my past PUBLICATIONS on the following platforms: https://auth.geeksforgeeks.org/user/geetansh044/articles
Hi Geetansh,
Thank you very much for the offer. I’m currently not looking for any further help, though.
Best regards,
Joachim
Hello Joachim,
Thanks for the wonderful. I’m trying to learn R and I have some struggles in solving an economic data set. Would you be able to help me please?
Thanks,
N
Hey Navya,
Thank you very much for the kind comment, glad you like the tutorials on Statistics Globe!
Sure, what is your specific question?
Regards,
Joachim
Hello, Joachim.
I am sorry to bother you with this question. I know it’s off topic, but I wanted to try at least. So I have this code in R:
my_pmatch <- function(q, size, lower.tail=TRUE){
if(!(floor(size) == size) | size < 1){
stop("size must be a whole number")}else{
for(i in 1:q){
if(q < 0){
return(0)} else {
return(print(my_pmatch (1,size)))}}}}
I want it to implement the p-series function for a Matching size distribution and return an answer for any numeric q. If you could give me any hints as to what is wrong with it, I would really appreciate it. If don't want to answer me, then no problem, I will understand. Thank you for your attention and have a nice day.
Hi Cecilia,
Thank you for your kind message. I just had a look at your function, and for me, it looks strange that you call your own function within this function (i.e. this line of code: return(print(my_pmatch(1,size)))).
Could you please explain what you are trying to do in this line?
Regards,
Joachim
Hello, Joachim.
Thank you for answering my comment. What I was trying to do with that line is for the code to return a numeric answer, using the p series in a Matching (size) distribution. I though that this line will command R to return a numeric answer, following the conditions above the line. I don’t know if I am explaining myself correctly. Thank you for reading my comments and for your help. Have a great day.
Hi again,
Thank you for the clarifications. I’m afraid it is not possible in the way you are trying to do this. The first part of your function looks fine to me, but this last line should be modified.
Unfortunately, I’m not an expert on p series in matching distributions, so I cannot help you in more detail. However, I have recently created a Facebook discussion group where people can ask questions about R programming and statistics. Could you post your question there? This way, others can contribute/read as well: https://www.facebook.com/groups/statisticsglobe
Regards,
Joachim
Thank you for answering my last comment, and thank you for all your help and for recommending me the Facebook page. I will check it out. Have a great day.
You are very welcome Ana, I hope the Facebook group was helpful!
Regards,
Joachim
I get error
data<-read.csv(file.choose(),sep=',')
data = data[ 1:1535,]
my_list2<-list()
for(i in 1:ncol(data)){
my_list2[[i]]<-data [,i]
}
names(my_list2)<-colnames(data)
my_list2
cfg <- compare_clusterings_configs(
types = "partitional",
k = 20L,
controls = list(
partitional = partitional_control(
iter.max = 20L
)
),
distances = pdc_configs(
"distance",
partitional = list(
ACF = list(
window.size = seq(by = 1L),
norm = c("L1")
)
)
)
)
evaluators <- cvi_evaluators("ARI", ground.truth = my_list2)
comparison <- compare_clusterings( my_list2 , types = "partitional",
configs = cfg, seed = 8L,
score.clus = evaluators$score,
pick.clus = evaluators$pick)
# some rows and columns from the results data frame
head(comparison$results$partitional[, c("distance","ARI")])
i get this
(Error in { :
task 1 failed – "'list' object cannot be coerced to type 'integer'")
Hey Ahmed,
Please excuse the delayed response. I was on a long holiday, so unfortunately I wasn’t able to reply sooner. Still need help with your code?
Regards,
Joachim
Hello Joachim,
It is a pleasure to come to this website because I can understand very easily what you explain. And I have gone to other sites to clarify how R works and I don’t see this didactic capacity that you have here. Thank you very much and congratulations
Hey Isaac,
Thank you so much for the very kind feedback! It’s really great to hear that you enjoy reading the tutorials on Statistics Globe!
Regards,
Joachim
Hi Joachim,
Hope you’re well.
I’m looking to develop the following in R.
https://github.com/MatthewSmith430/CompaniesHouse
I’m hoping you can help; can you please email me to discuss in more detail.
Thanks,
Hi Jack,
I apologize for the delayed reply. I was on a long vacation, so unfortunately I wasn’t able to get back to you earlier. Do you still need help with your syntax?
Regards,
Joachim
Hi , could you help please
Write three R functions called “Test1.R.Function()”, “Test2.R.Function()”, “Test3.R.Function()”, to perform the simulation study presented in section 4 “4 A Simulation Study” of the paper “Stavros_2004.pdf”. Repeat the simulation study as conducted and presented in section 4 for each test with only N = 1000 repetitions. Compare your results with the simulation results presented in tables:
Table I(a) and Table I(b) for Test 1
Table II(a) and Table II(b) for Test 2
Table III(a) and Table III(b) for Test 3
Hi Yousif,
I apologize for the delayed reply. I was on a long holiday, so unfortunately I wasn’t able to get back to you earlier. Do you still need help with your syntax?
Regards,
Joachim
Yes please
Hi again,
I don’t want to do your entire assignment for you. Could you please show the code you have tried yourself so far?
Regards,
Joachim
Dear Joachim,
I work on my thesis with R programming. I use catR package for my analises. I have trouble with genDichomatrix function. Because in that function there is a part called content balancing method.
if (!is.null(cbControl)){
props<-cbControl$props
names<-cbControl$names
props<-props/sum(props)
nr<-round(items*props)
f<-function(items,nr){
res<-matrix(0,length(nr),items)
for (i in 1:items){
res[,i]<-rmultinom(1,1,nr)
nr<-nr-res[,i]
}
return(res)}
pr<-f(items,nr)
g<-function(x) (1:length(x))[x==1]
it<-apply(pr,2,g)
groups<-NULL
for (i in 1:items) groups[i]<-names[it[i]]
groups<-as.factor(groups)
RES<-data.frame(a,b,c,d,Group=groups)
}
else RES<-data.frame(a,b,c,d)
return(RES)}
I want to modify this code to modified multinomial model. To do this 'props' part is need to be cumulative. But i dont know how to do is. Can you help me???
Hello Melike,
Maybe this tutorial of ours can help with cumulative proportion calculation.
Regards,
Cansu
Cansu Hanım merhaba,
Yanıtınız için çok teşekkür ederim. Yalnız ben yanlış fonksiyon üzerinde uğraşıyormuşum onu farkettim. Şimdi diğer fonksiyon için uğraşıyorum. İzniniz olursa özelden görüşmemiz mümkün mü acaba? Yani mailinizi alabilir miyim? Gerçekten alabilirsem çok sevinirim. Sevgiler….
Merhaba Melike,
Yardimci olmak isterdim fakat özel destek sunmuyoruz. Yeni fonksiyonla ilgili sorunuzu da buraya eklerseniz, yardimci olmaya calisirim.
Sevgiler,
Cansu
greeting, I am the ceo of small company, centraspheres. we are in area of human capital management. I am looking for company like yours for technical support, my email is as follow
Hi Hesam,
Thank you for your request. Currently, we don’t offer paid consultancy. In case you have shorter questions about your code, you may ask them in the Statistics Globe Facebook group here.
I hope this helps!
Joachim
Hi to the Statistics Globe team,
your tutorials helped me through a lot of issues with my code already. I now want conduct a Multi-Trait-Multi-Method-Analysis in R. I find the mtmm() function of the psy package not very intuitive. Multicon won’t install with my current version of R (4.2.0), however, it states more clearly what data structure is needed for the analysis. Do you know how data has to be organized for mtmm()? Or if it is possible and useful to still use the multicon package nowadays?
Warmly,
Anna
Hello Anna,
I haven’t run that analysis personally. But I could also recommend using the lavaan package if you have trouble with other packages. It is a commonly used package for factor analysis.
Regards,
Cansu
Hi Joachim and Statistics Globe team,
I would like to congratulate all of you for your tutorials, it is very easy to understand and super organized, especially for those who are R beginners (like me). I have a question that is not on your tutorials, but I bet you can help me solve this in a simple way. I have been spending some days to figure it out but I couldn’t.
I would like to group and match specific categorical variables (i.e., weekdays, hours of consuming certain foods, and eating occasions – i.e., breakfast, lunch, dinner, and others) by the IDs. It is a large dataset and it has two reports on two different days of consumption, in this scenario, I have to separate the two reports and take the average intake of each energy and nutrient consumed. I bet that the first step is to separate this into two datasets one with one record and another dataset with the second record. The database is in a long format, I think maybe I should convert to a wide format.
The dataset name is HEIFI_2019
Variables are COD_UPA (ID); Weekdays (Sunday to Monday); hours (0h to 23h); eating occasions (breakfast, lunch, dinner, and, other).
Thanks,
Hello Ana,
Thank you very much for your kind words. So you want to find the average food consumption per weekday and per eating occasion for each report separately? If so, would you like to group the data for each category separately or combined? Do you want to obtain twıo datasets, in the end, one for repot1 and the other for report2? What do you like to do with ID, I also didn’t get that well.
Best,
Cansu