Thinking about 'Bite-sized Mentorship' in ML Research

Education in Machine Learning in Africa

In the Machine Learning field in Africa, amazing initiatives, such as the Deep Learning Indaba, Data Science Nigeria, and Women in Machine Learning, have been building up a strong base of junior machine learning practitioners. They’ve largely done so by running formal and informal lectures, classes and conferences. These initiatives have been extremely successful, spurring on a number of other initiatives across the continent, resulting in budding communities of community-led junior ML practitioners. These entry-level ML practitioners have done introductory courses, worked through the mathematics, and completed a few competitive data science challenges (Kaggle, Zindi), while not formally going through training in University settings.

And this success is immense! But the question remains, what are the next steps?

In my experience at work, in Masakhane (a community of African NLP researchers) and elsewhere, mentorship has facilitated the most rapid growth in individuals - both personally being mentored, from people I’ve mentored or observed being mentored. It has allowed members of the community to move from ingesting existing knowledge, to guidance on how to contribute novel research and eventually publish papers. Unlike lectures which are focused on technical topics, mentorship captures “institutional” knowledge of how to write papers, avoid common mistakes, how to not get demotivated in research, and guides the research process. Unfortunately, we simply lack the required number of experienced ML supervisors in Africa, as well as have very low access to tertiary education in the first place.

Traditional Mentorship doesn’t scale

The problem is, unlike conferences or lectures, the traditional formal mentorship model does not work at scale. Lecturing, classes and conferences are useful because they work at scale. One person with knowledge can “release” knowledge on many others. And adding a few extra people to the equation does not impact the delivery of the lecture.

Conversely, formal mentorship is a long-term commitment between two individuals which often takes a high time and energy commitment from both the mentor and the mentee to maintain. Having been asked to be a mentor for individuals, despite my deep desire to help, my anxiety spikes at the thought of having to dedicate even more of my day to mentorship activities and inevitably I turn the request down. Having someone rely on me, when I have so many commitments is something I cant bare! I know this sentiment has been shared throughout many other people in research and non-research communities. The fact remains, that few people have capacity to maintain one-on-one mentorship relationships. To complicate matters, individual personalities and intrinsic motivation play a big-part in having a successful formalized mentorship relationship.

Given we have so many junior ML practitioners on the continent, little accessibility to more formal relationships such as supervisors, we need to rethink the mentorship model.

Rethinking Mentorship

The above led me to think about my own journey. I’ve rarely had someone who was formally assigned as my mentor, and actually fulfilled that role. But on the other hand, I have however been mentored EXTENSIVELY by MANY people. But if not by formal mentors, then by whom, and how did this happen? My informal mentorship, on reflection, has taken many forms:

Peer/Community mentorship. For me, this is the opening up and sharing struggles or roadblocks with friends, and building up communities of people I trusted. Even if they couldn’t solve the problems, the groups of individuals were always committed to brainstorming ideas, or would refer to trusted individuals. Even if no one has the answers, figuring things out together has yielded amazing results.
Asking questions to broad audiences. For me, this has been twitter and big slack groups. I regularly take the risk of sounding VERY dumb, but in doing so have opened up opportunities for individuals who do care enough, to respond. AND THEY ALWAYS DO! If people don’t know themselves, they’ll often share your question. You’ll get so many viewpoints from so many people! Pre-facing things with “I’m very new to this so this might be a stupid question, so apologies, but…” I believe has resulted in kind responses
Asking ad-hoc but VERY SPECIFIC questions to individuals. Sometimes I’m interested in a very specific answer to something. For example: “I find -insert technique here- confusing. Could you recommend me the best resource you’ve come across at explaining this concept?”. Or “I have no idea where to start on this issue, can I have 15 minutes if your time to get me started?”. Or even “I feel very demotivated, how did you get through this?”. It’s small and bite-sized and requires as little overhead on the individual you’re asking as possible.
Requesting feedback on output. If I have an idea, or a paper or a blog I’ve written, I ask individuals for feedback. It’s always easier to respond to an idea, than to come up with it from scratch so people sometimes don’t mind quickly reviewing something. If they don’t have time, then I make it clear that there is no commitment to do so.
Mentorship via collaboration. I’ve seen a number of fantastic ML individuals, who might never have supervised a student before, who have started working on small papers with beginner researchers. Working on something together means the individual goals are well-aligned and everyone benefits. Initiatives such as for.ai allow for this type of distributed collaborative research.

I believe some of the reason these strategies work is because:

They break the mentorship process into smaller chunks bite-sized for mentors so it didn’t feel like an ongoing burden or pressure.
They allow for many-to-many mentor-to-mentee relationships, and facilitated redundancy. When one specific person isn’t available, someone else with experience is. Trust in a community is more robust than trust in an individual.
They facilitate aligned priorities.

I dub this group of techniques “Bite-sized” mentorship. My hypothesis is that these ideas could be a more scalable approach to mentoring budding researchers on the African continent (and perhaps elsewhere). I also believe that this style of mentorship is the most sustainable.

So what are those “next steps”?

If you’re someone seeking a mentor, but struggling to find someone, I encourage you to try the follow:

To seek out or slowly build communities of people who care about the same things as you. This will enable peer mentorship and mentorship via collaboration.
Learn how to ask good specific questions. This makes it much easier for a mentor to reply.
Be brave and pro-active in your approach to getting mentored. Ask questions that seem dumb - everyone was in your shoes at some stage.

If you’re someone willing to mentor, but are nervous about the responsibility or time commitment, I encourage you to seek out these opportunities for “bite-sized” mentorship:

Partner with an existing initiative, even if it’s abroad (especially if it’s abroad). Do Q&As, join meeting now and again, encourage people, learn from people.
Actively follow junior researchers on Twitter, so you can see their struggles and can advise, and also be cogniscent of where you can learn from them!

Thank you for reading this and thank you to all my informal mentors! There are many of you and thank you for the time you’ve taken to respond to my requests. Would love to hear other people’s strategies and techniques and ideas on how they think about the future of mentorship!

DISCLAIMER: I have no background in psychology or sociology. I have not got any formal education in this or studies around this. I would REALLY LOVE for someone who does have research and formalized expertise to help refine or challenge these thoughts. This is just a place to capture my thoughts, observations and experiences and pose the questions

Context: These observations are from observing ML communities in Africa. Perhaps other disciplines, or communities would not apply