Machine Learning

I would like to, first, apologize for the misleading title.

Wikipedia defines “machine learning” as:

Machine learning is a field of computer science that uses statistical techniques to give computer systems the ability to learn (e.g., progressively improve performance on a specific task) with data, without being explicitly programmed.

But that is not what this post is about.

You see, as a junior scientist about to embark on your career path, one thing you should do is learn to use different machines. Some machines will make something for you, whereas others will tell you that those machines are probably not working, and despite all your efforts, it will be some time before you start getting good samples or results out of any thing in your lab.

So if you work in a lab, you’ll now have a very good idea of who rules there: It’s them. Now and forward, in your time as a researcher, you’ll go into many labs, meet a lot of people, and learn a lot of new stuff… but nothing and no one will judge you like those silent machines sitting smugly atop work benches of your lab.

Many a times, I have started working with one, thinking, oh this is going to be one of the easier ones. Never has that attitude worked for me. It’s like you cannot trust any single one of them; even the ones that you think you know, won’t think twice when it is as tempting as shattering your trust in them.

They don’t care if you are enthusiastic about research and have chosen this field as your career with such joy. They will happily make you think that you have nice typical graphs* in your presentation that you are about to show in your group meeting. They want you to suffer, to taste embarrassment, like they want to see how long you are going to last in this field that you have been so passionate about …

… Until you have proven yourself worthy of research.

As a result, one of the few things I have always found myself doing in the beginning is to start gaining favor of the machines that are in my lab. To build good rapport with them. And it is not easy, not with every machine. Even after you feel you two are going along well, they will still  test you, or just throw you under the bus when they feel like it**.

If it all sounds depressing to you, let me tell you there is hope. Somehow, after you have stood beside them long hours and worked with them at odd times***, they will start to slowly accept you and let you enter the ranks of researchers. It’s blood, sweat and tears to gain that kind of trust, but you can gain the status of meh-you’ll-do in the eyes of the machines.

But don’t ever expect them to start liking you, because that’s ridiculous, they probably don’t have a heart.

 

 * Well, okay, they may have been looking a little weird to you as well, but early on, you wouldn’t know that, thinking this is probably how they are supposed to be.

** Which tends to happen most on Fridays, followed next in probability by Mondays.

*** Condition of high-levels-of-consistency needs to be met with these requirements of long-hours and odd-times.

What Doesn’t Make It To THE Thesis

I recently attended a PhD seminar of a student at the Physics department. One interesting question that came up was: What didn’t make it to your thesis?

After deep thought and much analysis, I have realized this is such a good question. It let the candidate show other things he did and learnt in his PhD that didn’t make it to his thesis, because they may have been slightly irrelevant or might have been disregarded as not-important-enough.

But with this question, he could highlight this stuff because he still did those during his PhD (or more likely, he could highlight this stuff well if he’d been prepared for such a question to come up – this may not be your average kind of opponent-speak).

But hang on, is it just this stuff that doesn’t make it to your thesis?

When I was doing my Masters, we’d always joke about how everybody should write 2 theses: One the more formal, “required” document; the other, a document of “failures”: the questions that we asked, and what we tried in the lab, that then didn’t work out, and that we then dropped… which then obviously wouldn’t make it to the thesis. The kind of information that you jot down in the side margins of your lab notebook, along with angry and frustrated emoticons and hashtags and exclamation marks.

Jargon that only the person who owns the lab notebook may understand – And now I realize how I don’t make much useful or interesting notes in my lab notebook :'(

The kind of information that doesn’t make it anywhere.

Because that information is also useful, although it mostly just tells you what NOT to do (which is quite the time-saving information). But this kind of information doesn’t have much space in the scientific world. Like people in all other fields, we just want to make noise about the best work that we do, disregarding the important stepping stones that our failures were that got us there.

So yeah, a lot of stuff doesn’t make it to the thesis at any level (or any important document, for that matter – except may be some internal lab communication).

There’s also a lot of other stuff that doesn’t make it to the thesis: the hardwork and energy that you put in, how you picked yourself up after repeated failures, how your informal collaborations helped you (and the people around you)…

… Although if you end up doing really good work, the thesis can become a reflection of it.