May 27, 2013

I made a MOOC, and I survived

Xavier Lagrange, Alexander Pelov and I made a MOOC introducing Cellular Networks!

It is supposed to be a 20-hours course for students with a minimum background on networks. It attracted around 350 students, including 35 students from my institution for which this course is part of the curriculum.

I do not discuss here our motivations to create a MOOC and the way students experience it. I focus on the teacher's standpoint when making this MOOC.

We decided to make our own MOOC from scratch without using external products (except YouTube to host video). In other words, we did not use third-party companies like Coursera and Udacity, which host content, advertise it, ensure hotline for technical troubles, and so on.

Time Spent
On a very rough estimation, we spent 240 hours on this MOOC, including:
  • 20 hours preparing the pedagogical material. This part is actually enjoyable for teachers. To transform classic 3 hours-long lectures (including jokes) into 7-minutes-long to-the-point videos (+ quizz) is actually a nice challenge. Though, there is room to do more: we did not change much our exercises. Moreover the only collaborative tool we experimented is peer reviewing for homework. In other words, the transition to c-MOOC would require more time.
  • x hours interacting with students. Since our MOOC was not that crowded, x was close to epsilon but I guess there should be some formulas linking teacher interaction time and number of students.
  • 30 hours installing and testing the MOOC platform. We chose OpenMOOC because it was the only available, viable, open-source platform at that time. It is an overall good basis but it is relatively hard to install for people who are not familiar with server administration. Moreover, statistics modules are very incomplete. But, still, OpenMOOC is OK, it provides basic functionalities and the teacher interface is friendly.
  • 180 hours generating the video of courses, including:
    • Warm-up: it is all but easy to write on a tablet while watching a camera, to master all recording elements, to feel comfortable, to find the right tone and the right pace. For each teacher, the first tries recording videos were disastrous.
    • Recording: we made a lot of errors, for example to speak during twenty minutes with microphones off. Even when everything runs perfectly, speaking for such video is totally different from lectures. Overall, 2 minutes of recorded videos ended up into 1 minute of online video.
    • Producing: we discovered how to use a studio software. With nowadays tools, it took us in average 10 minutes to deal with each recorded minute, so 20 minutes per each finally online minutes of video. And we had around 90 videos with average 6 minutes. 
  • 5 hours advertising. We were not affiliated with a well-known platform, so we needed to attract people (despite the aridness of the topic). Clearly, it was not enough.

Process
The teacher should first decide how to cut a full course into units, each unit being four to ten chunks, each chunk being a 7-minutes long video. We opted for the format that has been popularized by Khan Academy. This format is now widely used all over the platforms: the background is (almost) empty, the teacher speaks in the background, we sometimes see his/her face, and the most important point is that he/she writes on the slide when he/she speaks.

Our process was to first create the target slides, i.e. what we want to have at the end of the video. Then, we only kept what is actually hard to draw in real time, for example a hexagonal ceiling. This is the background slide. The goal of the video is to start with the background slide, to write on it, and to finish with something that is close to the target slides. During the recording, the target slides were displayed behind the camera so that the teacher does not forget anything (and look at the camera).

We decided to not use a prompter because we wanted to keep it as natural as possible. We did not write  the discourse in advance, but Xavier and Alex master the topic so well that they did not need it. Note that it is also possible to pause during the recording so that teacher can take time to think to the next sentences. It is also possible to repeat something in a better way when the previous sentences were not totally satisfactorily. These pauses and repeated sentences can be cut afterwards.

To generate the first videos, we used software, cams and microphones that we found in our shelves. We were able to generate some videos but the overall quality was borderline. Then, we got some extra fundings and we were able to get professional materials and to build our own studio. The quality is far better. Our studio includes a tablet, a powerful Mac with enough hard-disk (we needed around 1 Terabytes for this MOOC only), some wireless microphones, and a semi-professionnal camera.

Stuff I Would Have Made In a Different Way
I put here miscellaneous thoughts:
  • We would have chosen a sexier title. In our case, it would have been appropriate to include a buzzword like LTE, LTE-advanced or femtocells in the title. We identified three ways to attract a large population of students: 
    • The MOOC is affiliated with a top-ranked university, which knows how to advertise, or with a highly-visible platform like Coursera. These websites attract millions of visitors, so you can enroll thousands of students. It is possible that these enrolled people are less committed to complete the course, though;
    • The MOOC is about a very trendy topic, say quantum computing, software-defined networks or any other buzzword. It has to be reminded that the majority of "MOOC students" are professionals who want to keep in touch with new topics they heard about;  
    • You make the buzz about your MOOC. We spent only 5 hours advertising and we are not professional. Press and web buzz campaigns is a way. It is also possible to convince fellows from other universities to make their students enroll your MOOC.
  • We would have found a better video format. Let's be honest: without a dedicated team, it is a indecently long and fastidious to create Khan Academy-like videos. We went too much when it was about videos. Compare this video and this other video. It took us 20 minutes to produce each minute of the former video while it took us only 4 minutes for the latter. Is it worthwhile? Based on our experience, it is probably possible to divide by at least 3 the overall time spent on video.
Conclusion
Overall, it was a great experience. We learned a lot about the potentials of such online courses and we had a lot of fun playing with videos. We developed a lot of nice ideas for the next MOOC, and we significantly improved the process of video recording and editing.

But it was also a huge investment. Xavier told me that making this MOOC was as demanding as writing a book. I often compare books with MOOCs when I have to explain our motivations to do MOOC. Both are knowledge, both are supposed to be done by experts, both target a wide population… it seems that both require very committed authors.

January 29, 2013

MOOC and Grandes Ecoles: surfing the tsunami

In North-America, the development of Massive Online Open Courses (MOOC) is seen as a panacea, a way to fix some of the multiple flaws of the higher education system. From a buzz standpoint, this belief culminated when Stanford president claimed "there is a tsunami coming." The debate is less lively in France. Yet, the tsunami would have good chances to affect the French higher education as well.
An originality of the French higher education system is the prominent position of Grandes Ecoles. I am working in one of them, Telecom Bretagne, which is part of the Institut Mines-Telecom.

Risks

My first claim is that Grandes Ecoles are in danger. Reasons include:
  • Grandes Ecoles have to face new competitors. The emergence of start-ups like Udacity and Coursera has transformed Grandes Ecoles into an oligopoly of dinosaurs. Grandes Ecoles are used to the competition among themselves. In short Grandes Ecoles share the "market" of producing highly-qualified professional students, which is a market that universities do not address accurately. The reality is that all Grandes Ecoles have approximately the same offer: roughly same size, same structure, same diploma, same normalized courses. The aforementioned new competitors are start-ups with limitless ambition and nothing to lose. They have almost no administrative cost, they do not waste money in research, they can fail and revise their strategies on a month basis, they can address students worldwide. These start-ups actually shuffle the cards.
  • Grandes Ecoles' main asset is diploma in a certification world. Companies like Cisco and Microsoft have developed professional certification systems for years. Students become super-expert in a given specialty and receive a certificate, which demonstrate their employability. Though, these efforts have never disrupted the high-education system. By offering individual courses, MOOCs challenge again the notion of degree, which is commonly seen as a set of certifications (including many useless ones). The companies that are not convinced by the degree system will find in MOOC a great opportunity to revisit their Human Resources processes and to bypass Grandes Ecoles.
  • Grandes Ecoles are not attractive for the targeted students. Grandes Ecoles are very attractive to bright French students, but MOOC's target population is all over the world. Unfortunately, Grandes Ecoles are visible neither in international rankings, nor on the web. Grandes Ecoles have also not demonstrated strong relationships with companies that really matter to students (especially Apple and Google). Finally, the perspective to live in France during three years for courses that are not all considered as worthwhile is a key weakness.

Opportunities

My second claim is that Grandes Ecoles are in an excellent position. Here is a selection of advantages.
  • Grandes Ecoles are adaptive. These are small institutions, which are directed by managers having a long experience in industry. Grandes Ecoles are far more flexible and reactive than any other institutions. They can re-organize, they can develop strategical plans, they can reinvent themselves and they can embrace new ways to fulfill their missions without delay. MOOC is an opportunity for Grandes Ecoles to develop new businesses and to improve their offers.
  • Grandes Ecoles excel on what complement MOOCs. It is a common understanding that MOOC is about knowledge. A set of MOOCs is not enough to turn students into smart workers. Many other competencies should be developed, including team-working, communication skills, and social networking among classmates. Grandes Ecoles focus on these aspects through project-based pedagogy, personalized and tutored curriculum, campuses designed as learning centers, and good placement in attractive companies. In Europe, Grandes Ecoles excel in all these aspects and find here a way to differentiate in a positive way from other institutions. Grandes Ecoles can leverage MOOC rather than suffering from MOOC.
  • Grandes Ecoles have already a strong relationship with companies. Curricula are typically discussed with companies on a regular basis such that learning matches the requirements of targeted employers. Grandes Ecoles also have developed programs for "continuing education" in relationships with Human Resources. Thus Grandes Ecoles are used to the act of selling learning programs elaborated by their faculties in a business perspective. The diploma is a virtual good that has made sense since the XIXth century, often challenged but never surpassed because companies like employees who are more than just a super-expert in a couple of areas.
The next couple of years will be key for Grandes Ecoles. It will be very interesting to observe what the executives of Grandes Ecoles will do. Undoubtedly, executives will have to be brave if they want to transform their institutions. They have to make Grandes Ecoles able to compete at the planet scale, to leverage their assets, to catch up emerging trends, and to focus on what is really making Grandes Ecoles unique learning places. Strong decisions will have to be taken. For example: giving up with academic research to re-focus on education, cutting faculty jobs in departments that have no activities in core scientific domains, developing business related to buying/selling MOOCs...

This increasingly frustrating peer review process

Academic people barely share their bad personal experiences related to peer reviewing. But everybody has papers rejected in conferences… and these decisions sometimes generate legitimate frustration since they seem to be due to some "random bad outcome from this plain old flawed reviewing process". On my side, I have the feeling that reviewing process is getting worse and worse. I am not alone. Following this example, I describe below some recent reject notifications that illustrate some of these flaws. And I propose some ways to fix them.

The un-rebutted rebuttal
In 2012, both ICME and Sigcomm conferences introduced a rebuttal in the reviewing process. I know a lot of scientists who call for such rebuttal process. Unfortunately, my experience of rebuttal was absolutely disastrous on both cases. It is interesting to note that these conferences are definitely not in the same league.

For ICME, I suspect one of the reviewer to be a weak graduate student: he gave us a strong reject based on his claim that one of the four proofs of the paper was wrong on a specific equation. Unfortunately his mathematical statement was false. This bad review was the perfect case where a rebuttal can help to fix a clear misunderstanding and a wrong analysis. We spent a significant part of our rebuttal trying to politely fix the mathematical error of this reviewer. Hélas, we received our negative notification. The reviewers did not change any word of their review. And the meta-reviewer gave us this unforgivable remark: "The authors thinks that the reviewer 2 misunderstand the work in this paper. From the comment, the reviewer should be an expert in this field". This meta-reviewer does not understand rebuttal, does he?

For Sigcomm, one of the reviewers claimed that our 14-pages long proposal can be done by tweaking another existing system. More precisely, the reviewer "believes that with simple changes to your problem, one can use the [other] system to tackle it, probably by just changing the utility function." We knew well this said other system… and we double-checked again. No, there is no way, both papers share some words, but they are like apples and oranges. However, this was the main strong drawback raised by this reviewer, so we were full of hope that we could make our case by carefully explaining the differences with this previous work. Hélas, triple hélas, one month later, the reviews arrived, unchanged.

In both cases, rebuttals came back without any changes, even when we highlighted some major wrong analysis.

Proposal: I don't believe much in rebuttal, but at least this proposal deserves a better implementation. In particular, reviewers must address the remarks that authors made about their reviews.


The anonymous reviewer
We submitted a reasonable paper to a special issue of IEEE Transactions on Multimedia. One reviewer was vaguely positive, one reviewer was vaguely negative, and then came the third reviewer… This guy did not find any positive comment to do. It looks like none of these 14 pages was worth anything. Moreover, all his negative comments were excessively aggressive and mostly based on wrong self-proclaimed facts. The review was just a piece of harsh and assertive remarks. This paper was not a Nobel Prize, for sure, but it was a honest, valid paper, with a motivation based on a series of observations from well-established measurement systems, some theoretical developments, and a non-trivial simulation. Maybe not worth a publication in this journal, but why so much hate?

One well-known issue of peer reviewing in computer science is the excessive harshness of reviewers, often young scientists, comfortably protected by the anonymity. In the excellent "Guide for Peer Reviewing", it is said that, as far as possible, the first paragraph of a review should summarize the goals, approaches and conclusions of the paper (including positive assessments) while the second paragraph should provide a conceptual overview of the contribution.

Proposal: Some reviewers would be less assertive, and less aggressive if there were any probability that their identity would be revealed. Why not having a "out of the k reviews you do for a conference, one of them will be randomly chosen to be de-anonymized." Or a "one out of ten reviews are de-anonymized".

The no-room-for-cold-topics program chair
We sent a P2P paper to Globecom, although it is well-known that P2P is now a very cold topic. We received two clearly positive reviews, and one review slightly more negative in the grades, but with comments like "The addressed problem is relevant, the paper is well-written and technically solid". Globecom has a 37% acceptance ratio, but despite these grades, our paper has been rejected. My first reject at Globecom.

I asked some additional explanations to the TPC chair, and he kindly answered that "in the confidential comments, there was a voiced concern about novelty". In other words, it seems that anonymity is not enough for reviewers, they still require an even more anonymous place to assess the judgements they are the less proud of. According to the guide of peer reviewing, the "confidential comments" are just a bad habit, which affects the overall transparency of the reviewing process. On my side, I never use it, and I don't find any convincing point for using it.

Proposal: ban the confidential comments.

December 5, 2012

Brewing storm on cloud gaming. Are CDNs the saviors?

Cloud gaming has the potential to become a revolution in the way games are developed and distributed. Instead of requiring end-users to buy powerful computers to play modern games, cloud gaming performs the game computation remotely with the resulting output streamed as a video back to the end-users. Cloud gaming is thus expected to meet the demand of both gamers (who want platform independence), and game developers (who want to reduce their development cost and to gain flexibility in game updating). The enthusiasm has however been severely chilled when the main actor in the area, namely OnLive, ran out of money.

This debacle is not surprising. Many of cloud computing’s core design tenets conflict with cloud gaming. Cloud providers only offer general purpose computing resources that are located in a relatively small number of large data-centers. Unfortunately, these architectural decisions are in conflict with the needs of cloud gaming, which are interactive (hence highly latency-sensitive), and require specialized hardware resources, such as GPU and fast memory. Furthermore, many cloud data-center locations are chosen to minimize cooling and electricity costs, rather than to minimize latency to end-users.

Despite these drawbacks, many analysts still believe in cloud gaming. In a near future, the number of users served by physical machines should grow, data-centers should include GPUs, game engine should be re-designed… OnLive may just have been too early.

Still, the question of the number and the location of data-centers remains. Past studies have found that players begin to notice a delay of 100 ms. However, at least 60 ms of this latency should be attributed to playout and processing delay. Therefore, 40 ms is the threshold network latency that begins to appreciably affect user experience. 

In a recent academic paper, which was presented during the ACM Netgames conference, we have performed a large-scale measurement study to determine :
  • the percentage of population that can be served with today's cloud infrastructure. With EC2 infrastructure, less than 40% of population can play highly interactive games (network response time 40 ms), and only two third of population can play the least demanding games (network response time 80 ms). See Figure below.
Population covered by EC2 cloud infrastructure
  • the number of data-centers that are required to have a decent population coverage. Unfortunately even if 20 data-centers are deployed, less than half of population would have a network response time of 40 ms.
  • the gain of using a CDN infrastructure. They are significant. Today's CDN servers do not host GPU, and they are not designed to serve a very small number of users (only 8 users per server with state-of-the-art technologies). But who knows what will be CDN next strategical moves? Our study shows that embracing cloud gaming is a very good idea for CDNs, isn't it?
Population covered if EC2 is augmented with CDN smart edges

October 30, 2012

Lessons learned at UWaterloo (2nd part): research organization

Here is the second post about my experience at University of Waterloo. After the ode to the co-operation education program, here is another positive observation related to research organization. All in one, I have the feeling that the time spent in meetings by researchers in North-America is four times less than their European counterparts. I wish statistics could support this claim. Why so? I identified at least two explanations.

Firstly the structures fundamentally differ between American and European research institutions. It might sound like a caricature, but Americans promote individual successes while Europeans build large corporations. A research department in an American university is an aggregation of independent researchers, who manage their own team of students and research fellows with their own budget, and who develop their own line of research. In Europe, senior researchers should gather into so-called research teams, which are expectedly consistent. European teams should define a strategy, share budgets and generate activity reports to justify they still deserve to exist. They are regularly challenged by numerous "administrative research managers", who are no longer researchers, but whose job is to "organize". Obviously, I think that the model where a department is like an incubator of entrepreneurial researchers is the right model. American researchers focus on their team, spend time on their own activities and are committed to succeed in academy by any mean. European researchers waste their time in meetings and in bureaucratic activities. Furthermore, the former model enables paradoxically better collaborations among researchers because these spontaneous collaborations are not based on any explicit agreement.

Secondly, research funding target individuals, not collaboration. Europe is crazy about "calls for collaborative projects". Our beloved European funders seem convinced that the only way to do research is to make people work together on a well-defined topic. European companies interested in academic research contribute to research through collaboration in these projects. I will again exaggerate a bit, but collaborative projects are not the norm in America. Researchers get small amounts of money from companies through direct grants, which favor transient, short, focused cooperation. And they also receive individual grants from their public funding agencies. Such model does not force researchers to waste a significant amount of time at collective writing, synchronization meetings and expense justification. Ask American researchers who experienced European projects if they want to do it again. Their answers are likely to be harsh about this crazy administrative nightmare.

I found this research organization far more efficient since it allows researcher to focus on their core activities. And I am afraid that the situation gets especially worse in France because I see a growing number of "administrative managers" who gravitate around the academic world. They are expected to be "interface" between researchers and funders, but their job (to organize researchers and to set up "calls for projects") actually interfere with researchers.

October 19, 2012

Lessons learned at UWaterloo (1st part): the Co-operative Education program

I spent one year at University of Waterloo in the Electrical and Computer Engineering department. I will try to extract from this fruitful experience a small set of short lessons. Here is the first one, an ode to the so-called Co-operative Education system.

What Waterloo calls a co-operative education system is a dual education system, which combines academic studies and professional works. We also have such program at Telecom Bretagne (my employer), see this link (only in french). All undergraduate programs are "co-op" at Waterloo. According to rankings based on surveys, it works pretty well for them (#29 for Business Insiders).

In France, dual education system at the graduating level is not the most prestigious curriculum. The "royal way" consists of two harsh years focusing on abstract mathematical concepts, a success in a ultra-competitive exam, then three years in an engineering schools at socializing (i.e. partying) and specializing. Students enrolled in dual education system are not considered as the best because they have not demonstrated outstanding scholar skills at the ultra-competitive exam. I already expressed serious concerns about such curriculum in a modern (i.e. computer oriented) society.

On my side, if I had to hire one engineer in my research team or in a start-up, I would definitely hire a "dual education" engineer. As far as I can see at Telecom Bretagne and UWaterloo, students have a lighter scientific background on the fundamental areas. However, they just program well. And they just work well. I have to admit that, like probably most companies, I value "programming well and working well" far higher than "having a strong background on fundamental scientific areas".

Dual education is an efficient way to teach software programming and project management, since, in general:
  • Teachers don't code. You can't count on them to learn programming tricks.
  • Teachers can't catch up all new technologies. Nobody is an expert in MapReduce, Ajax, and ObjectiveC at the same time. You can't expect that from teachers neither.
During academic terms, teachers can focus on the fundamental of programming and they can skip the courses about languages and technologies. During their terms spent in companies, students can discover the latest technologies and code with professionals. Good match, ins't it?

July 11, 2012

On the pivotal role of post-doc in research groups

A sabbatical is a great opportunity to study the internal process of other research groups. For a young European scientist like me, there is much to get from observing what American professors implement for the management of their large teams. In general, academic people are reluctant to apply industry-like management processes, which are supposed to prevent groups to be creative and lively. However, the research is becoming objective-driven, and the raise of grant agencies and project-oriented funding makes that an unexperienced young professor can now receive enough money to hire a large team of researchers. There is a need to address the tabooed topic of research group management.

A unique characteristic of research groups is the huge gap between a graduate "master" student, who has some scientific background but actually knows nothing, and a senior post-doc, who is supposed to be a peer, fellow researcher. One of the purposes of a research group is to assist the development of new skills… and to let people leave when they are, at last, autonomous and efficient.

It is my understanding that post-docs are the most challenging to work with. Some of them are super-PhD, while some others are rather mini-professor. Most of them do no longer require to be "tutored", but they still have to acquire some critical skills. In particular post-docs should ideally develop their own ideas from scratch, tutor younger students, actively contribute to industrial collaborative project, and expand their professional network.

On the one hand, delegating is dangerous for a professor. If a collaborative project fails, the one that will be directly affected in the future is the professor, not the post-doc. If young students spend too much time working on barely publishable ideas, utilize old-school technologies and miss latest exciting papers, it is the professor who eventually has to make the student catch up, not the post-doc. On the other hand, delegating is a real chance to expand the research group, to work on new areas, and to offer the opportunity to the new post-doc to acquire critical skills/experiences she misses.

For these reasons, the management of a post-doc is very touchy. On my side, I consider the following:
  • before opening a post-doc position, I will clearly define my needs. I identified at least three critical needs that might pop up someday, and that would justify opening a post-doc position:
    • I want to be assisted for tutoring young students
    • I want to partly delegate the management of a heavy collaborative project
    • I want to explore a well-identified brand new topic
  • once my needs identified, I hope that finding a matching post-doc will be easier. Compared to my previous experience, I will pay more attention to the actual motivations of candidates, and I will pay less attention to their background or their past achievements. Of course it is important to know whether the candidate can write paper or produce experimental stuff at the expected level of quality for a top academic conferences, but it is even more important to know what is the status of the candidates regarding the need.
  • then, if objectives are clear and well-understood on both sides, I guess the collaboration shall be more productive.