A tiny case study in why you shouldn't autogenerate documentation

Cabbage

Seatwave's API documentation has got some sweet autogeneration of descriptions going on. That GetEventByID method? It Gets the event by ID. The GetUpdatedEvents method? It Gets the updated events. The GetEventsForEventGroup method? It Gets the events for event group. Isn't documentation great and helpful?

How about the EventsSearch method? Let's take a look at https://developer.seatwave.com/API/method/EventsSearch?apiName=discovery...

0_1517773360758_d8b0e9e8-a76a-45e3-a6bd-77473d99ec80-image.png

Hmm.

Cabbage

Additional detail: it's been this way since I encountered the API in 2012. Apparently they've managed to survive for 6 years as a business but at no point has anybody on their dev team felt any need to fix this.

Tsaukpaetra

@cabbage said in A tiny case study in why you shouldn't autogenerate documentation:

Additional detail: it's been this way since I encountered the API in 2012. Apparently they've managed to survive for 6 years as a business but at no point has anybody on their dev team felt any need to fix this.

What's the fix? Rename the function to SearchEvents? That would break everything that uses that function! Whargagablargh!

Zecc

@cabbage Also this:

0_1517775072711_f2bfd1c5-e131-4804-8d6b-019831906373-image.png

Gurth

@tsaukpaetra said in A tiny case study in why you shouldn't autogenerate documentation:

What's the fix? Rename the function to SearchEvents? That would break everything that uses that function! Whargagablargh!

function EventsSearch(foo, bar) {
    /* Fix for faulty auto-generated documentation: keep the old name as a thin wrapper */
    return SearchEvents(foo, bar);
}

Maciejasjmj

Eventses the search
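For anyone wondering how a tool lands on that wording, here is a guess at the logic, not Seatwave's actual generator: split the method name on capital letters, blindly treat the first word as a verb, and bolt "the" on after it. The function names below (splitWords, thirdPerson, describe) are made up for this sketch.

```javascript
// Hypothetical sketch of a naive description generator.
// This is an assumption about how such tools work, not Seatwave's real code.

// Split a PascalCase/camelCase name into words, keeping acronyms like "ID" whole.
function splitWords(name) {
  return name.match(/[A-Z]+(?![a-z])|[A-Z][a-z]*|[a-z]+|\d+/g) || [];
}

// Blindly assume the word is a verb and put it in the third person.
function thirdPerson(word) {
  return /(s|x|z|ch|sh)$/i.test(word) ? word + "es" : word + "s";
}

// Build the "description" from nothing but the method name.
function describe(methodName) {
  const [first, ...rest] = splitWords(methodName);
  if (!first) return "";
  if (rest.length === 0) return thirdPerson(first);
  // Lowercase ordinary words, leave all-caps acronyms alone.
  const tail = rest.map((w) =>
    w.length > 1 && w === w.toUpperCase() ? w : w.toLowerCase()
  );
  return [thirdPerson(first), "the", ...tail].join(" ");
}

console.log(describe("GetEventByID"));  // Gets the event by ID
console.log(describe("EventsSearch"));  // Eventses the search
```

The output never contains anything that was not already in the name, which is the whole problem.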

Yeah...

0_1517830222429_d259d9d2-ce94-44f8-a543-16d98b8ff817-image.png

0_1517830547653_8eefd3ae-7f78-4a56-b999-56df4f92c7af-image.png

0_1517831102901_653178e9-fe41-43e6-86fb-604042e61bd4-image.png

Also this:

0_1517831484460_1ddcb2a7-01f7-4b95-9c16-ad3355ab7fcb-image.png

Filed under: Returns: the result

TheCPUWizard

@maciejasjmj - Yes, default auto-generation can be bad. A few modern tools are getting better at the "English", and are also providing tracking for which elements have been reviewed/accepted by a human.

Is this thread a "Tiny Case Study on the Risks of AutoGenerated Documentation"? DEFINITELY. But provided one is diligent, it can be a great way to get to an initial "alpha" starting point.

Maciejasjmj

@thecpuwizard said in A tiny case study in why you shouldn't autogenerate documentation:

But provided one is diligent, it can be a great way to get to an initial "alpha" starting point.

I don't see the point. With no documentation, you provide zero information over what the signature can tell you. With autogenerated documentation, you provide zero information over what the signature can tell you.

Adynathos

Someone mandated that every function should be documented.

TheCPUWizard

@maciejasjmj said in A tiny case study in why you shouldn't autogenerate documentation:

@thecpuwizard said in A tiny case study in why you shouldn't autogenerate documentation:

But provided one is diligent, it can be a great way to get to an initial "alpha" starting point.

I don't see the point. With no documentation, you provide zero information over what the signature can tell you. With autogenerated documentation, you provide zero information over what the signature can tell you.

With the better tools, it goes beyond what the signature will tell you (but clearly is still limited to what the "code" will tell you).

The bigger element is exposure (availability of the information). As comments before the method, there is no real value, but process those comments into a rich hyperlinked "document" (or site, or wiki, or whatever) and the value can be quite significant, especially with larger codebases, where it is quite common not to have all of the source code on your machine.

The_Quiet_One

@maciejasjmj said in A tiny case study in why you shouldn't autogenerate documentation:

@thecpuwizard said in A tiny case study in why you shouldn't autogenerate documentation:

But provided one is diligent, it can be a great way to get to an initial "alpha" starting point.

I don't see the point. With no documentation, you provide zero information over what the signature can tell you. With autogenerated documentation, you provide zero information over what the signature can tell you.

He said it's a good starting point, and I agree. With many APIs many functions are self-explanatory anyways. I mean, for getEventByID(int id), I don't expect a novel, and if you do need a novel to explain stuff about this function, chances are your API is an utter failure. So, for those, auto-generation with an obvious description of the function and its arguments would suffice. Having someone spend tedious time manually constructing them is pointless.

And obviously if your API is a little more than a simple CRUD, you'll have certain functions that require a little more expansion and description. That's where you have a decent technical writer flesh that out.

JBert

@maciejasjmj said in A tiny case study in why you shouldn't autogenerate documentation:

@thecpuwizard said in A tiny case study in why you shouldn't autogenerate documentation:

But provided one is diligent, it can be a great way to get to an initial "alpha" starting point.

I don't see the point. With no documentation, you provide zero information over what the signature can tell you. With autogenerated documentation, you provide zero information over what the signature can tell you.

I see (poor) autogenerated documentation as a negative because now you will no longer get the "Missing XML comment for publicly visible type or member" warning.

Maciejasjmj

@the_quiet_one said in A tiny case study in why you shouldn't autogenerate documentation:

He said it's a good starting point, and I agree. With many APIs many functions are self-explanatory anyways.

Then they don't need any documentation beyond the method signature. (Usually they do, though. What happens if your getEventByID(int id) receives an ID for a nonexistent event? Does it throw? Does it return null? Does it return a null object?)

Autogenerated documentation is just deluding yourself into thinking you have documentation when in fact, you don't. It's not even that you have bad documentation, you have no documentation.
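To make the nonexistent-ID question concrete, here is a sketch of a comment that actually answers it. Everything in it is an assumption for illustration: the in-memory events map, the sample event, and the choice to return null rather than throw. A real API could pick differently; the point is only that the comment says which one it picked.

```javascript
// Hypothetical in-memory store; the data and the chosen behaviour are
// assumptions made up for this example, not any real API's contract.
const events = new Map([[1, { id: 1, name: "Example concert" }]]);

/**
 * Looks up a single event.
 *
 * @param {number} id - Positive integer event ID.
 * @returns {{id: number, name: string} | null} The event, or null when no
 *   event has that ID. Never returns a placeholder "null object".
 * @throws {TypeError} If id is not a positive integer.
 */
function getEventByID(id) {
  if (!Number.isInteger(id) || id <= 0) {
    throw new TypeError("id must be a positive integer");
  }
  return events.has(id) ? events.get(id) : null;
}
```

None of the three behaviours in that comment can be read off the signature, and none of them would come out of a name-based generator.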

The_Quiet_One

@maciejasjmj said in A tiny case study in why you shouldn't autogenerate documentation:

@the_quiet_one said in A tiny case study in why you shouldn't autogenerate documentation:

He said it's a good starting point, and I agree. With many APIs many functions are self-explanatory anyways.

Then they don't need any documentation beyond the method signature. (Usually they do, though. What happens if your getEventByID(int id) receives an ID for a nonexistent event? Does it throw? Does it return null? Does it return a null object?)

Usually you're going to be consistent about basic CRUD operations, so the auto-generated documentation can just say what it does for each getByID. If you have CRUD operations for dozens of objects, do you really think it's a better thing to have someone just type out "This will throw if the ID doesn't exist" over and over for each getByID function?

Autogenerated documentation is just deluding yourself into thinking you have documentation when in fact, you don't. It's not even that you have bad documentation, you have no documentation.

It only deludes you if you're delusional. I work for a group that actually thinks about things and knows the tools they use, not relying on auto-generation for things it's not designed for. You know, like pretty much every tool in existence, there are times using it is good, and times when it's not. You're making the usual fallacy that because stupid people might misuse a tool, it means the tool itself is bad.
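A rough sketch of what the "consistent CRUD" argument looks like in practice: a person writes the shared contract once, and the tool stamps it onto every getByID. The template text, the getByIdDoc helper, and the NotFoundError name are all hypothetical.

```javascript
// Hypothetical template: the contract is written once by a human and reused
// for every entity. NotFoundError is an assumed error type for this sketch.
function getByIdDoc(entity) {
  return [
    `Gets the ${entity} with the given ID.`,
    `@param {number} id - ID of the ${entity} to fetch.`,
    `@throws {NotFoundError} If no ${entity} has that ID.`,
  ].join("\n");
}

console.log(getByIdDoc("event"));
console.log(getByIdDoc("venue"));
```

The useful line here is the @throws one, and it comes from the human-written template rather than from the method name.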

TheCPUWizard

@the_quiet_one said in A tiny case study in why you shouldn't autogenerate documentation:

@maciejasjmj said in A tiny case study in why you shouldn't autogenerate documentation:

@thecpuwizard said in A tiny case study in why you shouldn't autogenerate documentation:

But provided one is diligent, it can be a great way to get to an initial "alpha" starting point.

I don't see the point. With no documentation, you provide zero information over what the signature can tell you. With autogenerated documentation, you provide zero information over what the signature can tell you.

He said it's a good starting point, and I agree.

Thanks!

With many APIs many functions are self-explanatory anyways. I mean, for getEventByID(int id), I don't expect a novel, and if you do need a novel to explain stuff about this function, chances are your API is an utter failure.

But is that self-explanatory? From the signature I cannot tell what will happen if you pass in an int which does not map to an Event.

The_Quiet_One

@thecpuwizard Yeah, read further. @Maciejasjmj brings that up, and I address it.

Greybeard

If you can’t document anything nice...

hungrier

@greybeard

/// Nices the documentation

Magus

I'm very much in favor of tools that don't generate any text in those tags. StyleCop analyzers may force some super-repetitive wording, but I find that that's as much help as I even want anyone to have writing comments. You have to actually think about what you're writing.

Too bad no one ever wants me to turn on Doxygen...

TheCPUWizard

@the_quiet_one said in A tiny case study in why you shouldn't autogenerate documentation:

@thecpuwizard Yeah, read further. @Maciejasjmj brings that up, and I address it.

You traveled back in time and pre-posted me ;)

Again, that ignores the linking capabilities when the comments are processed, as well as the ability of documentation generators to examine the body of the code (and base classes, and the classes used as parameters, et al.) to provide the information in a clear, concise form.

masonwheeler

What's eventses, precious?

https://nerdist.com/wp-content/uploads/2017/03/gollum-smeagol-singing.jpg

remi

@jbert said in A tiny case study in why you shouldn't autogenerate documentation:

I see (poor) autogenerated documentation as a negative because now you will no longer get the "Missing XML comment for publicly visible type or member" warning.

Case in point: I worked for a project that decided (years and years ago) that all functions should be documented and thus tuned whatever build script they were using at that time to spew a warning for each undocumented function.

As a result, one smart-ass ran a script that added a documentation block to each function, which contained "UNDOCUMENTED" and nothing else.

And thus all functions became magically documented, no more warnings were spewed, and the manager could happily claim that all the code was documented.
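A build check could at least refuse to count that kind of block as documentation. The sketch below is a made-up heuristic, not a feature of any particular compiler or linter: it treats a comment as missing when it is empty, is just "UNDOCUMENTED", or uses no words beyond the function's own name plus filler. The FILLER list and the function names are assumptions.

```javascript
// Hypothetical lint heuristic: does this doc comment say anything the
// function name does not already say?
const FILLER = new Set(["the", "a", "an", "this", "method", "function"]);

// Words of a PascalCase/camelCase identifier, lowercased.
function nameWords(name) {
  const parts = name.match(/[A-Z]+(?![a-z])|[A-Z][a-z]*|[a-z]+|\d+/g) || [];
  return parts.map((w) => w.toLowerCase());
}

// Words of the comment, lowercased, with filler words dropped.
function docWords(doc) {
  const parts = doc.toLowerCase().match(/[a-z0-9]+/g) || [];
  return parts.filter((w) => !FILLER.has(w));
}

function addsNothing(name, doc) {
  const words = docWords(doc);
  if (words.length === 0 || words.join(" ") === "undocumented") return true;
  const known = nameWords(name);
  // A comment word is "covered" if it is a name word, optionally with s/es added.
  return words.every((w) =>
    known.some((k) => w === k || w === k + "s" || w === k + "es")
  );
}

console.log(addsNothing("EventsSearch", "Eventses the search"));                  // true
console.log(addsNothing("GetEventByID", "Returns null if no event has that ID")); // false
```

It is easy to fool, of course; whoever wrote the "UNDOCUMENTED" script would just write another one.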

TheCPUWizard

@remi said in A tiny case study in why you shouldn't autogenerate documentation:

@jbert said in A tiny case study in why you shouldn't autogenerate documentation:

I see (poor) autogenerated documentation as a negative because now you will no longer get the "Missing XML comment for publicly visible type or member" warning.

Case in point: I worked for a project that decided (years and years ago) that all functions should be documented and thus tuned whatever build script they were using at that time to spew a warning for each undocumented function.

As a result, one smart-ass ran a script that added a documentation block to each function, which contained "UNDOCUMENTED" and nothing else.

And thus all functions became magically documented, no more warnings were spewed, and the manager could happily claim that all the code was documented.

Unfortunately not uncommon. This is why "Generation tooling" that can examine comments to see if they have been reviewed is so helpful.

dkf

@the_quiet_one said in A tiny case study in why you shouldn't autogenerate documentation:

do you really think it's a better thing to have someone just type out "This will throw if the ID doesn't exist" over and over for each getByID function?

Hey, that's a use for cut-n-paste coding!

The awkward bit of documentation isn't “what does this function do” but “how do I use this function with the others”. It's very easy to have functions whose individual purposes are simple but whose overall pattern is not.

Gąska

Most of these stupid doc lines wouldn't be a problem if developers remembered that all function names must be verbs.

Adynathos

@thecpuwizard said in A tiny case study in why you shouldn't autogenerate documentation:

Unfortunately not uncommon. This is why "Generation tooling" that can examine comments to see if they have been reviewed is so helpful.

Don't worry, someone will run a script to review the whole code.