* strtok - strings without delimiters
@ 2013-05-12 20:22 Georg Sauthoff
[not found] ` <20130512202212.GL21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org>
0 siblings, 1 reply; 4+ messages in thread
From: Georg Sauthoff @ 2013-05-12 20:22 UTC (permalink / raw)
To: mtk.manpages-Re5JQEeQqe8AvxtiuMwx3w; +Cc: linux-man-u79uwXL29TY76Z2rM5mHXA
Hi,
the strtok man-page does not specify what happens if `str` does not
contain any `delim`iters.
If you don't already know strtok one might assume then either:
- the delimiter-less string is considered the next token and thus
returned upon the first call to strtok
- on the first call NULL is returned because there are not any tokens
Related issue: is the non-empty `str`-suffix after the last `delim`
returned as a token or not?
I know how strtok works - but perhaps it makes sense to explicitly
address these issues in the Linux man page.
Best regards
Georg
--
To unsubscribe from this list: send the line "unsubscribe linux-man" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread[parent not found: <20130512202212.GL21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org>]
* Re: strtok - strings without delimiters [not found] ` <20130512202212.GL21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org> @ 2013-05-19 20:46 ` Michael Kerrisk [not found] ` <51993A39.3060606-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> 0 siblings, 1 reply; 4+ messages in thread From: Michael Kerrisk @ 2013-05-19 20:46 UTC (permalink / raw) To: Georg Sauthoff; +Cc: linux-man-u79uwXL29TY76Z2rM5mHXA Hello Georg, On 05/12/13 22:22, Georg Sauthoff wrote: > Hi, > > the strtok man-page does not specify what happens if `str` does not > contain any `delim`iters. > > If you don't already know strtok one might assume then either: > > - the delimiter-less string is considered the next token and thus > returned upon the first call to strtok > - on the first call NULL is returned because there are not any tokens > > Related issue: is the non-empty `str`-suffix after the last `delim` > returned as a token or not? Thanks for the report, though I must say that it would have been a little more helpful if you had answered the rhetorical questions you ask... (It would have aided me as I reminded myself of the details.) > I know how strtok works - In which case a patch would have made the bug report even better still! > but perhaps it makes sense to explicitly > address these issues in the Linux man page. Anyway, I agree that the page could be clearer. I've added the following text: A sequence of calls to strtok() that operate on the same string maintains a pointer that determines the point from which to start searching for the next token. The first call to strtok() sets this pointer to point to the first byte of the string. The start of the next token is determined by scanning forward for the next nondelimiter byte in str. If such a byte is found, it is taken as the start of the next token. If no such byte is found, then there are no more tokens, and strtok() returns NULL. (A string that is empty or that contains only delimiter will thus cause strtok() to return NULL on the first call.) The end of each token is found by scanning forward until either the next delimiter byte is found or until the terminating null byte ('\0') is encountered. If a delimiter byte is found, it is overwritten with a null byte to terminate the current token, and strtok() saves a pointer to the following byte; that pointer will be used as the starting point when searching for the next token. In this case, strtok() returns a pointer to the start of the found token. From the above description, it follows thatt a sequence of two or more contiguous delimiter bytes in the parsed string is con‐ sidered to be a single delimiter, and that delimiter bytes at the start or end of the string are ignored. Put another way: the tokens returned by strtok() are always nonempty strings. Thus, for example, given the string "aaa;;bbb,", successive calls to strtok() that specify the delimiter string ";," would return the strings "aaa" and "bbb", and then a NULL pointer. Sufficient? Cheers, Michael -- To unsubscribe from this list: send the line "unsubscribe linux-man" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 4+ messages in thread
[parent not found: <51993A39.3060606-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>]
* Re: strtok - strings without delimiters [not found] ` <51993A39.3060606-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> @ 2013-06-01 20:58 ` Georg Sauthoff [not found] ` <20130601205854.GP21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org> 0 siblings, 1 reply; 4+ messages in thread From: Georg Sauthoff @ 2013-06-01 20:58 UTC (permalink / raw) To: Michael Kerrisk; +Cc: linux-man-u79uwXL29TY76Z2rM5mHXA On Sun, May 19, 2013 at 10:46:49PM +0200, Michael Kerrisk wrote: Hi, > On 05/12/13 22:22, Georg Sauthoff wrote: > > the strtok man-page does not specify what happens if `str` does not > > contain any `delim`iters. > > If you don't already know strtok one might assume then either: > > - the delimiter-less string is considered the next token and thus > > returned upon the first call to strtok > > - on the first call NULL is returned because there are not any tokens > > Related issue: is the non-empty `str`-suffix after the last `delim` > > returned as a token or not? > Thanks for the report, though I must say that it would have been a > little more helpful if you had answered the rhetorical questions you > ask... (It would have aided me as I reminded myself of the details.) oh, sorry, you are right. > > I know how strtok works - > > In which case a patch would have made the bug report even better still! Ok, next time. > > but perhaps it makes sense to explicitly > > address these issues in the Linux man page. > Anyway, I agree that the page could be clearer. I've added the following text: > A sequence of calls to strtok() that operate on the same string > maintains a pointer that determines the point from which to > start searching for the next token. The first call to strtok() > sets this pointer to point to the first byte of the string. > The start of the next token is determined by scanning forward > for the next nondelimiter byte in str. If such a byte is > found, it is taken as the start of the next token. If no such > byte is found, then there are no more tokens, and strtok() > returns NULL. (A string that is empty or that contains only > delimiter will thus cause strtok() to return NULL on the first plural, delimiters? > call.) > > The end of each token is found by scanning forward until either > the next delimiter byte is found or until the terminating null > byte ('\0') is encountered. If a delimiter byte is found, it > is overwritten with a null byte to terminate the current token, > and strtok() saves a pointer to the following byte; that > pointer will be used as the starting point when searching for > the next token. In this case, strtok() returns a pointer to > the start of the found token. > From the above description, it follows thatt a sequence of two typo: /thatt/that/ > or more contiguous delimiter bytes in the parsed string is con‐ > sidered to be a single delimiter, and that delimiter bytes at > the start or end of the string are ignored. Put another way: > the tokens returned by strtok() are always nonempty strings. > Thus, for example, given the string "aaa;;bbb,", successive > calls to strtok() that specify the delimiter string ";," would > return the strings "aaa" and "bbb", and then a NULL pointer. > Sufficient? Yes! Best regards Georg -- To unsubscribe from this list: send the line "unsubscribe linux-man" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 4+ messages in thread
[parent not found: <20130601205854.GP21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org>]
* Re: strtok - strings without delimiters [not found] ` <20130601205854.GP21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org> @ 2013-06-02 12:28 ` Michael Kerrisk 0 siblings, 0 replies; 4+ messages in thread From: Michael Kerrisk @ 2013-06-02 12:28 UTC (permalink / raw) To: Georg Sauthoff; +Cc: Michael Kerrisk, linux-man-u79uwXL29TY76Z2rM5mHXA Hi Georg, On 06/01/13 22:58, Georg Sauthoff wrote: > On Sun, May 19, 2013 at 10:46:49PM +0200, Michael Kerrisk wrote: [...] >>> but perhaps it makes sense to explicitly >>> address these issues in the Linux man page. > >> Anyway, I agree that the page could be clearer. I've added the following text: > >> A sequence of calls to strtok() that operate on the same string >> maintains a pointer that determines the point from which to >> start searching for the next token. The first call to strtok() >> sets this pointer to point to the first byte of the string. > >> The start of the next token is determined by scanning forward >> for the next nondelimiter byte in str. If such a byte is >> found, it is taken as the start of the next token. If no such >> byte is found, then there are no more tokens, and strtok() >> returns NULL. (A string that is empty or that contains only >> delimiter will thus cause strtok() to return NULL on the first > > plural, delimiters? Fixed now. >> call.) >> >> The end of each token is found by scanning forward until either >> the next delimiter byte is found or until the terminating null >> byte ('\0') is encountered. If a delimiter byte is found, it >> is overwritten with a null byte to terminate the current token, >> and strtok() saves a pointer to the following byte; that >> pointer will be used as the starting point when searching for >> the next token. In this case, strtok() returns a pointer to >> the start of the found token. > >> From the above description, it follows thatt a sequence of two > > typo: /thatt/that/ Looks like I already fixed that one. >> or more contiguous delimiter bytes in the parsed string is con‐ >> sidered to be a single delimiter, and that delimiter bytes at >> the start or end of the string are ignored. Put another way: >> the tokens returned by strtok() are always nonempty strings. >> Thus, for example, given the string "aaa;;bbb,", successive >> calls to strtok() that specify the delimiter string ";," would >> return the strings "aaa" and "bbb", and then a NULL pointer. > >> Sufficient? > > Yes! Good. thank you for checking the text. Cheers, Michael -- To unsubscribe from this list: send the line "unsubscribe linux-man" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2013-06-02 12:28 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-05-12 20:22 strtok - strings without delimiters Georg Sauthoff
[not found] ` <20130512202212.GL21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org>
2013-05-19 20:46 ` Michael Kerrisk
[not found] ` <51993A39.3060606-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2013-06-01 20:58 ` Georg Sauthoff
[not found] ` <20130601205854.GP21458-w6h1d7rV6+VwFLYp8hBm2A@public.gmane.org>
2013-06-02 12:28 ` Michael Kerrisk
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).