2022-04-10, 05:52 PM
I'm using the DVB EPG from UK Freeview, and have noticed that while for many/most shows any series+episode numbering contained in the programme's description is identified and extracted into "proper" episode numbering - there are quite a lot of shows where this is not happening even though the relevant information *is* in the description.
Based on which are and aren't, it looks like just a case of slightly differing formats and I'm hoping that the appropriate regex (I'm assuming that's how it's done) can be updated/refined in a future version to handle these additional cases...
Example working formats: "...(S1 Ep1/8)..." & "...(S1 Ep1)..." & "...S1 Ep1"
Example formats not being recognised:
"...(S1 Ep 1)..." - e.g. used by '5USA' channel
"...(S1, ep 1)..." - e.g. used by 'Pick' & 'Sky Arts' channels
"...S1, ep 1/8..." - e.g. used by 'Forces TV' channel (at least 1 series but not sure if it's common - just found it when I searched on "S1," to find examples!)
It may just be the space before the episode number throwing it off (only factor common to them all), though the comma after the series number &/or the lowercase "ep" might also need checking! (If it isn't simply this difference in formats, I'm happy to provide fuller source data or whatever else might be needed for diagnosis.)
Thanks,
Steve
P.S. - there's also lots and lots of programmes (e.g. most BBC series) that only have episode numbers but no series info, where the description starts (e.g.) "1/8. Blah blah..."
Obviously without any series data I'm not sure it makes sense to pull out the numbering data in these cases - but I wonder if using this to create a *subtitle* of "Episode 1" (assuming no subtitle already exists) would be useful in terms of how it would be presented in the NPVR interface and/or in other front-end clients (e.g. Kodi/Plex/whatever) that people use to build libraries from the NPVR recording folder(s)? It might also help with any matching to online sources like TMDB, as this is commonly used where episodes are not individually named. Just something to perhaps consider?
Based on which are and aren't, it looks like just a case of slightly differing formats and I'm hoping that the appropriate regex (I'm assuming that's how it's done) can be updated/refined in a future version to handle these additional cases...
Example working formats: "...(S1 Ep1/8)..." & "...(S1 Ep1)..." & "...S1 Ep1"
Example formats not being recognised:
"...(S1 Ep 1)..." - e.g. used by '5USA' channel
"...(S1, ep 1)..." - e.g. used by 'Pick' & 'Sky Arts' channels
"...S1, ep 1/8..." - e.g. used by 'Forces TV' channel (at least 1 series but not sure if it's common - just found it when I searched on "S1," to find examples!)
It may just be the space before the episode number throwing it off (only factor common to them all), though the comma after the series number &/or the lowercase "ep" might also need checking! (If it isn't simply this difference in formats, I'm happy to provide fuller source data or whatever else might be needed for diagnosis.)
Thanks,
Steve
P.S. - there's also lots and lots of programmes (e.g. most BBC series) that only have episode numbers but no series info, where the description starts (e.g.) "1/8. Blah blah..."
Obviously without any series data I'm not sure it makes sense to pull out the numbering data in these cases - but I wonder if using this to create a *subtitle* of "Episode 1" (assuming no subtitle already exists) would be useful in terms of how it would be presented in the NPVR interface and/or in other front-end clients (e.g. Kodi/Plex/whatever) that people use to build libraries from the NPVR recording folder(s)? It might also help with any matching to online sources like TMDB, as this is commonly used where episodes are not individually named. Just something to perhaps consider?