Announcement

Collapse
No announcement yet.

Max subtitles duplicating

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

    Max Max subtitles duplicating

    'm not sure when this started as I haven't grabbed from Max in a few weeks but I noticed it over the weekend. No matter what I download, movie or TV series, it has the subtitles duplicated in the SRT files. It's almost like it's trying to grab a regular and the cc subtitles and putting them into 1 file. However when given the option of what to download, there is one option to select. I'm getting the same results from 6.2.1.2 and 6.2.1.0. And unfortunately due to formatting differences in the lines cannot be easily be removed as duplications. Does anyone know how far old of a version of SF may be able to download correctly? Or I'm guessing this is more of a Max issue.

    Click image for larger version

Name:	cado.jpg
Views:	235
Size:	82.4 KB
ID:	457013
    Attached Files

    #2
    I've seen that since really early versions on and off. Enough that I use SubetitleEdit to fix it. This is the command I run on each SRT in my post processing script. {0} would be the path to your subtitle file. Just use /MergeSameTexts if you only want to fix the doubles.

    SubtitleEdit.exe /Convert "{0}" SubRip /MergeSameTexts /RemoveTextForHI /FixCommonErrors /RedoCasing /overwrite

    Comment


      #3
      Additional info:

      In USA only English CC subs offered for your title..
      .. and the srt is fine (without doubled text) and so no Max or SF problem

      This happen with mpc-hc and mpc-be but never in vlc player ..
      .. so it's a player problem​

      Comment


        #4
        I tried the Merge Lines with Same texts and it did help with a lot of duplicates, however sometimes it fails because the lines will be "[constable] What are you, <br /> some kind of an author?" vs "What are you, some kind of an author?​​" so the <br /> throws it off, or it may be "-What are you, some kind of an author?​​" so the - messes it up. I also can't do a remove same time codes because the are slightly off at 8:14.911 vs 8:15.245.

        I tried the extra commands and it did clear up a lot of the issues so thanks for that! However there are still duplicates due to the change in punctuations sometimes as seen below. Any suggestion for that?

        Click image for larger version  Name:	lot.jpg Views:	0 Size:	92.1 KB ID:	457022

        Comment


          #5
          Originally posted by Germania View Post
          Additional info:

          In USA only English CC subs offered for your title..
          .. and the srt is fine (without doubled text) and so no Max or SF problem

          This happen with mpc-hc and mpc-be but never in vlc player ..
          .. so it's a player problem​
          It is not a player problem. You can see it right here in the SRT file the lines are duplicated at line 270 and line 573. And as you can see my post above even on Salem's Lot lines are duplicated with slightly different punctuations.

          Click image for larger version  Name:	cado2.jpg Views:	1 Size:	122.5 KB ID:	457028

          Click image for larger version  Name:	cado3.jpg Views:	1 Size:	122.6 KB ID:	457029

          And here we are in Plex showing the duplicated subtitles. I'm not sure why VLC doesn't display them, but they are definitely there.

          Click image for larger version

Name:	plex.jpg
Views:	216
Size:	68.9 KB
ID:	457031
          Last edited by jinxed; 01-06-2025, 05:23 PM.

          Comment


            #6
            Originally posted by jinxed View Post
            It is not a player problem.
            Yes, I was probably premature, because the duplicates are not listed
            one after the other - but much later (not sorted chronologically)

            try this - here

            Comment


              #7
              Well, that's odd. usually when I see duplicates in the srt file, they are on the two consecutive lines. not separated by a few hundred. But no, if there is any difference, like punctuation or line breaks, it won't merge it.

              Anyway, I downloaded Salem's Lot just now with the settings bellow on 6.2.1.2 and no issues with duplicate lines that I can see. Went the time stamp in your fist picture and it's fine. In the file I downloaded prior to processing has 2802 lines.

              You might try clearing your SF temp folder. Path can be found in the general settings of SF.

              Click image for larger version

Name:	image.png
Views:	167
Size:	31.6 KB
ID:	457041

              Click image for larger version

Name:	image.png
Views:	163
Size:	110.6 KB
ID:	457042​​

              Comment


                #8
                Originally posted by AGuyWithAComputer View Post
                Well, that's odd. usually when I see duplicates in the srt file, they are on the two consecutive lines. not separated by a few hundred. But no, if there is any difference, like punctuation or line breaks, it won't merge it.

                Anyway, I downloaded Salem's Lot just now with the settings bellow on 6.2.1.2 and no issues with duplicate lines that I can see. Went the time stamp in your fist picture and it's fine. In the file I downloaded prior to processing has 2802 lines.

                You might try clearing your SF temp folder. Path can be found in the general settings of SF.
                I cleared the temp folder. Closed out SF and reopened. I deleted Salem's Lot and redownloaded it. Still messed up. I even tried downloading a higher quality one, same result. As you can see the time stamps are a fraction of a second off from each other and they are formatted differently with "Uncle!" and "Uncle,"

                Click image for larger version  Name:	lot2.jpg Views:	4 Size:	122.7 KB ID:	457045 Click image for larger version  Name:	lot1.jpg Views:	5 Size:	108.8 KB ID:	457044

                And here is an even weirder one. Same spot in the file, nearly identical time code, but one has the subtitle as saying [Parkins] and the other says [constable].

                Click image for larger version  Name:	lot4.jpg Views:	0 Size:	125.3 KB ID:	457047 Click image for larger version  Name:	lot3.jpg Views:	4 Size:	114.8 KB ID:	457046

                Comment


                  #9
                  Originally posted by AGuyWithAComputer View Post
                  ...
                  Anyway, I downloaded Salem's Lot just now with the settings bellow on 6.2.1.2 and no issues with duplicate lines that I can see. Went the time stamp in your fist picture and it's fine. In the file I downloaded prior to processing has 2802 lines.
                  ​​
                  ​It happens that I also have the Salem's Lot downloaded with a date of 2024.X.8 - so with some older SF. English CC srt file also has exactly 2802 lines, 195 696 bytes. The last few lines from your screenshot are identical.

                  The example conflict in this file is on lines 240:
                  Code:
                  00:16:22,190 --> 00:16:25,026
                  We take people at their word
                  here in the Lot.​
                  and 468:
                  Code:
                  468
                  00:16:22,607 --> 00:16:24,609
                  We take people at their word
                  here in the Lot.​
                  ... among many other.

                  Comment


                    #10
                    Originally posted by jinxed View Post
                    I cleared the temp folder. Closed out SF and reopened. I deleted Salem's Lot and redownloaded it. Still messed up. I even tried downloading a higher quality one, same result. As you can see the time stamps are a fraction of a second off from each other and they are formatted differently with "Uncle!" and "Uncle,"
                    ...
                    And here is an even weirder one. Same spot in the file, nearly identical time code, but one has the subtitle as saying [Parkins] and the other says [constable].
                    ...
                    ​I can also confirm all of the above.

                    So the problem exists since at least 3 months.

                    Comment


                      #11
                      So here's something weirder still. I went back and checked Doom Patrol which was able to grab a few weeks ago and the episodes I spot checked are fine. I also checked the first two episodes of Creature Commandos I grabbed a couple weeks ago and they are fine. But I grabbed Creature Commandos again today to get the updated episodes, and they are messed up with the duplication.

                      So it seems to be a communication issue with SF and Max. Or a Max issue. Either way, it may be hit or miss, or at least inconsistent. Which means, fixing it won't be easy.

                      Comment


                        #12
                        Originally posted by jinxed View Post
                        Which means, fixing it won't be easy.
                        It's not exactly easy vs hard.

                        More like another hassle to go through while dealing with SF. There's never enough of these.

                        It is not a typical problem - so that there isn't any tool to repair such subtitles.

                        However, writing the script to detect that situation is probably trivial (the trigger being the time going 'backwards' substantially (like over a minute) between two consecutive lines), and a script to fix the issue (by deleting one of the 'variants' for each repeating 'chunk' of lines) would be relatively easy.

                        Maybe I'm missing something that would complicate the process - but hopefully I'm not.

                        Comment


                          #13
                          Originally posted by ms-dfav View Post
                          It's not exactly easy vs hard.

                          More like another hassle to go through while dealing with SF. There's never enough of these.

                          It is not a typical problem - so that there isn't any tool to repair such subtitles.

                          However, writing the script to detect that situation is probably trivial (the trigger being the time going 'backwards' substantially (like over a minute) between two consecutive lines), and a script to fix the issue (by deleting one of the 'variants' for each repeating 'chunk' of lines) would be relatively easy.

                          Maybe I'm missing something that would complicate the process - but hopefully I'm not.
                          see this thread post #2 - https://forum.dvdfab.cn/forum/stream...ithin-srt-file
                          you can use subtitle edit (free tool) to get rid of duplicate subtitles.

                          Comment


                            #14
                            Originally posted by october262 View Post

                            see this thread post #2 - https://forum.dvdfab.cn/forum/stream...ithin-srt-file
                            you can use subtitle edit (free tool) to get rid of duplicate subtitles.
                            This is not enough, as they are not simply duplicates, as stated above.

                            Comment


                              #15
                              The causal problem is that the mpd contains 2x the Label en-US CC ..
                              .. with "Salem's Lot" there is one for full Movie and one for the first additional 15 seconds (The "Skip" part)

                              Click image for larger version  Name:	max_mpd_en-us_doubled.jpg Views:	0 Size:	8.4 KB ID:	457065

                              But t0 has a duration of only 15,015 seconds (1 Segment) - auto adding t13 (1:53:47) is wrong

                              SF (and some other tools) dl segments for both (with ignoring duration for t0) and add this together.

                              The right subs (full CC - like in Max player) are here

                              Comment

                              Working...
                              X