Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Finally something that works. How can I install and use this? thank you #78

Open
FurkanGozukara opened this issue Oct 17, 2022 · 1 comment

Comments

@FurkanGozukara
Copy link

FurkanGozukara commented Oct 17, 2022

Hello. I want to punctate big chunk of text. E.g. like below. How can I do that? Thank you

Could you write a simple python code to punctate text below?

I can install your necessary code via a pip command if you provide me. I am using Windows 10.

The text is from my lecture video (https://www.youtube.com/watch?v=_nKwisL8dTs) which I am trying to generate subtitles. Whisper does very well but fails to punctate at some parts.

okay sorry about this confusion what I did is when I have forgotten to unpause the video is simply I have coded a test button and the test button is using our original static file cmd and gif file cmd and I also fixed something in gif file cmd which is I have removed the loses command because it was giving an error now they are working I am using a wait for exists so let me show you how it works okay okay let me start test so the first process is started it is taking some time because that image is pretty big then it is starting the other one and now they are generated okay so you see original file is 820 kilobytes and let's see how much did we gain okay so 820 minus 572 over 820 you see 30 percent gain we have in this file it is significant and it has zero difference how can I be so sure about that we can be sure about that with a comparison okay so I am going to only make a single line of single pixel of difference here on this web p file and I will save it as a test on my desktop here as a png so I will name it as test to png okay and then I will save my original file as test png on the desktop here then I will use online comparison website let me show you compare image difference okay there are several pages for that so first try with diff checker diff checker is awesome website believe me okay so when I see check the difference there is a single line of difference here on this image so how they achieve this I wonder yeah so here when I hover and when I zoom in okay like this you see there is a single line a single pixel of difference here and no other differences it is exactly same and let's compare with another website okay online diff so first image and the second image so I will make the fuzziness zero and it will show as a red color okay so on this image there is a single pixel difference here which is what I have made and there is no other red dot okay so I can copy this image to zoom in so you see there is no other red dot because they are exactly same except the single line single pixel that I have made myself so basically we gain 35 percent 30 percent size in this image and on this gift image we gained from minus to 26.9 over this 35 percent you see with on the gift image we gain 35 percent and let's test if they are working or not so this is our WebP GIF and this is our iponic GIF this is original GIF file and this is WebP file they are looking pretty much same to me we can also use some online websites online GIF to WebP there is one website which I have found working very well this one or yeah let's try this I think it was this one so let's open our debug test so here our GIF upload it then you see there is losing compression mixed compression I unmark them and convert the WebP so this website generated a little bit higher kilobyte because probably it is not using the best compression and that's it okay so we are able to properly convert GIF and static PNG and probably GPX as well we haven't tested GPX so let's also test the GPX for example yeah this wallpaper it's pretty big so it will probably take a lot of time okay let's copy and paste this okay so I will remove this probably we don't even need it right now what is the file name it is this I am not sure if it if it can produce better than GPX because GPX is already losing compression as you know okay let's try it so all processes started at the same time because we are not waiting them and they are running right now as they get completed it will close the window and why it takes so long is that we are using the best possible algorithm and let's see the output okay so yes the WebP file is bigger than the original GPX it is because GPX is already losing and when I save this GPX as a PNG let's see the size okay size of the PNG is this we can of course optimize it a little bit more with PNG out win and I am pretty sure there will be still significant difference between PNG version and WebP version this is a software that I have purchased to optimize my PNG files previously but it is not anymore necessary because now we can use WebP format which is much better format okay so this software is single threaded on a single image so it is taking some time it has so many passes okay so the optimized PNG file is 2.53 megabytes and minus 1.52 megabytes over or not this one actually since GPX files are already losing we shouldn't convert them to WebP probably we we cannot we cannot achieve same quality I wonder if there is an losing but no point of converting GPX into WebP let me check that first okay okay same quality for GPX I think we need to have some losing compression probably for GPX compression we need to use some other methodology so let's see which which options we can use okay let's see okay so there is version loses near loses int so we can use near loses for GPX I think okay so which which option should we use I'm not sure I think I will try near loses yeah let's try it with so for that I'm going to have another file it will be for GPX for GPX I'm going to remove loses and change it with near loses with zero and I think I have to remove z9 as well so yeah I have to remove z9 okay let's try this way for GPX okay and this is the file name okay let's test GPX SR or a GPX and let's comment out this is and let's make it like this yeah okay let's see what kind of results we are going to get with GPX command okay so it is done oh wow now we have a better result than original GPX so let's compare two images quality of course I am not expecting them to be same yeah I can see the difference there is already some difference but I am not sure if we have lost some quality or not yeah we have lost some quality as you can see definitely and it is not small as well okay I wonder if it is possible to compress GPX losing quality is this even possible I'm not sure compress GPX okay okay

Also I tested your demo and it did a decent output as below

Result:
Okay, sorry about this confusion, what I did is when I have forgotten to unpause. The video is simply, I have coded a test button and the test button is using our original static file, cmd and gif file cmd, and I also fixed something in gif file cmd, which is, I have removed the loses command because it was giving an error. Now they are working. I am using a wait for exists, so let me show you how it works. Okay, okay, let me start test, so the first process is started. It is taking some time because that image is pretty big. Then it is starting. The other one, and now they are generated okay, so you see original file is 820 kilobytes and let's see how much did we gain okay, so 820 minus 572. Over 820. You see 30 percent gain. We have in this file. It is significant and it has zero difference. How can I be so sure about that? We can be sure about that with a comparison. Okay, so I am going to only make a single line of single pixel of difference here on this web p file, and I will save it as a test on my desktop here as a png. So I will name it as test to png. Okay and then I will save my original file as test png on the desktop here, then I will use online comparison website. Let me show you compare image difference: okay, there are several pages for that. So first, try with diff checker diff checker is awesome website. Believe me, okay, so when I see check the difference, there is a single line of difference here on this image, so how they achieve this. I wonder yeah so here when I hover and when I zoom in okay. Like this, you see there is a single line, a single pixel of difference here and no other differences. It is exactly same and let's compare with another website: okay, online diff, so first image and the second image, so I will make the fuzziness zero and it will show as a red color, okay. So on this image there is a single pixel difference here, which is what I have made, and there is no other red dot. Okay, so I can copy this image to zoom in so you see there is no other red dot because they are exactly same except the single line, single pixel that I have made myself. So basically, we gain 35 percent 30 percent size in this image and on this gift image we gained from minus to 26.9 over this 35 percent. You see with on the gift image we gain 35 percent and let's test if they are working or not. So this is our WebP GIF and this is our iponic GIF. This is original GIF file and this is WebP file. They are looking pretty much same to me. We can also use some online websites online GIF to WebP. There is one website which I have found working very well, this one or yeah. Let'S try this. I think it was this one. So let's open our debug test, so here our GIF upload it then you see there is losing compression mixed compression. I unmark them and convert the WebP, so this website generated a little bit higher kilobyte because probably it is not using the best compression and that's it okay, so we are able to properly convert GIF and static, PNG and probably GPX as well. We haven't tested GPX, so let's also test the GPX, for example yeah this wallpaper, it's pretty big, so it will probably take a lot of time. Okay, let's copy and paste this okay, so I will remove this. Probably we don't even need it right now. What is the file name? It is this. I am not sure if it if it can produce better than GPX, because GPX is already losing compression. As you know, okay, let's try it. So all processes started at the same time, because we are not waiting them and they are running right now, as they get completed, it will close the window and why it takes so long is that we are using the best possible algorithm and, let's see the output. Okay, so yes, the WebP file is bigger than the original GPX. It is because GPX is already losing, and when I save this GPX as a PNG, let's see the size okay size of the PNG is this. We can, of course, optimize it a little bit more with PNG out win and I am pretty sure there will be still significant difference between PNG version and WebP version. This is a software that I have purchased to optimize my PNG files previously, but it is not anymore necessary because now we can use WebP format, which is much better format. Okay, so this software is single threaded on a single image, so it is taking some time it has so many passes. Okay, so the optimized PNG file is 2.53 megabytes and minus 1.52 megabytes over or not this one. Actually, since GPX files are already losing, we shouldn't convert them to WebP. Probably we we cannot. We cannot achieve same quality. I wonder if there is an losing but no point of converting GPX into WebP. Let me check that. First, okay, okay, same quality for GPX. I think we need to have some losing compression, probably for GPX compression. We need to use some other methodology, so, let's see which which options we can use. Okay, let's see. Okay, so there is version loses near loses int, so we can use near loses for GPX. I think okay, so, which which option should we use? I'M not sure I think I will try near loses yeah, let's try it with so. For that I'm going to have another file, it will be for GPX for GPX, I'm going to remove, loses and change it with near loses with zero, and I think I have to remove z9 as well, so yeah I have to remove z9. Okay. Let'S try this way for GPX okay, and this is the file name: okay, let's test GPX SR or a GPX, and let's comment out this is, and let's make it like this yeah. Okay, let's see what kind of results we are going to get with: GPX command. Okay, so it is done. Oh wow now we have a better result than original GPX. So let's compare two images quality. Of course I am not expecting them to be same yeah. I can see the difference. There is already some difference, but I am not sure if we have lost some quality or not yeah, we have lost some quality, as you can see. Definitely, and it is not small as well okay, I wonder if it is possible to compress GPX losing quality. Is this even possible, I'm not sure, compress, GPX, okay, okay,

@FurkanGozukara
Copy link
Author

A example like this i need

from fastpunct import FastPunct
# The default language is 'english'
fastpunct = FastPunct()

f = open("TextFile1.txt", "r")

text_val = f.read()

result = fastpunct.punct(text_val, correct=True)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant