Current Path: > > > bin >
Operation : Linux premium107.web-hosting.com 4.18.0-553.44.1.lve.el8.x86_64 #1 SMP Thu Mar 13 14:29:12 UTC 2025 x86_64 Software : Apache Server IP : 198.54.126.246 | Your IP: 216.73.216.181 Domains : 1034 Domain(s) Permission : [ 0555 ]
| Name | Type | Size | Last Modified | Actions |
|---|---|---|---|---|
| 7za | File | 46 bytes | October 04 2024 01:50:29. | |
| GET | File | 16217 bytes | April 22 2022 20:13:06. | |
| File | 418704 bytes | October 13 2019 00:19:50. | ||
| [ | File | 54864 bytes | December 18 2025 13:54:40. | |
| aclocal | File | 36478 bytes | October 14 2023 20:00:43. | |
| aclocal-1.16 | File | 36478 bytes | October 14 2023 20:00:43. | |
| addr2line | File | 34224 bytes | December 18 2025 14:21:35. | |
| animate | File | 12128 bytes | April 01 2025 13:15:58. | |
| ar | File | 63448 bytes | December 18 2025 14:21:35. | |
| arch | File | 38232 bytes | December 18 2025 13:54:40. | |
| arpaname | File | 12104 bytes | November 06 2025 07:45:52. | |
| as | File | 911264 bytes | December 18 2025 14:21:35. | |
| aspell | File | 163328 bytes | April 18 2022 15:10:53. | |
| at | File | 1276 bytes | October 10 2022 10:23:17. | |
| atq | File | 1278 bytes | October 10 2022 10:23:17. | |
| atrm | File | 1280 bytes | October 10 2022 10:23:17. | |
| autoconf | File | 14768 bytes | August 13 2024 19:30:48. | |
| autoheader | File | 8534 bytes | August 13 2024 19:30:48. | |
| autom4te | File | 32181 bytes | August 13 2024 19:30:48. | |
| automake | File | 257949 bytes | October 14 2023 20:00:43. | |
| automake-1.16 | File | 257949 bytes | October 14 2023 20:00:43. | |
| autoreconf | File | 21066 bytes | August 13 2024 19:30:48. | |
| autoscan | File | 17124 bytes | August 13 2024 19:30:48. | |
| autoupdate | File | 33872 bytes | August 13 2024 19:30:48. | |
| awk | File | 685848 bytes | April 18 2022 15:56:05. | |
| b2sum | File | 59072 bytes | December 18 2025 13:54:40. | |
| base32 | File | 42464 bytes | December 18 2025 13:54:40. | |
| base64 | File | 42488 bytes | December 18 2025 13:54:40. | |
| basename | File | 38312 bytes | December 18 2025 13:54:40. | |
| bash | File | 1154680 bytes | August 26 2025 08:48:39. | |
| bashbug-64 | File | 7348 bytes | August 26 2025 08:48:31. | |
| batch | File | 137 bytes | October 10 2022 10:23:17. | |
| bison | File | 448224 bytes | October 12 2019 12:28:39. | |
| bunzip2 | File | 37744 bytes | January 28 2025 01:38:59. | |
| bzcat | File | 37744 bytes | January 28 2025 01:38:59. | |
| bzcmp | File | 2128 bytes | January 28 2025 01:38:58. | |
| bzdiff | File | 2128 bytes | January 28 2025 01:38:58. | |
| bzgrep | File | 1677 bytes | January 28 2025 01:38:58. | |
| bzip2 | File | 37744 bytes | January 28 2025 01:38:59. | |
| bzip2recover | File | 16832 bytes | January 28 2025 01:38:59. | |
| bzless | File | 1259 bytes | January 28 2025 01:38:58. | |
| bzmore | File | 1259 bytes | January 28 2025 01:38:58. | |
| c++ | File | 1266616 bytes | August 26 2025 09:47:24. | |
| c++filt | File | 29584 bytes | December 18 2025 14:21:35. | |
| c89 | File | 224 bytes | August 26 2025 09:45:19. | |
| c99 | File | 215 bytes | August 26 2025 09:45:19. | |
| cagefs_enter.proxied | File | 1296 bytes | January 20 2026 13:58:37. | |
| cal | File | 67560 bytes | February 04 2026 20:18:19. | |
| captoinfo | File | 87360 bytes | October 14 2023 18:54:49. | |
| cat | File | 38360 bytes | December 18 2025 13:54:40. | |
| catchsegv | File | 3283 bytes | December 08 2025 09:36:48. | |
| cc | File | 1266576 bytes | August 26 2025 09:47:26. | |
| chgrp | File | 67864 bytes | December 18 2025 13:54:40. | |
| chmod | File | 63688 bytes | December 18 2025 13:54:40. | |
| chown | File | 71976 bytes | December 18 2025 13:54:40. | |
| chrt | File | 38080 bytes | February 04 2026 20:18:19. | |
| cksum | File | 38288 bytes | December 18 2025 13:54:40. | |
| cldetect | File | 10593 bytes | December 24 2025 15:45:26. | |
| clear | File | 12840 bytes | October 14 2023 18:54:49. | |
| clusterdb | File | 71920 bytes | February 27 2024 08:25:00. | |
| cmp | File | 106248 bytes | May 04 2020 15:15:14. | |
| col | File | 29704 bytes | February 04 2026 20:18:19. | |
| colcrt | File | 16872 bytes | February 04 2026 20:18:19. | |
| colrm | File | 25480 bytes | February 04 2026 20:18:19. | |
| column | File | 50656 bytes | February 04 2026 20:18:19. | |
| comm | File | 42560 bytes | December 18 2025 13:54:40. | |
| compare | File | 12136 bytes | April 01 2025 13:15:58. | |
| composite | File | 12120 bytes | April 01 2025 13:15:58. | |
| conjure | File | 12120 bytes | April 01 2025 13:15:58. | |
| convert | File | 12120 bytes | April 01 2025 13:15:58. | |
| cp | File | 151568 bytes | December 18 2025 13:54:40. | |
| cpan | File | 8370 bytes | June 03 2025 14:32:21. | |
| cpp | File | 1266552 bytes | August 26 2025 09:47:25. | |
| createdb | File | 71904 bytes | February 27 2024 08:25:00. | |
| createuser | File | 76416 bytes | February 27 2024 08:25:00. | |
| crontab | File | 1524 bytes | April 06 2024 11:40:05. | |
| crontab.cagefs | File | 55456 bytes | January 20 2026 14:24:20. | |
| csplit | File | 54968 bytes | December 18 2025 13:54:40. | |
| curl | File | 235600 bytes | December 18 2025 14:22:12. | |
| cut | File | 50704 bytes | December 18 2025 13:54:40. | |
| date | File | 108496 bytes | December 18 2025 13:54:40. | |
| dd | File | 79840 bytes | December 18 2025 13:54:40. | |
| delv | File | 43480 bytes | November 06 2025 07:45:52. | |
| df | File | 93272 bytes | December 18 2025 13:54:40. | |
| diff | File | 274440 bytes | May 04 2020 15:15:14. | |
| diff3 | File | 131688 bytes | May 04 2020 15:15:14. | |
| dig | File | 166072 bytes | November 06 2025 07:45:52. | |
| dir | File | 143256 bytes | December 18 2025 13:54:40. | |
| dircolors | File | 50744 bytes | December 18 2025 13:54:40. | |
| dirname | File | 34160 bytes | December 18 2025 13:54:40. | |
| display | File | 12128 bytes | April 01 2025 13:15:58. | |
| dnstap-read | File | 20920 bytes | November 06 2025 07:45:52. | |
| dropdb | File | 67608 bytes | February 27 2024 08:25:00. | |
| dropuser | File | 67576 bytes | February 27 2024 08:25:00. | |
| du | File | 109568 bytes | December 18 2025 13:54:40. | |
| easy_install-3 | File | bytes | January 01 1970 00:00:00. | |
| echo | File | 38248 bytes | December 18 2025 13:54:40. | |
| egrep | File | 28 bytes | October 11 2019 15:15:56. | |
| enc2xs | File | 41958 bytes | October 13 2019 08:46:10. | |
| enchant | File | 21584 bytes | October 23 2019 20:03:55. | |
| enchant-lsmod | File | 13408 bytes | October 23 2019 20:03:55. | |
| env | File | 42344 bytes | December 18 2025 13:54:40. | |
| eps2eps | File | 639 bytes | June 03 2025 02:48:11. | |
| eqn | File | 237728 bytes | October 13 2019 14:29:41. | |
| ex | File | 1180352 bytes | October 12 2025 00:23:15. | |
| expand | File | 42592 bytes | December 18 2025 13:54:40. | |
| expr | File | 50760 bytes | December 18 2025 13:54:40. | |
| factor | File | 88032 bytes | December 18 2025 13:54:40. | |
| false | File | 34120 bytes | December 18 2025 13:54:40. | |
| fc-cache | File | 132 bytes | August 16 2021 11:04:22. | |
| fc-cache-64 | File | 20840 bytes | October 09 2021 09:02:09. | |
| fc-cat | File | 16744 bytes | October 09 2021 09:02:09. | |
| fc-conflist | File | 12544 bytes | October 09 2021 09:02:09. | |
| fc-list | File | 12544 bytes | October 09 2021 09:02:09. | |
| fc-match | File | 16648 bytes | October 09 2021 09:02:09. | |
| fc-pattern | File | 12552 bytes | October 09 2021 09:02:09. | |
| fc-query | File | 12536 bytes | October 09 2021 09:02:09. | |
| fc-scan | File | 12552 bytes | October 09 2021 09:02:09. | |
| fc-validate | File | 16648 bytes | October 09 2021 09:02:09. | |
| fgrep | File | 28 bytes | October 11 2019 15:15:56. | |
| file | File | 25280 bytes | October 07 2025 06:30:49. | |
| find | File | 228632 bytes | December 18 2025 14:57:01. | |
| flex | File | 438728 bytes | October 12 2019 12:33:17. | |
| flex++ | File | 438728 bytes | October 12 2019 12:33:17. | |
| flock | File | 33992 bytes | February 04 2026 20:18:19. | |
| fmt | File | 46584 bytes | December 18 2025 13:54:40. | |
| fold | File | 42408 bytes | December 18 2025 13:54:40. | |
| free | File | 21288 bytes | October 14 2023 20:31:02. | |
| freetype-config | File | 4416 bytes | March 31 2025 19:37:30. | |
| funzip | File | 37504 bytes | June 03 2025 02:10:12. | |
| g++ | File | 1266616 bytes | August 26 2025 09:47:24. | |
| gawk | File | 685848 bytes | April 18 2022 15:56:05. | |
| gcc | File | 1266576 bytes | August 26 2025 09:47:26. | |
| gcc-ar | File | 37536 bytes | August 26 2025 09:47:26. | |
| gcc-nm | File | 37536 bytes | August 26 2025 09:47:26. | |
| gcc-ranlib | File | 37536 bytes | August 26 2025 09:47:26. | |
| gcov | File | 1377408 bytes | August 26 2025 09:47:26. | |
| gcov-dump | File | 584664 bytes | August 26 2025 09:47:26. | |
| gcov-tool | File | 622360 bytes | August 26 2025 09:47:26. | |
| gem | File | 542 bytes | May 14 2025 12:40:38. | |
| gencat | File | 25432 bytes | December 08 2025 09:43:19. | |
| geoiplookup | File | 22416 bytes | November 28 2019 22:06:01. | |
| geoiplookup6 | File | 22168 bytes | November 28 2019 22:06:01. | |
| geqn | File | 237728 bytes | October 13 2019 14:29:41. | |
| getconf | File | 33240 bytes | December 08 2025 09:43:19. | |
| getent | File | 33920 bytes | December 08 2025 09:43:19. | |
| getopt | File | 21024 bytes | February 04 2026 20:18:19. | |
| ghostscript | File | 12648 bytes | June 03 2025 02:48:22. | |
| git | File | 3845928 bytes | July 23 2025 06:59:57. | |
| git-receive-pack | File | 3845928 bytes | July 23 2025 06:59:57. | |
| git-shell | File | 2236688 bytes | July 23 2025 06:59:57. | |
| git-upload-archive | File | 3845928 bytes | July 23 2025 06:59:57. | |
| git-upload-pack | File | 3845928 bytes | July 23 2025 06:59:57. | |
| gm | File | 8008 bytes | March 28 2022 15:50:37. | |
| gmake | File | 240968 bytes | April 18 2022 16:38:34. | |
| gneqn | File | 908 bytes | October 13 2019 14:29:37. | |
| gnroff | File | 3312 bytes | October 13 2019 14:29:39. | |
| gpg | File | 1090344 bytes | January 16 2026 01:48:25. | |
| gpg-agent | File | 429360 bytes | January 16 2026 01:48:25. | |
| gpg-error | File | 34976 bytes | October 12 2019 12:20:46. | |
| gpg-zip | File | 3525 bytes | January 16 2026 01:48:21. | |
| gpgsplit | File | 89112 bytes | January 16 2026 01:48:25. | |
| gpgv | File | 462344 bytes | January 16 2026 01:48:25. | |
| gpic | File | 300896 bytes | October 13 2019 14:29:41. | |
| gprof | File | 105832 bytes | December 18 2025 14:21:35. | |
| grep | File | 198280 bytes | October 11 2019 15:15:57. | |
| groff | File | 127920 bytes | October 13 2019 14:29:41. | |
| grops | File | 195728 bytes | October 13 2019 14:29:41. | |
| grotty | File | 145304 bytes | October 13 2019 14:29:41. | |
| groups | File | 38288 bytes | December 18 2025 13:54:40. | |
| gs | File | 12648 bytes | June 03 2025 02:48:22. | |
| gsnd | File | 277 bytes | June 03 2025 02:48:11. | |
| gsoelim | File | 43576 bytes | October 13 2019 14:29:41. | |
| gtar | File | 459768 bytes | August 26 2025 08:57:37. | |
| gtbl | File | 158320 bytes | October 13 2019 14:29:41. | |
| gtroff | File | 824344 bytes | October 13 2019 14:29:41. | |
| gunzip | File | 2345 bytes | April 27 2022 05:49:28. | |
| gzexe | File | 6375 bytes | April 27 2022 05:49:28. | |
| gzip | File | 96944 bytes | April 27 2022 05:49:29. | |
| h2ph | File | 29382 bytes | July 28 2025 08:08:32. | |
| h2xs | File | 60866 bytes | July 28 2025 08:07:09. | |
| head | File | 46592 bytes | December 18 2025 13:54:40. | |
| hexdump | File | 58888 bytes | February 04 2026 20:18:19. | |
| host | File | 145704 bytes | November 06 2025 07:45:52. | |
| hostid | File | 34136 bytes | December 18 2025 13:54:40. | |
| hostname | File | 21664 bytes | October 11 2019 13:06:51. | |
| hunspell | File | 148168 bytes | October 13 2019 08:33:15. | |
| iconv | File | 62904 bytes | December 08 2025 09:43:19. | |
| id | File | 46528 bytes | December 18 2025 13:54:40. | |
| identify | File | 12128 bytes | April 01 2025 13:15:58. | |
| idn | File | 40352 bytes | October 13 2019 16:55:35. | |
| ifnames | File | 4128 bytes | August 13 2024 19:30:48. | |
| import | File | 12120 bytes | April 01 2025 13:15:58. | |
| infocmp | File | 62512 bytes | October 14 2023 18:54:49. | |
| infotocap | File | 87360 bytes | October 14 2023 18:54:49. | |
| install | File | 159912 bytes | December 18 2025 13:54:40. | |
| instmodsh | File | 4194 bytes | October 13 2019 08:55:21. | |
| ionice | File | 29680 bytes | February 04 2026 20:18:19. | |
| ipcrm | File | 29688 bytes | February 04 2026 20:18:19. | |
| ipcs | File | 54680 bytes | February 04 2026 20:18:19. | |
| isosize | File | 25480 bytes | February 04 2026 20:18:19. | |
| ispell | File | 988 bytes | April 18 2022 15:10:51. | |
| join | File | 54984 bytes | December 18 2025 13:54:40. | |
| kill | File | 38176 bytes | February 04 2026 20:18:19. | |
| ld | File | 1789080 bytes | December 18 2025 14:21:35. | |
| ld.bfd | File | 1789080 bytes | December 18 2025 14:21:35. | |
| ldd | File | 5441 bytes | December 08 2025 09:36:51. | |
| less | File | 177928 bytes | July 02 2024 20:10:37. | |
| lessecho | File | 12696 bytes | July 02 2024 20:10:37. | |
| lesskey | File | 22520 bytes | July 02 2024 20:10:37. | |
| lesspipe.sh | File | 3143 bytes | July 02 2024 19:57:10. | |
| lex | File | 438728 bytes | October 12 2019 12:33:17. | |
| libnetcfg | File | 15775 bytes | July 28 2025 08:08:32. | |
| libtool | File | 367724 bytes | October 11 2019 14:55:28. | |
| libtoolize | File | 129197 bytes | October 11 2019 14:55:28. | |
| link | File | 34136 bytes | December 18 2025 13:54:40. | |
| ln | File | 72192 bytes | December 18 2025 13:54:40. | |
| locale | File | 57800 bytes | December 08 2025 09:43:19. | |
| localedef | File | 314848 bytes | December 08 2025 09:43:19. | |
| logger | File | 51184 bytes | February 04 2026 20:18:19. | |
| login | File | 41944 bytes | February 04 2026 20:18:19. | |
| logname | File | 34144 bytes | December 18 2025 13:54:40. | |
| look | File | 16856 bytes | February 04 2026 20:18:19. | |
| ls | File | 143248 bytes | December 18 2025 13:54:40. | |
| lynx | File | 1928720 bytes | April 18 2022 21:01:05. | |
| m4 | File | 190016 bytes | October 11 2019 14:41:40. | |
| File | 418704 bytes | October 13 2019 00:19:50. | ||
| mailx | File | 418704 bytes | October 13 2019 00:19:50. | |
| make | File | 240968 bytes | April 18 2022 16:38:34. | |
| make-dummy-cert | File | 610 bytes | January 08 2026 13:57:14. | |
| mariadb | File | 5605864 bytes | November 28 2025 17:50:10. | |
| mariadb-access | File | 112112 bytes | November 28 2025 17:50:11. | |
| mariadb-admin | File | 5126776 bytes | November 28 2025 17:50:10. | |
| mariadb-binlog | File | 5411576 bytes | November 28 2025 17:50:10. | |
| mariadb-check | File | 5123960 bytes | November 28 2025 17:50:10. | |
| mariadb-conv | File | 4815712 bytes | November 28 2025 17:50:10. | |
| mariadb-convert-table-format | File | 4386 bytes | November 28 2025 17:50:11. | |
| mariadb-dump | File | 5229744 bytes | November 28 2025 17:50:10. | |
| mariadb-dumpslow | File | 8382 bytes | November 28 2025 17:50:11. | |
| mariadb-embedded | File | 25792464 bytes | November 28 2025 17:50:11. | |
| mariadb-find-rows | File | 3433 bytes | November 28 2025 17:50:11. | |
| mariadb-hotcopy | File | 35497 bytes | November 28 2025 17:50:11. | |
| mariadb-import | File | 5114744 bytes | November 28 2025 17:50:10. | |
| mariadb-plugin | File | 4794424 bytes | November 28 2025 17:50:10. | |
| mariadb-secure-installation | File | 13993 bytes | November 28 2025 17:50:11. | |
| mariadb-setpermission | File | 18128 bytes | November 28 2025 17:50:11. | |
| mariadb-show | File | 5113136 bytes | November 28 2025 17:50:10. | |
| mariadb-slap | File | 5132600 bytes | November 28 2025 17:50:10. | |
| mariadb-tzinfo-to-sql | File | 4793896 bytes | November 28 2025 17:50:11. | |
| mariadb-waitpid | File | 4781272 bytes | November 28 2025 17:50:11. | |
| mc | File | 1360320 bytes | October 18 2019 20:38:40. | |
| mcdiff | File | 1360320 bytes | October 18 2019 20:38:40. | |
| mcedit | File | 1360320 bytes | October 18 2019 20:38:40. | |
| mcookie | File | 34064 bytes | February 04 2026 20:18:19. | |
| mcview | File | 1360320 bytes | October 18 2019 20:38:40. | |
| md5sum | File | 46632 bytes | December 18 2025 13:54:40. | |
| mesg | File | 16752 bytes | February 04 2026 20:18:19. | |
| mkdir | File | 84680 bytes | December 18 2025 13:54:40. | |
| mkfifo | File | 68056 bytes | December 18 2025 13:54:40. | |
| mktemp | File | 46760 bytes | December 18 2025 13:54:40. | |
| mogrify | File | 12120 bytes | April 01 2025 13:15:58. | |
| montage | File | 12120 bytes | April 01 2025 13:15:58. | |
| more | File | 46016 bytes | February 04 2026 20:18:19. | |
| msql2mysql | File | 1446 bytes | November 28 2025 17:50:11. | |
| mv | File | 147408 bytes | December 18 2025 13:54:40. | |
| my_print_defaults | File | 4781680 bytes | November 28 2025 17:50:10. | |
| mysql | File | 5605864 bytes | November 28 2025 17:50:10. | |
| mysql_config | File | 4574 bytes | November 28 2025 17:50:41. | |
| mysql_find_rows | File | 3433 bytes | November 28 2025 17:50:11. | |
| mysql_waitpid | File | 4781272 bytes | November 28 2025 17:50:11. | |
| mysqlaccess | File | 112112 bytes | November 28 2025 17:50:11. | |
| mysqladmin | File | 5126776 bytes | November 28 2025 17:50:10. | |
| mysqlbinlog | File | 5411576 bytes | November 28 2025 17:50:10. | |
| mysqlcheck | File | 5123960 bytes | November 28 2025 17:50:10. | |
| mysqldump | File | 5229744 bytes | November 28 2025 17:50:10. | |
| mysqlimport | File | 5114744 bytes | November 28 2025 17:50:10. | |
| mysqlshow | File | 5113136 bytes | November 28 2025 17:50:10. | |
| mytop | File | 73757 bytes | November 28 2025 17:50:11. | |
| namei | File | 33896 bytes | February 04 2026 20:18:19. | |
| nano | File | 253888 bytes | September 24 2024 02:16:19. | |
| neqn | File | 908 bytes | October 13 2019 14:29:37. | |
| nice | File | 38224 bytes | December 18 2025 13:54:40. | |
| nl | File | 46648 bytes | December 18 2025 13:54:40. | |
| nm | File | 51584 bytes | December 18 2025 14:21:35. | |
| nohup | File | 38312 bytes | December 18 2025 13:54:40. | |
| nproc | File | 38304 bytes | December 18 2025 13:54:40. | |
| nroff | File | 3312 bytes | October 13 2019 14:29:39. | |
| nslookup | File | 149768 bytes | November 06 2025 07:45:52. | |
| nsupdate | File | 74808 bytes | November 06 2025 07:45:52. | |
| numfmt | File | 67216 bytes | December 18 2025 13:54:40. | |
| objcopy | File | 245832 bytes | December 18 2025 14:21:35. | |
| objdump | File | 429832 bytes | December 18 2025 14:21:35. | |
| od | File | 75576 bytes | December 18 2025 13:54:40. | |
| openssl | File | 763856 bytes | January 08 2026 13:57:28. | |
| pango-list | File | 12160 bytes | October 08 2021 15:22:06. | |
| pango-view | File | 58816 bytes | October 08 2021 15:22:06. | |
| passwd | File | 1284 bytes | April 18 2022 22:59:33. | |
| paste | File | 38280 bytes | December 18 2025 13:54:40. | |
| patch | File | 211416 bytes | June 01 2020 15:14:25. | |
| pathchk | File | 38224 bytes | December 18 2025 13:54:40. | |
| pdf2dsc | File | 698 bytes | June 03 2025 02:48:11. | |
| pdf2ps | File | 909 bytes | June 03 2025 02:48:11. | |
| perl | File | 12728 bytes | July 28 2025 08:07:20. | |
| perl5.26.3 | File | 12728 bytes | July 28 2025 08:07:20. | |
| perlbug | File | 45458 bytes | July 28 2025 08:08:32. | |
| perldoc | File | 118 bytes | October 13 2019 11:53:57. | |
| perlivp | File | 10813 bytes | July 28 2025 08:07:09. | |
| perlml | File | 7024 bytes | August 10 2022 20:54:28. | |
| perlthanks | File | 45458 bytes | July 28 2025 08:08:32. | |
| pg_dump | File | 409016 bytes | February 27 2024 08:25:00. | |
| pg_dumpall | File | 109680 bytes | February 27 2024 08:25:00. | |
| pg_restore | File | 177504 bytes | February 27 2024 08:25:00. | |
| pgrep | File | 29536 bytes | October 14 2023 20:31:02. | |
| php | File | 937 bytes | April 03 2025 11:08:35. | |
| pic | File | 300896 bytes | October 13 2019 14:29:41. | |
| piconv | File | 8271 bytes | October 13 2019 08:46:10. | |
| pinentry | File | 2404 bytes | August 12 2018 17:18:10. | |
| pinentry-curses | File | 79760 bytes | November 24 2019 17:39:34. | |
| ping | File | 67712 bytes | October 14 2023 17:19:01. | |
| pinky | File | 42456 bytes | December 18 2025 13:54:40. | |
| pip-3 | File | bytes | January 01 1970 00:00:00. | |
| pip3 | File | bytes | January 01 1970 00:00:00. | |
| pkg-config | File | 41000 bytes | October 13 2019 06:57:31. | |
| pkgconf | File | 41000 bytes | October 13 2019 06:57:31. | |
| pkill | File | 29536 bytes | October 14 2023 20:31:02. | |
| pl2pm | File | 4533 bytes | July 28 2025 08:08:16. | |
| pmap | File | 33568 bytes | October 14 2023 20:31:02. | |
| pod2html | File | 4134 bytes | July 28 2025 08:08:32. | |
| pod2man | File | 15034 bytes | October 13 2019 14:12:32. | |
| pod2text | File | 10803 bytes | October 13 2019 14:12:32. | |
| pod2usage | File | 3948 bytes | October 13 2019 11:57:18. | |
| podchecker | File | 3658 bytes | October 13 2019 11:46:35. | |
| podselect | File | 2527 bytes | October 13 2019 11:52:55. | |
| post-grohtml | File | 244456 bytes | October 13 2019 14:29:41. | |
| pr | File | 84120 bytes | December 18 2025 13:54:40. | |
| pre-grohtml | File | 133688 bytes | October 13 2019 14:29:41. | |
| precat | File | 5656 bytes | April 18 2022 15:10:50. | |
| preunzip | File | 5656 bytes | April 18 2022 15:10:50. | |
| prezip | File | 5656 bytes | April 18 2022 15:10:50. | |
| prezip-bin | File | 12264 bytes | April 18 2022 15:10:53. | |
| printenv | File | 34120 bytes | December 18 2025 13:54:40. | |
| printf | File | 54848 bytes | December 18 2025 13:54:40. | |
| prove | File | 13562 bytes | October 13 2019 12:42:34. | |
| ps | File | 137984 bytes | October 14 2023 20:31:02. | |
| ps2ascii | File | 631 bytes | June 03 2025 02:48:11. | |
| ps2epsi | File | 2752 bytes | June 03 2025 02:48:11. | |
| ps2pdf | File | 272 bytes | June 03 2025 02:48:11. | |
| ps2pdf12 | File | 215 bytes | June 03 2025 02:48:11. | |
| ps2pdf13 | File | 215 bytes | June 03 2025 02:48:11. | |
| ps2pdf14 | File | 215 bytes | June 03 2025 02:48:11. | |
| ps2pdfwr | File | 1097 bytes | June 03 2025 02:48:11. | |
| ps2ps | File | 647 bytes | June 03 2025 02:48:11. | |
| ps2ps2 | File | 669 bytes | June 03 2025 02:48:11. | |
| psql | File | 659792 bytes | February 27 2024 08:25:00. | |
| ptx | File | 79864 bytes | December 18 2025 13:54:40. | |
| pwd | File | 38320 bytes | December 18 2025 13:54:40. | |
| pwdx | File | 12984 bytes | October 14 2023 20:31:02. | |
| pydoc-3 | File | bytes | January 01 1970 00:00:00. | |
| pydoc3 | File | bytes | January 01 1970 00:00:00. | |
| python2 | File | 8032 bytes | April 10 2024 04:58:50. | |
| python2.7 | File | 8032 bytes | April 10 2024 04:58:50. | |
| python3 | File | 11872 bytes | February 05 2026 20:05:34. | |
| python3.6 | File | 11872 bytes | February 05 2026 20:05:34. | |
| python3.6m | File | 11872 bytes | February 05 2026 20:05:34. | |
| pyvenv-3 | File | bytes | January 01 1970 00:00:00. | |
| ranlib | File | 63456 bytes | December 18 2025 14:21:35. | |
| readelf | File | 639528 bytes | December 18 2025 14:21:35. | |
| readlink | File | 46984 bytes | December 18 2025 13:54:40. | |
| realpath | File | 51144 bytes | December 18 2025 13:54:40. | |
| recode | File | 48160 bytes | October 18 2019 15:18:02. | |
| reindexdb | File | 72008 bytes | February 27 2024 08:25:00. | |
| rename | File | 16896 bytes | February 04 2026 20:18:19. | |
| renew-dummy-cert | File | 725 bytes | January 08 2026 13:57:14. | |
| renice | File | 16856 bytes | February 04 2026 20:18:19. | |
| replace | File | 4763664 bytes | November 28 2025 17:50:11. | |
| reset | File | 25352 bytes | October 14 2023 18:54:49. | |
| rev | File | 12760 bytes | February 04 2026 20:18:19. | |
| rm | File | 72064 bytes | December 18 2025 13:54:40. | |
| rmdir | File | 46552 bytes | December 18 2025 13:54:40. | |
| rnano | File | 253888 bytes | September 24 2024 02:16:19. | |
| rsync | File | 522392 bytes | June 03 2025 02:02:49. | |
| ruby | File | 12128 bytes | May 14 2025 12:43:45. | |
| run-with-aspell | File | 85 bytes | April 18 2022 15:10:50. | |
| rvi | File | 1180352 bytes | October 12 2025 00:23:15. | |
| rview | File | 1180352 bytes | October 12 2025 00:23:15. | |
| rvim | File | 3067744 bytes | October 12 2025 00:23:15. | |
| scalar | File | 2290912 bytes | July 23 2025 06:59:57. | |
| scl | File | 37752 bytes | April 01 2023 16:10:31. | |
| scl_enabled | File | 258 bytes | August 25 2017 08:23:02. | |
| scl_source | File | 1863 bytes | April 01 2023 16:10:31. | |
| scp | File | 105304 bytes | December 18 2025 02:27:48. | |
| screen | File | 494040 bytes | May 05 2021 10:04:06. | |
| script | File | 37680 bytes | February 04 2026 20:18:19. | |
| sdiff | File | 107856 bytes | May 04 2020 15:15:14. | |
| sed | File | 118248 bytes | April 18 2022 21:41:04. | |
| selectorctl | File | 7812 bytes | January 23 2026 16:39:07. | |
| seq | File | 54728 bytes | December 18 2025 13:54:40. | |
| setsid | File | 16768 bytes | February 04 2026 20:18:19. | |
| setterm | File | 46208 bytes | February 04 2026 20:18:19. | |
| sftp | File | 163576 bytes | December 18 2025 02:27:48. | |
| sh | File | 1154680 bytes | August 26 2025 08:48:39. | |
| sha1sum | File | 46640 bytes | December 18 2025 13:54:40. | |
| sha224sum | File | 46672 bytes | December 18 2025 13:54:40. | |
| sha256sum | File | 46680 bytes | December 18 2025 13:54:40. | |
| sha384sum | File | 46688 bytes | December 18 2025 13:54:40. | |
| sha512sum | File | 46680 bytes | December 18 2025 13:54:40. | |
| shred | File | 63336 bytes | December 18 2025 13:54:40. | |
| shuf | File | 59488 bytes | December 18 2025 13:54:40. | |
| size | File | 34048 bytes | December 18 2025 14:21:35. | |
| skill | File | 29488 bytes | October 14 2023 20:31:02. | |
| slabtop | File | 21344 bytes | October 14 2023 20:31:02. | |
| sleep | File | 38288 bytes | December 18 2025 13:54:40. | |
| snice | File | 29488 bytes | October 14 2023 20:31:02. | |
| soelim | File | 43576 bytes | October 13 2019 14:29:41. | |
| sort | File | 126432 bytes | December 18 2025 13:54:40. | |
| spell | File | 122 bytes | April 18 2022 15:10:51. | |
| splain | File | 19150 bytes | July 28 2025 08:08:32. | |
| split | File | 59440 bytes | December 18 2025 13:54:40. | |
| sprof | File | 29360 bytes | December 08 2025 09:43:19. | |
| sqlite3 | File | 1346512 bytes | July 29 2025 01:27:56. | |
| ssh | File | 775648 bytes | December 18 2025 02:27:48. | |
| ssh-add | File | 354416 bytes | December 18 2025 02:27:48. | |
| ssh-agent | File | 333400 bytes | December 18 2025 02:27:48. | |
| ssh-copy-id | File | 10694 bytes | December 18 2025 02:27:46. | |
| ssh-keygen | File | 437424 bytes | December 18 2025 02:27:48. | |
| ssh-keyscan | File | 438800 bytes | December 18 2025 02:27:48. | |
| stat | File | 88232 bytes | December 18 2025 13:54:40. | |
| stdbuf | File | 50688 bytes | December 18 2025 13:54:40. | |
| strace | File | 2029832 bytes | January 28 2025 01:32:17. | |
| stream | File | 12112 bytes | April 01 2025 13:15:58. | |
| strings | File | 38328 bytes | December 18 2025 14:21:35. | |
| strip | File | 245856 bytes | December 18 2025 14:21:35. | |
| stty | File | 79472 bytes | December 18 2025 13:54:40. | |
| sum | File | 46624 bytes | December 18 2025 13:54:40. | |
| sync | File | 38256 bytes | December 18 2025 13:54:40. | |
| tabs | File | 16952 bytes | October 14 2023 18:54:49. | |
| tac | File | 42488 bytes | December 18 2025 13:54:40. | |
| tail | File | 75912 bytes | December 18 2025 13:54:40. | |
| tar | File | 459768 bytes | August 26 2025 08:57:37. | |
| taskset | File | 38152 bytes | February 04 2026 20:18:19. | |
| tbl | File | 158320 bytes | October 13 2019 14:29:41. | |
| tclsh | File | 9256 bytes | October 12 2019 00:25:32. | |
| tclsh8.6 | File | 9256 bytes | October 12 2019 00:25:32. | |
| tee | File | 42472 bytes | December 18 2025 13:54:40. | |
| test | File | 54848 bytes | December 18 2025 13:54:40. | |
| tic | File | 87360 bytes | October 14 2023 18:54:49. | |
| timeout | File | 42864 bytes | December 18 2025 13:54:40. | |
| tload | File | 17160 bytes | October 14 2023 20:31:02. | |
| tmpwatch | File | 36320 bytes | October 12 2019 11:32:29. | |
| toe | File | 16848 bytes | October 14 2023 18:54:49. | |
| top | File | 124616 bytes | October 14 2023 20:31:02. | |
| touch | File | 96192 bytes | December 18 2025 13:54:40. | |
| tput | File | 25392 bytes | October 14 2023 18:54:49. | |
| tr | File | 50816 bytes | December 18 2025 13:54:40. | |
| traceroute | File | 72672 bytes | March 11 2025 07:45:53. | |
| troff | File | 824344 bytes | October 13 2019 14:29:41. | |
| true | File | 34128 bytes | December 18 2025 13:54:40. | |
| truncate | File | 42352 bytes | December 18 2025 13:54:40. | |
| tset | File | 25352 bytes | October 14 2023 18:54:49. | |
| tsort | File | 42488 bytes | December 18 2025 13:54:40. | |
| tty | File | 34112 bytes | December 18 2025 13:54:40. | |
| tzselect | File | 15370 bytes | December 08 2025 09:36:35. | |
| uapi | File | 1280 bytes | January 10 2026 21:23:10. | |
| ul | File | 21080 bytes | February 04 2026 20:18:19. | |
| uname | File | 38224 bytes | December 18 2025 13:54:40. | |
| unexpand | File | 46704 bytes | December 18 2025 13:54:40. | |
| uniq | File | 50832 bytes | December 18 2025 13:54:40. | |
| unlink | File | 34144 bytes | December 18 2025 13:54:40. | |
| unversioned-python | File | bytes | January 01 1970 00:00:00. | |
| unzip | File | 206728 bytes | June 03 2025 02:10:12. | |
| unzipsfx | File | 103912 bytes | June 03 2025 02:10:12. | |
| uptime | File | 12888 bytes | October 14 2023 20:31:02. | |
| users | File | 38288 bytes | December 18 2025 13:54:40. | |
| utmpdump | File | 29352 bytes | February 04 2026 20:18:19. | |
| vacuumdb | File | 80344 bytes | February 27 2024 08:25:00. | |
| vdir | File | 143256 bytes | December 18 2025 13:54:40. | |
| vi | File | 1180352 bytes | October 12 2025 00:23:15. | |
| view | File | 1180352 bytes | October 12 2025 00:23:15. | |
| vim | File | 3067744 bytes | October 12 2025 00:23:15. | |
| vimdiff | File | 3067744 bytes | October 12 2025 00:23:15. | |
| vimtutor | File | 2121 bytes | October 12 2025 00:23:12. | |
| vmstat | File | 37672 bytes | October 14 2023 20:31:02. | |
| watch | File | 29888 bytes | October 14 2023 20:31:02. | |
| wc | File | 50832 bytes | December 18 2025 13:54:40. | |
| wget | File | 533928 bytes | August 13 2024 22:22:56. | |
| whereis | File | 29976 bytes | February 04 2026 20:18:19. | |
| which | File | 30088 bytes | August 26 2025 08:54:45. | |
| who | File | 54880 bytes | December 18 2025 13:54:40. | |
| whoami | File | 34136 bytes | December 18 2025 13:54:40. | |
| word-list-compress | File | 12280 bytes | April 18 2022 15:10:53. | |
| x86_64-redhat-linux-c++ | File | 1266616 bytes | August 26 2025 09:47:24. | |
| x86_64-redhat-linux-g++ | File | 1266616 bytes | August 26 2025 09:47:24. | |
| x86_64-redhat-linux-gcc | File | 1266576 bytes | August 26 2025 09:47:26. | |
| x86_64-redhat-linux-gcc-8 | File | 1266576 bytes | August 26 2025 09:47:26. | |
| xargs | File | 75888 bytes | December 18 2025 14:57:01. | |
| xmlcatalog | File | 20864 bytes | August 06 2025 13:59:35. | |
| xmllint | File | 75128 bytes | August 06 2025 13:59:35. | |
| xmlwf | File | 37848 bytes | November 20 2025 07:31:32. | |
| xsltproc | File | 29152 bytes | August 26 2025 08:55:17. | |
| xsubpp | File | 5080 bytes | October 13 2019 08:58:35. | |
| xxd | File | 21032 bytes | October 12 2025 00:23:15. | |
| yes | File | 34168 bytes | December 18 2025 13:54:40. | |
| zcat | File | 1983 bytes | April 27 2022 05:49:28. | |
| zcmp | File | 1677 bytes | April 27 2022 05:49:28. | |
| zdiff | File | 5879 bytes | April 27 2022 05:49:28. | |
| zegrep | File | 29 bytes | April 27 2022 05:49:28. | |
| zfgrep | File | 29 bytes | April 27 2022 05:49:28. | |
| zforce | File | 2080 bytes | April 27 2022 05:49:28. | |
| zgrep | File | 7582 bytes | April 27 2022 05:49:28. | |
| zip | File | 234496 bytes | October 11 2019 13:11:04. | |
| zipcloak | File | 105376 bytes | October 11 2019 13:11:04. | |
| zipgrep | File | 2953 bytes | October 10 2008 17:40:36. | |
| zipinfo | File | 206728 bytes | June 03 2025 02:10:12. | |
| zipnote | File | 100104 bytes | October 11 2019 13:11:04. | |
| zipsplit | File | 100104 bytes | October 11 2019 13:11:04. | |
| zless | File | 2205 bytes | April 27 2022 05:49:28. | |
| zmore | File | 1841 bytes | April 27 2022 05:49:28. | |
| znew | File | 4552 bytes | April 27 2022 05:49:28. | |
| zsoelim | File | 43576 bytes | October 13 2019 14:29:41. |
#!/usr/bin/perl
BEGIN {
# @INC poking no longer needed w/ new MakeMaker and Makefile.PL's
# with $ENV{PERL_CORE} set
# In case we need it in future...
require Config; import Config;
pop @INC if $INC[-1] eq '.';
}
use strict;
use warnings;
use Getopt::Std;
use Config;
my @orig_ARGV = @ARGV;
our $VERSION = do { my @r = (q$Revision: 2.21 $ =~ /\d+/g); sprintf "%d."."%02d" x $#r, @r };
# These may get re-ordered.
# RAW is a do_now as inserted by &enter
# AGG is an aggregated do_now, as built up by &process
use constant {
RAW_NEXT => 0,
RAW_IN_LEN => 1,
RAW_OUT_BYTES => 2,
RAW_FALLBACK => 3,
AGG_MIN_IN => 0,
AGG_MAX_IN => 1,
AGG_OUT_BYTES => 2,
AGG_NEXT => 3,
AGG_IN_LEN => 4,
AGG_OUT_LEN => 5,
AGG_FALLBACK => 6,
};
# (See the algorithm in encengine.c - we're building structures for it)
# There are two sorts of structures.
# "do_now" (an array, two variants of what needs storing) is whatever we need
# to do now we've read an input byte.
# It's housed in a "do_next" (which is how we got to it), and in turn points
# to a "do_next" which contains all the "do_now"s for the next input byte.
# There will be a "do_next" which is the start state.
# For a single byte encoding it's the only "do_next" - each "do_now" points
# back to it, and each "do_now" will cause bytes. There is no state.
# For a multi-byte encoding where all characters in the input are the same
# length, then there will be a tree of "do_now"->"do_next"->"do_now"
# branching out from the start state, one step for each input byte.
# The leaf "do_now"s will all be at the same distance from the start state,
# only the leaf "do_now"s cause output bytes, and they in turn point back to
# the start state.
# For an encoding where there are variable length input byte sequences, you
# will encounter a leaf "do_now" sooner for the shorter input sequences, but
# as before the leaves will point back to the start state.
# The system will cope with escape encodings (imagine them as a mostly
# self-contained tree for each escape state, and cross links between trees
# at the state-switching characters) but so far no input format defines these.
# The system will also cope with having output "leaves" in the middle of
# the bifurcating branches, not just at the extremities, but again no
# input format does this yet.
# There are two variants of the "do_now" structure. The first, smaller variant
# is generated by &enter as the input file is read. There is one structure
# for each input byte. Say we are mapping a single byte encoding to a
# single byte encoding, with "ABCD" going "abcd". There will be
# 4 "do_now"s, {"A" => [...,"a",...], "B" => [...,"b",...], "C"=>..., "D"=>...}
# &process then walks the tree, building aggregate "do_now" structures for
# adjacent bytes where possible. The aggregate is for a contiguous range of
# bytes which each produce the same length of output, each move to the
# same next state, and each have the same fallback flag.
# So our 4 RAW "do_now"s above become replaced by a single structure
# containing:
# ["A", "D", "abcd", 1, ...]
# ie, for an input byte $_ in "A".."D", output 1 byte, found as
# substr ("abcd", (ord $_ - ord "A") * 1, 1)
# which maps very nicely into pointer arithmetic in C for encengine.c
sub encode_U
{
# UTF-8 encode long hand - only covers part of perl's range
## my $uv = shift;
# chr() works in native space so convert value from table
# into that space before using chr().
my $ch = chr(utf8::unicode_to_native($_[0]));
# Now get core perl to encode that the way it likes.
utf8::encode($ch);
return $ch;
}
sub encode_S
{
# encode single byte
## my ($ch,$page) = @_; return chr($ch);
return chr $_[0];
}
sub encode_D
{
# encode double byte MS byte first
## my ($ch,$page) = @_; return chr($page).chr($ch);
return chr ($_[1]) . chr $_[0];
}
sub encode_M
{
# encode Multi-byte - single for 0..255 otherwise double
## my ($ch,$page) = @_;
## return &encode_D if $page;
## return &encode_S;
return chr ($_[1]) . chr $_[0] if $_[1];
return chr $_[0];
}
my %encode_types = (U => \&encode_U,
S => \&encode_S,
D => \&encode_D,
M => \&encode_M,
);
# Win32 does not expand globs on command line
if ($^O eq 'MSWin32' and !$ENV{PERL_CORE}) {
eval "\@ARGV = map(glob(\$_),\@ARGV)";
@ARGV = @orig_ARGV unless @ARGV;
}
my %opt;
# I think these are:
# -Q to disable the duplicate codepoint test
# -S make mapping errors fatal
# -q to remove comments written to output files
# -O to enable the (brute force) substring optimiser
# -o <output> to specify the output file name (else it's the first arg)
# -f <inlist> to give a file with a list of input files (else use the args)
# -n <name> to name the encoding (else use the basename of the input file.
#Getopt::Long::Configure("bundling");
#GetOptions(\%opt, qw(C M=s S Q q O o=s f=s n=s v));
getopts('CM:SQqOo:f:n:v',\%opt);
$opt{M} and make_makefile_pl($opt{M}, @ARGV);
$opt{C} and make_configlocal_pm($opt{C}, @ARGV);
$opt{v} ||= $ENV{ENC2XS_VERBOSE};
sub verbose {
print STDERR @_ if $opt{v};
}
sub verbosef {
printf STDERR @_ if $opt{v};
}
# ($cpp, $static, $sized) = compiler_info($declaration)
#
# return some information about the compiler and compile options we're using:
#
# $declaration - true if we're doing a declaration rather than a definition.
#
# $cpp - we're using C++
# $static - ok to declare the arrays as static
# $sized - the array declarations should be sized
sub compiler_info {
my ($declaration) = @_;
my $ccflags = $Config{ccflags};
if (defined $Config{ccwarnflags}) {
$ccflags .= " " . $Config{ccwarnflags};
}
my $compat = $ccflags =~ /\Q-Wc++-compat/;
my $pedantic = $ccflags =~ /-pedantic/;
my $cpp = ($Config{d_cplusplus} || '') eq 'define';
# The encpage_t tables contain recursive and mutually recursive
# references. To allow them to compile under C++ and some restrictive
# cc options, it may be necessary to make the tables non-static/const
# (thus moving them from the text to the data segment) and/or not
# include the size in the declaration.
my $static = !(
$cpp
|| ($compat && $pedantic)
|| ($^O eq 'MacOS' && $declaration)
);
# -Wc++-compat on its own warns if the array declaration is sized.
# The easiest way to avoid this warning is simply not to include
# the size in the declaration.
# With -pedantic as well, the issue doesn't arise because $static
# above becomes false.
my $sized = $declaration && !($compat && !$pedantic);
return ($cpp, $static, $sized);
}
# This really should go first, else the die here causes empty (non-erroneous)
# output files to be written.
my @encfiles;
if (exists $opt{f}) {
# -F is followed by name of file containing list of filenames
my $flist = $opt{f};
open(FLIST,$flist) || die "Cannot open $flist:$!";
chomp(@encfiles = <FLIST>);
close(FLIST);
} else {
@encfiles = @ARGV;
}
my $cname = $opt{o} ? $opt{o} : shift(@ARGV);
unless ($cname) { #debuging a win32 nmake error-only. works via cmdline
print "\nARGV:";
print "$_ " for @ARGV;
print "\nopt:";
print " $_ => ",defined $opt{$_}?$opt{$_}:"undef","\n" for keys %opt;
}
chmod(0666,$cname) if -f $cname && !-w $cname;
open(C,">", $cname) || die "Cannot open $cname:$!";
my $dname = $cname;
my $hname = $cname;
my ($doC,$doEnc,$doUcm,$doPet);
if ($cname =~ /\.(c|xs)$/i) # VMS may have upcased filenames with DECC$ARGV_PARSE_STYLE defined
{
$doC = 1;
$dname =~ s/(\.[^\.]*)?$/.exh/;
chmod(0666,$dname) if -f $cname && !-w $dname;
open(D,">", $dname) || die "Cannot open $dname:$!";
$hname =~ s/(\.[^\.]*)?$/.h/;
chmod(0666,$hname) if -f $cname && !-w $hname;
open(H,">", $hname) || die "Cannot open $hname:$!";
foreach my $fh (\*C,\*D,\*H)
{
print $fh <<"END" unless $opt{'q'};
/*
!!!!!!! DO NOT EDIT THIS FILE !!!!!!!
This file was autogenerated by:
$^X $0 @orig_ARGV
enc2xs VERSION $VERSION
*/
END
}
if ($cname =~ /(\w+)\.xs$/)
{
print C "#define PERL_NO_GET_CONTEXT\n";
print C "#include <EXTERN.h>\n";
print C "#include <perl.h>\n";
print C "#include <XSUB.h>\n";
}
print C "#include \"encode.h\"\n\n";
}
elsif ($cname =~ /\.enc$/)
{
$doEnc = 1;
}
elsif ($cname =~ /\.ucm$/)
{
$doUcm = 1;
}
elsif ($cname =~ /\.pet$/)
{
$doPet = 1;
}
my %encoding;
my %strings;
my $string_acc;
my %strings_in_acc;
my $saved = 0;
my $subsave = 0;
my $strings = 0;
sub cmp_name
{
if ($a =~ /^.*-(\d+)/)
{
my $an = $1;
if ($b =~ /^.*-(\d+)/)
{
my $r = $an <=> $1;
return $r if $r;
}
}
return $a cmp $b;
}
foreach my $enc (sort cmp_name @encfiles)
{
my ($name,$sfx) = $enc =~ /^.*?([\w-]+)\.(enc|ucm)$/;
$name = $opt{'n'} if exists $opt{'n'};
if (open(E,$enc))
{
if ($sfx eq 'enc')
{
compile_enc(\*E,lc($name));
}
else
{
compile_ucm(\*E,lc($name));
}
}
else
{
warn "Cannot open $enc for $name:$!";
}
}
if ($doC)
{
verbose "Writing compiled form\n";
foreach my $name (sort cmp_name keys %encoding)
{
my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}};
process($name.'_utf8',$e2u);
addstrings(\*C,$e2u);
process('utf8_'.$name,$u2e);
addstrings(\*C,$u2e);
}
outbigstring(\*C,"enctable");
foreach my $name (sort cmp_name keys %encoding)
{
my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}};
outtable(\*C,$e2u, "enctable");
outtable(\*C,$u2e, "enctable");
# push(@{$encoding{$name}},outstring(\*C,$e2u->{Cname}.'_def',$erep));
}
my ($cpp) = compiler_info(0);
my $ext = $cpp ? 'extern "C"' : "extern";
my $exta = $cpp ? 'extern "C"' : "static";
my $extb = $cpp ? 'extern "C"' : "";
foreach my $enc (sort cmp_name keys %encoding)
{
# my ($e2u,$u2e,$rep,$min_el,$max_el,$rsym) = @{$encoding{$enc}};
my ($e2u,$u2e,$rep,$min_el,$max_el) = @{$encoding{$enc}};
#my @info = ($e2u->{Cname},$u2e->{Cname},$rsym,length($rep),$min_el,$max_el);
my $replen = 0;
$replen++ while($rep =~ /\G\\x[0-9A-Fa-f]/g);
my $sym = "${enc}_encoding";
$sym =~ s/\W+/_/g;
my @info = ($e2u->{Cname},$u2e->{Cname},"${sym}_rep_character",$replen,
$min_el,$max_el);
print C "${exta} const U8 ${sym}_rep_character[] = \"$rep\";\n";
print C "${exta} const char ${sym}_enc_name[] = \"$enc\";\n\n";
print C "${extb} const encode_t $sym = \n";
# This is to make null encoding work -- dankogai
for (my $i = (scalar @info) - 1; $i >= 0; --$i){
$info[$i] ||= 1;
}
# end of null tweak -- dankogai
print C " {",join(',',@info,"{${sym}_enc_name,(const char *)0}"),"};\n\n";
}
foreach my $enc (sort cmp_name keys %encoding)
{
my $sym = "${enc}_encoding";
$sym =~ s/\W+/_/g;
print H "${ext} encode_t $sym;\n";
print D " Encode_XSEncoding(aTHX_ &$sym);\n";
}
if ($cname =~ /(\w+)\.xs$/)
{
my $mod = $1;
print C <<'END';
static void
Encode_XSEncoding(pTHX_ encode_t *enc)
{
dSP;
HV *stash = gv_stashpv("Encode::XS", TRUE);
SV *iv = newSViv(PTR2IV(enc));
SV *sv = sv_bless(newRV_noinc(iv),stash);
int i = 0;
/* with the SvLEN() == 0 hack, PVX won't be freed. We cast away name's
constness, in the hope that perl won't mess with it. */
assert(SvTYPE(iv) >= SVt_PV); assert(SvLEN(iv) == 0);
SvFLAGS(iv) |= SVp_POK;
SvPVX(iv) = (char*) enc->name[0];
PUSHMARK(sp);
XPUSHs(sv);
while (enc->name[i])
{
const char *name = enc->name[i++];
XPUSHs(sv_2mortal(newSVpvn(name,strlen(name))));
}
PUTBACK;
call_pv("Encode::define_encoding",G_DISCARD);
SvREFCNT_dec(sv);
}
END
print C "\nMODULE = Encode::$mod\tPACKAGE = Encode::$mod\n\n";
print C "BOOT:\n{\n";
print C "#include \"$dname\"\n";
print C "}\n";
}
# Close in void context is bad, m'kay
close(D) or warn "Error closing '$dname': $!";
close(H) or warn "Error closing '$hname': $!";
my $perc_saved = $saved/($strings + $saved) * 100;
my $perc_subsaved = $subsave/($strings + $subsave) * 100;
verbosef "%d bytes in string tables\n",$strings;
verbosef "%d bytes (%.3g%%) saved spotting duplicates\n",
$saved, $perc_saved if $saved;
verbosef "%d bytes (%.3g%%) saved using substrings\n",
$subsave, $perc_subsaved if $subsave;
}
elsif ($doEnc)
{
foreach my $name (sort cmp_name keys %encoding)
{
my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}};
output_enc(\*C,$name,$e2u);
}
}
elsif ($doUcm)
{
foreach my $name (sort cmp_name keys %encoding)
{
my ($e2u,$u2e,$erep,$min_el,$max_el) = @{$encoding{$name}};
output_ucm(\*C,$name,$u2e,$erep,$min_el,$max_el);
}
}
# writing half meg files and then not checking to see if you just filled the
# disk is bad, m'kay
close(C) or die "Error closing '$cname': $!";
# End of the main program.
sub compile_ucm
{
my ($fh,$name) = @_;
my $e2u = {};
my $u2e = {};
my $cs;
my %attr;
while (<$fh>)
{
s/#.*$//;
last if /^\s*CHARMAP\s*$/i;
if (/^\s*<(\w+)>\s+"?([^"]*)"?\s*$/i) # " # Grrr
{
$attr{$1} = $2;
}
}
if (!defined($cs = $attr{'code_set_name'}))
{
warn "No <code_set_name> in $name\n";
}
else
{
$name = $cs unless exists $opt{'n'};
}
my $erep;
my $urep;
my $max_el;
my $min_el;
if (exists $attr{'subchar'})
{
#my @byte;
#$attr{'subchar'} =~ /^\s*/cg;
#push(@byte,$1) while $attr{'subchar'} =~ /\G\\x([0-9a-f]+)/icg;
#$erep = join('',map(chr(hex($_)),@byte));
$erep = $attr{'subchar'};
$erep =~ s/^\s+//; $erep =~ s/\s+$//;
}
print "Reading $name ($cs)\n"
unless defined $ENV{MAKEFLAGS}
and $ENV{MAKEFLAGS} =~ /\b(s|silent|quiet)\b/;
my $nfb = 0;
my $hfb = 0;
while (<$fh>)
{
s/#.*$//;
last if /^\s*END\s+CHARMAP\s*$/i;
next if /^\s*$/;
my (@uni, @byte) = ();
my ($uni, $byte, $fb) = m/^(\S+)\s+(\S+)\s+(\S+)\s+/o
or die "Bad line: $_";
while ($uni =~ m/\G<([U0-9a-fA-F\+]+)>/g){
push @uni, map { substr($_, 1) } split(/\+/, $1);
}
while ($byte =~ m/\G\\x([0-9a-fA-F]+)/g){
push @byte, $1;
}
if (@uni)
{
my $uch = join('', map { encode_U(hex($_)) } @uni );
my $ech = join('',map(chr(hex($_)),@byte));
my $el = length($ech);
$max_el = $el if (!defined($max_el) || $el > $max_el);
$min_el = $el if (!defined($min_el) || $el < $min_el);
if (length($fb))
{
$fb = substr($fb,1);
$hfb++;
}
else
{
$nfb++;
$fb = '0';
}
# $fb is fallback flag
# 0 - round trip safe
# 1 - fallback for unicode -> enc
# 2 - skip sub-char mapping
# 3 - fallback enc -> unicode
enter($u2e,$uch,$ech,$u2e,$fb+0) if ($fb =~ /[01]/);
enter($e2u,$ech,$uch,$e2u,$fb+0) if ($fb =~ /[03]/);
}
else
{
warn $_;
}
}
if ($nfb && $hfb)
{
die "$nfb entries without fallback, $hfb entries with\n";
}
$encoding{$name} = [$e2u,$u2e,$erep,$min_el,$max_el];
}
sub compile_enc
{
my ($fh,$name) = @_;
my $e2u = {};
my $u2e = {};
my $type;
while ($type = <$fh>)
{
last if $type !~ /^\s*#/;
}
chomp($type);
return if $type eq 'E';
# Do the hash lookup once, rather than once per function call. 4% speedup.
my $type_func = $encode_types{$type};
my ($def,$sym,$pages) = split(/\s+/,scalar(<$fh>));
warn "$type encoded $name\n";
my $rep = '';
# Save a defined test by setting these to defined values.
my $min_el = ~0; # A very big integer
my $max_el = 0; # Anything must be longer than 0
{
my $v = hex($def);
$rep = &$type_func($v & 0xFF, ($v >> 8) & 0xffe);
}
my $errors;
my $seen;
# use -Q to silence the seen test. Makefile.PL uses this by default.
$seen = {} unless $opt{Q};
do
{
my $line = <$fh>;
chomp($line);
my $page = hex($line);
my $ch = 0;
my $i = 16;
do
{
# So why is it 1% faster to leave the my here?
my $line = <$fh>;
$line =~ s/\r\n$/\n/;
die "$.:${line}Line should be exactly 65 characters long including
newline (".length($line).")" unless length ($line) == 65;
# Split line into groups of 4 hex digits, convert groups to ints
# This takes 65.35
# map {hex $_} $line =~ /(....)/g
# This takes 63.75 (2.5% less time)
# unpack "n*", pack "H*", $line
# There's an implicit loop in map. Loops are bad, m'kay. Ops are bad, m'kay
# Doing it as while ($line =~ /(....)/g) took 74.63
foreach my $val (unpack "n*", pack "H*", $line)
{
next if $val == 0xFFFD;
my $ech = &$type_func($ch,$page);
if ($val || (!$ch && !$page))
{
my $el = length($ech);
$max_el = $el if $el > $max_el;
$min_el = $el if $el < $min_el;
my $uch = encode_U($val);
if ($seen) {
# We're doing the test.
# We don't need to read this quickly, so storing it as a scalar,
# rather than 3 (anon array, plus the 2 scalars it holds) saves
# RAM and may make us faster on low RAM systems. [see __END__]
if (exists $seen->{$uch})
{
warn sprintf("U%04X is %02X%02X and %04X\n",
$val,$page,$ch,$seen->{$uch});
$errors++;
}
else
{
$seen->{$uch} = $page << 8 | $ch;
}
}
# Passing 2 extra args each time is 3.6% slower!
# Even with having to add $fallback ||= 0 later
enter_fb0($e2u,$ech,$uch);
enter_fb0($u2e,$uch,$ech);
}
else
{
# No character at this position
# enter($e2u,$ech,undef,$e2u);
}
$ch++;
}
} while --$i;
} while --$pages;
die "\$min_el=$min_el, \$max_el=$max_el - seems we read no lines"
if $min_el > $max_el;
die "$errors mapping conflicts\n" if ($errors && $opt{'S'});
$encoding{$name} = [$e2u,$u2e,$rep,$min_el,$max_el];
}
# my ($a,$s,$d,$t,$fb) = @_;
sub enter {
my ($current,$inbytes,$outbytes,$next,$fallback) = @_;
# state we shift to after this (multibyte) input character defaults to same
# as current state.
$next ||= $current;
# Making sure it is defined seems to be faster than {no warnings;} in
# &process, or passing it in as 0 explicitly.
# XXX $fallback ||= 0;
# Start at the beginning and work forwards through the string to zero.
# effectively we are removing 1 character from the front each time
# but we don't actually edit the string. [this alone seems to be 14% speedup]
# Hence -$pos is the length of the remaining string.
my $pos = -length $inbytes;
while (1) {
my $byte = substr $inbytes, $pos, 1;
# RAW_NEXT => 0,
# RAW_IN_LEN => 1,
# RAW_OUT_BYTES => 2,
# RAW_FALLBACK => 3,
# to unicode an array would seem to be better, because the pages are dense.
# from unicode can be very sparse, favouring a hash.
# hash using the bytes (all length 1) as keys rather than ord value,
# as it's easier to sort these in &process.
# It's faster to always add $fallback even if it's undef, rather than
# choosing between 3 and 4 element array. (hence why we set it defined
# above)
my $do_now = $current->{Raw}{$byte} ||= [{},-$pos,'',$fallback];
# When $pos was -1 we were at the last input character.
unless (++$pos) {
$do_now->[RAW_OUT_BYTES] = $outbytes;
$do_now->[RAW_NEXT] = $next;
return;
}
# Tail recursion. The intermediate state may not have a name yet.
$current = $do_now->[RAW_NEXT];
}
}
# This is purely for optimisation. It's just &enter hard coded for $fallback
# of 0, using only a 3 entry array ref to save memory for every entry.
sub enter_fb0 {
my ($current,$inbytes,$outbytes,$next) = @_;
$next ||= $current;
my $pos = -length $inbytes;
while (1) {
my $byte = substr $inbytes, $pos, 1;
my $do_now = $current->{Raw}{$byte} ||= [{},-$pos,''];
unless (++$pos) {
$do_now->[RAW_OUT_BYTES] = $outbytes;
$do_now->[RAW_NEXT] = $next;
return;
}
$current = $do_now->[RAW_NEXT];
}
}
sub process
{
my ($name,$a) = @_;
$name =~ s/\W+/_/g;
$a->{Cname} = $name;
my $raw = $a->{Raw};
my ($l, $agg_max_in, $agg_next, $agg_in_len, $agg_out_len, $agg_fallback);
my @ent;
$agg_max_in = 0;
foreach my $key (sort keys %$raw) {
# RAW_NEXT => 0,
# RAW_IN_LEN => 1,
# RAW_OUT_BYTES => 2,
# RAW_FALLBACK => 3,
my ($next, $in_len, $out_bytes, $fallback) = @{$raw->{$key}};
# Now we are converting from raw to aggregate, switch from 1 byte strings
# to numbers
my $b = ord $key;
$fallback ||= 0;
if ($l &&
# If this == fails, we're going to reset $agg_max_in below anyway.
$b == ++$agg_max_in &&
# References in numeric context give the pointer as an int.
$agg_next == $next &&
$agg_in_len == $in_len &&
$agg_out_len == length $out_bytes &&
$agg_fallback == $fallback
# && length($l->[AGG_OUT_BYTES]) < 16
) {
# my $i = ord($b)-ord($l->[AGG_MIN_IN]);
# we can aggregate this byte onto the end.
$l->[AGG_MAX_IN] = $b;
$l->[AGG_OUT_BYTES] .= $out_bytes;
} else {
# AGG_MIN_IN => 0,
# AGG_MAX_IN => 1,
# AGG_OUT_BYTES => 2,
# AGG_NEXT => 3,
# AGG_IN_LEN => 4,
# AGG_OUT_LEN => 5,
# AGG_FALLBACK => 6,
# Reset the last thing we saw, plus set 5 lexicals to save some derefs.
# (only gains .6% on euc-jp -- is it worth it?)
push @ent, $l = [$b, $agg_max_in = $b, $out_bytes, $agg_next = $next,
$agg_in_len = $in_len, $agg_out_len = length $out_bytes,
$agg_fallback = $fallback];
}
if (exists $next->{Cname}) {
$next->{'Forward'} = 1 if $next != $a;
} else {
process(sprintf("%s_%02x",$name,$b),$next);
}
}
# encengine.c rules say that last entry must be for 255
if ($agg_max_in < 255) {
push @ent, [1+$agg_max_in, 255,undef,$a,0,0];
}
$a->{'Entries'} = \@ent;
}
sub addstrings
{
my ($fh,$a) = @_;
my $name = $a->{'Cname'};
# String tables
foreach my $b (@{$a->{'Entries'}})
{
next unless $b->[AGG_OUT_LEN];
$strings{$b->[AGG_OUT_BYTES]} = undef;
}
if ($a->{'Forward'})
{
my ($cpp, $static, $sized) = compiler_info(1);
my $count = $sized ? scalar(@{$a->{'Entries'}}) : '';
if ($static) {
# we cannot ask Config for d_plusplus since we can override CC=g++-6 on the cmdline
print $fh "#ifdef __cplusplus\n"; # -fpermissive since g++-6
print $fh "extern encpage_t $name\[$count];\n";
print $fh "#else\n";
print $fh "static const encpage_t $name\[$count];\n";
print $fh "#endif\n";
} else {
print $fh "extern encpage_t $name\[$count];\n";
}
}
$a->{'DoneStrings'} = 1;
foreach my $b (@{$a->{'Entries'}})
{
my ($s,$e,$out,$t,$end,$l) = @$b;
addstrings($fh,$t) unless $t->{'DoneStrings'};
}
}
sub outbigstring
{
my ($fh,$name) = @_;
$string_acc = '';
# Make the big string in the string accumulator. Longest first, on the hope
# that this makes it more likely that we find the short strings later on.
# Not sure if it helps sorting strings of the same length lexically.
foreach my $s (sort {length $b <=> length $a || $a cmp $b} keys %strings) {
my $index = index $string_acc, $s;
if ($index >= 0) {
$saved += length($s);
$strings_in_acc{$s} = $index;
} else {
OPTIMISER: {
if ($opt{'O'}) {
my $sublength = length $s;
while (--$sublength > 0) {
# progressively lop characters off the end, to see if the start of
# the new string overlaps the end of the accumulator.
if (substr ($string_acc, -$sublength)
eq substr ($s, 0, $sublength)) {
$subsave += $sublength;
$strings_in_acc{$s} = length ($string_acc) - $sublength;
# append the last bit on the end.
$string_acc .= substr ($s, $sublength);
last OPTIMISER;
}
# or if the end of the new string overlaps the start of the
# accumulator
next unless substr ($string_acc, 0, $sublength)
eq substr ($s, -$sublength);
# well, the last $sublength characters of the accumulator match.
# so as we're prepending to the accumulator, need to shift all our
# existing offsets forwards
$_ += $sublength foreach values %strings_in_acc;
$subsave += $sublength;
$strings_in_acc{$s} = 0;
# append the first bit on the start.
$string_acc = substr ($s, 0, -$sublength) . $string_acc;
last OPTIMISER;
}
}
# Optimiser (if it ran) found nothing, so just going have to tack the
# whole thing on the end.
$strings_in_acc{$s} = length $string_acc;
$string_acc .= $s;
};
}
}
$strings = length $string_acc;
my ($cpp) = compiler_info(0);
my $var = $cpp ? '' : 'static';
my $definition = "\n$var const U8 $name\[$strings] = { " .
join(',',unpack "C*",$string_acc);
# We have a single long line. Split it at convenient commas.
print $fh $1, "\n" while $definition =~ /\G(.{74,77},)/gcs;
print $fh substr ($definition, pos $definition), " };\n";
}
sub findstring {
my ($name,$s) = @_;
my $offset = $strings_in_acc{$s};
die "Can't find string " . join (',',unpack "C*",$s) . " in accumulator"
unless defined $offset;
"$name + $offset";
}
sub outtable
{
my ($fh,$a,$bigname) = @_;
my $name = $a->{'Cname'};
$a->{'Done'} = 1;
foreach my $b (@{$a->{'Entries'}})
{
my ($s,$e,$out,$t,$end,$l) = @$b;
outtable($fh,$t,$bigname) unless $t->{'Done'};
}
my ($cpp, $static) = compiler_info(0);
my $count = scalar(@{$a->{'Entries'}});
if ($static) {
print $fh "#ifdef __cplusplus\n"; # -fpermissive since g++-6
print $fh "encpage_t $name\[$count] = {\n";
print $fh "#else\n";
print $fh "static const encpage_t $name\[$count] = {\n";
print $fh "#endif\n";
} else {
print $fh "\nencpage_t $name\[$count] = {\n";
}
foreach my $b (@{$a->{'Entries'}})
{
my ($sc,$ec,$out,$t,$end,$l,$fb) = @$b;
# $end |= 0x80 if $fb; # what the heck was on your mind, Nick? -- Dan
print $fh "{";
if ($l)
{
printf $fh findstring($bigname,$out);
}
else
{
print $fh "0";
}
print $fh ",",$t->{Cname};
printf $fh ",0x%02x,0x%02x,$l,$end},\n",$sc,$ec;
}
print $fh "};\n";
}
sub output_enc
{
my ($fh,$name,$a) = @_;
die "Changed - fix me for new structure";
foreach my $b (sort keys %$a)
{
my ($s,$e,$out,$t,$end,$l,$fb) = @{$a->{$b}};
}
}
sub decode_U
{
my $s = shift;
}
my @uname;
sub char_names
{
my $s = do "unicore/Name.pl";
die "char_names: unicore/Name.pl: $!\n" unless defined $s;
pos($s) = 0;
while ($s =~ /\G([0-9a-f]+)\t([0-9a-f]*)\t(.*?)\s*\n/igc)
{
my $name = $3;
my $s = hex($1);
last if $s >= 0x10000;
my $e = length($2) ? hex($2) : $s;
for (my $i = $s; $i <= $e; $i++)
{
$uname[$i] = $name;
# print sprintf("U%04X $name\n",$i);
}
}
}
sub output_ucm_page
{
my ($cmap,$a,$t,$pre) = @_;
# warn sprintf("Page %x\n",$pre);
my $raw = $t->{Raw};
foreach my $key (sort keys %$raw) {
# RAW_NEXT => 0,
# RAW_IN_LEN => 1,
# RAW_OUT_BYTES => 2,
# RAW_FALLBACK => 3,
my ($next, $in_len, $out_bytes, $fallback) = @{$raw->{$key}};
my $u = ord $key;
$fallback ||= 0;
if ($next != $a && $next != $t) {
output_ucm_page($cmap,$a,$next,(($pre|($u &0x3F)) << 6)&0xFFFF);
} elsif (length $out_bytes) {
if ($pre) {
$u = $pre|($u &0x3f);
}
my $s = sprintf "<U%04X> ",$u;
#foreach my $c (split(//,$out_bytes)) {
# $s .= sprintf "\\x%02X",ord($c);
#}
# 9.5% faster changing that loop to this:
$s .= sprintf +("\\x%02X" x length $out_bytes), unpack "C*", $out_bytes;
$s .= sprintf " |%d # %s\n",($fallback ? 1 : 0),$uname[$u];
push(@$cmap,$s);
} else {
warn join(',',$u, @{$raw->{$key}},$a,$t);
}
}
}
sub output_ucm
{
my ($fh,$name,$h,$rep,$min_el,$max_el) = @_;
print $fh "# $0 @orig_ARGV\n" unless $opt{'q'};
print $fh "<code_set_name> \"$name\"\n";
char_names();
if (defined $min_el)
{
print $fh "<mb_cur_min> $min_el\n";
}
if (defined $max_el)
{
print $fh "<mb_cur_max> $max_el\n";
}
if (defined $rep)
{
print $fh "<subchar> ";
foreach my $c (split(//,$rep))
{
printf $fh "\\x%02X",ord($c);
}
print $fh "\n";
}
my @cmap;
output_ucm_page(\@cmap,$h,$h,0);
print $fh "#\nCHARMAP\n";
foreach my $line (sort { substr($a,8) cmp substr($b,8) } @cmap)
{
print $fh $line;
}
print $fh "END CHARMAP\n";
}
use vars qw(
$_Enc2xs
$_Version
$_Inc
$_E2X
$_Name
$_TableFiles
$_Now
);
sub find_e2x{
eval { require File::Find; };
my (@inc, %e2x_dir);
for my $inc (@INC){
push @inc, $inc unless $inc eq '.'; #skip current dir
}
File::Find::find(
sub {
my ($dev,$ino,$mode,$nlink,$uid,$gid,$rdev,$size,
$atime,$mtime,$ctime,$blksize,$blocks)
= lstat($_) or return;
-f _ or return;
if (/^.*\.e2x$/o){
no warnings 'once';
$e2x_dir{$File::Find::dir} ||= $mtime;
}
return;
}, @inc);
warn join("\n", keys %e2x_dir), "\n";
for my $d (sort {$e2x_dir{$a} <=> $e2x_dir{$b}} keys %e2x_dir){
$_E2X = $d;
# warn "$_E2X => ", scalar localtime($e2x_dir{$d});
return $_E2X;
}
}
sub make_makefile_pl
{
eval { require Encode } or die "You need to install Encode to use enc2xs -M\nerror: $@\n";
# our used for variable expansion
$_Enc2xs = $0;
$_Version = $VERSION;
$_E2X = find_e2x();
$_Name = shift;
$_TableFiles = join(",", map {qq('$_')} @_);
$_Now = scalar localtime();
eval { require File::Spec; };
_print_expand(File::Spec->catfile($_E2X,"Makefile_PL.e2x"),"Makefile.PL");
_print_expand(File::Spec->catfile($_E2X,"_PM.e2x"), "$_Name.pm");
_print_expand(File::Spec->catfile($_E2X,"_T.e2x"), "t/$_Name.t");
_print_expand(File::Spec->catfile($_E2X,"README.e2x"), "README");
_print_expand(File::Spec->catfile($_E2X,"Changes.e2x"), "Changes");
exit;
}
use vars qw(
$_ModLines
$_LocalVer
);
sub make_configlocal_pm {
eval { require Encode } or die "Unable to require Encode: $@\n";
eval { require File::Spec; };
# our used for variable expantion
my %in_core = map { $_ => 1 } (
'ascii', 'iso-8859-1', 'utf8',
'ascii-ctrl', 'null', 'utf-8-strict'
);
my %LocalMod = ();
# check @enc;
use File::Find ();
my $wanted = sub{
-f $_ or return;
$File::Find::name =~ /\A\./ and return;
$File::Find::name =~ /\.pm\z/ or return;
$File::Find::name =~ m/\bEncode\b/ or return;
my $mod = $File::Find::name;
$mod =~ s/.*\bEncode\b/Encode/o;
$mod =~ s/\.pm\z//o;
$mod =~ s,/,::,og;
eval qq{ require $mod; } or return;
warn qq{ require $mod;\n};
for my $enc ( Encode->encodings() ) {
no warnings;
$in_core{$enc} and next;
$Encode::Config::ExtModule{$enc} and next;
$LocalMod{$enc} ||= $mod;
}
};
File::Find::find({wanted => $wanted}, @INC);
$_ModLines = "";
for my $enc ( sort keys %LocalMod ) {
$_ModLines .=
qq(\$Encode::ExtModule{'$enc'} = "$LocalMod{$enc}";\n);
}
warn $_ModLines if $_ModLines;
$_LocalVer = _mkversion();
$_E2X = find_e2x();
$_Inc = $INC{"Encode.pm"};
$_Inc =~ s/\.pm$//o;
_print_expand( File::Spec->catfile( $_E2X, "ConfigLocal_PM.e2x" ),
File::Spec->catfile( $_Inc, "ConfigLocal.pm" ), 1 );
exit;
}
sub _mkversion{
# v-string is now depreciated; use time() instead;
#my ($ss,$mm,$hh,$dd,$mo,$yyyy) = localtime();
#$yyyy += 1900, $mo +=1;
#return sprintf("v%04d.%04d.%04d", $yyyy, $mo*100+$dd, $hh*100+$mm);
return time();
}
sub _print_expand{
eval { require File::Basename } or die "File::Basename needed. Are you on miniperl?;\nerror: $@\n";
File::Basename->import();
my ($src, $dst, $clobber) = @_;
if (!$clobber and -e $dst){
warn "$dst exists. skipping\n";
return;
}
warn "Generating $dst...\n";
open my $in, $src or die "$src : $!";
if ((my $d = dirname($dst)) ne '.'){
-d $d or mkdir $d, 0755 or die "mkdir $d : $!";
}
open my $out, ">", $dst or die "$!";
my $asis = 0;
while (<$in>){
if (/^#### END_OF_HEADER/){
$asis = 1; next;
}
s/(\$_[A-Z][A-Za-z0-9]+)_/$1/gee unless $asis;
print $out $_;
}
}
__END__
=head1 NAME
enc2xs -- Perl Encode Module Generator
=head1 SYNOPSIS
enc2xs -[options]
enc2xs -M ModName mapfiles...
enc2xs -C
=head1 DESCRIPTION
F<enc2xs> builds a Perl extension for use by Encode from either
Unicode Character Mapping files (.ucm) or Tcl Encoding Files (.enc).
Besides being used internally during the build process of the Encode
module, you can use F<enc2xs> to add your own encoding to perl.
No knowledge of XS is necessary.
=head1 Quick Guide
If you want to know as little about Perl as possible but need to
add a new encoding, just read this chapter and forget the rest.
=over 4
=item 0.Z<>
Have a .ucm file ready. You can get it from somewhere or you can write
your own from scratch or you can grab one from the Encode distribution
and customize it. For the UCM format, see the next Chapter. In the
example below, I'll call my theoretical encoding myascii, defined
in I<my.ucm>. C<$> is a shell prompt.
$ ls -F
my.ucm
=item 1.Z<>
Issue a command as follows;
$ enc2xs -M My my.ucm
generating Makefile.PL
generating My.pm
generating README
generating Changes
Now take a look at your current directory. It should look like this.
$ ls -F
Makefile.PL My.pm my.ucm t/
The following files were created.
Makefile.PL - MakeMaker script
My.pm - Encode submodule
t/My.t - test file
=over 4
=item 1.1.Z<>
If you want *.ucm installed together with the modules, do as follows;
$ mkdir Encode
$ mv *.ucm Encode
$ enc2xs -M My Encode/*ucm
=back
=item 2.Z<>
Edit the files generated. You don't have to if you have no time AND no
intention to give it to someone else. But it is a good idea to edit
the pod and to add more tests.
=item 3.Z<>
Now issue a command all Perl Mongers love:
$ perl Makefile.PL
Writing Makefile for Encode::My
=item 4.Z<>
Now all you have to do is make.
$ make
cp My.pm blib/lib/Encode/My.pm
/usr/local/bin/perl /usr/local/bin/enc2xs -Q -O \
-o encode_t.c -f encode_t.fnm
Reading myascii (myascii)
Writing compiled form
128 bytes in string tables
384 bytes (75%) saved spotting duplicates
1 bytes (0.775%) saved using substrings
....
chmod 644 blib/arch/auto/Encode/My/My.bs
$
The time it takes varies depending on how fast your machine is and
how large your encoding is. Unless you are working on something big
like euc-tw, it won't take too long.
=item 5.Z<>
You can "make install" already but you should test first.
$ make test
PERL_DL_NONLAZY=1 /usr/local/bin/perl -Iblib/arch -Iblib/lib \
-e 'use Test::Harness qw(&runtests $verbose); \
$verbose=0; runtests @ARGV;' t/*.t
t/My....ok
All tests successful.
Files=1, Tests=2, 0 wallclock secs
( 0.09 cusr + 0.01 csys = 0.09 CPU)
=item 6.Z<>
If you are content with the test result, just "make install"
=item 7.Z<>
If you want to add your encoding to Encode's demand-loading list
(so you don't have to "use Encode::YourEncoding"), run
enc2xs -C
to update Encode::ConfigLocal, a module that controls local settings.
After that, "use Encode;" is enough to load your encodings on demand.
=back
=head1 The Unicode Character Map
Encode uses the Unicode Character Map (UCM) format for source character
mappings. This format is used by IBM's ICU package and was adopted
by Nick Ing-Simmons for use with the Encode module. Since UCM is
more flexible than Tcl's Encoding Map and far more user-friendly,
this is the recommended format for Encode now.
A UCM file looks like this.
#
# Comments
#
<code_set_name> "US-ascii" # Required
<code_set_alias> "ascii" # Optional
<mb_cur_min> 1 # Required; usually 1
<mb_cur_max> 1 # Max. # of bytes/char
<subchar> \x3F # Substitution char
#
CHARMAP
<U0000> \x00 |0 # <control>
<U0001> \x01 |0 # <control>
<U0002> \x02 |0 # <control>
....
<U007C> \x7C |0 # VERTICAL LINE
<U007D> \x7D |0 # RIGHT CURLY BRACKET
<U007E> \x7E |0 # TILDE
<U007F> \x7F |0 # <control>
END CHARMAP
=over 4
=item *
Anything that follows C<#> is treated as a comment.
=item *
The header section continues until a line containing the word
CHARMAP. This section has a form of I<E<lt>keywordE<gt> value>, one
pair per line. Strings used as values must be quoted. Barewords are
treated as numbers. I<\xXX> represents a byte.
Most of the keywords are self-explanatory. I<subchar> means
substitution character, not subcharacter. When you decode a Unicode
sequence to this encoding but no matching character is found, the byte
sequence defined here will be used. For most cases, the value here is
\x3F; in ASCII, this is a question mark.
=item *
CHARMAP starts the character map section. Each line has a form as
follows:
<UXXXX> \xXX.. |0 # comment
^ ^ ^
| | +- Fallback flag
| +-------- Encoded byte sequence
+-------------- Unicode Character ID in hex
The format is roughly the same as a header section except for the
fallback flag: | followed by 0..3. The meaning of the possible
values is as follows:
=over 4
=item |0
Round trip safe. A character decoded to Unicode encodes back to the
same byte sequence. Most characters have this flag.
=item |1
Fallback for unicode -> encoding. When seen, enc2xs adds this
character for the encode map only.
=item |2
Skip sub-char mapping should there be no code point.
=item |3
Fallback for encoding -> unicode. When seen, enc2xs adds this
character for the decode map only.
=back
=item *
And finally, END OF CHARMAP ends the section.
=back
When you are manually creating a UCM file, you should copy ascii.ucm
or an existing encoding which is close to yours, rather than write
your own from scratch.
When you do so, make sure you leave at least B<U0000> to B<U0020> as
is, unless your environment is EBCDIC.
B<CAVEAT>: not all features in UCM are implemented. For example,
icu:state is not used. Because of that, you need to write a perl
module if you want to support algorithmical encodings, notably
the ISO-2022 series. Such modules include L<Encode::JP::2022_JP>,
L<Encode::KR::2022_KR>, and L<Encode::TW::HZ>.
=head2 Coping with duplicate mappings
When you create a map, you SHOULD make your mappings round-trip safe.
That is, C<encode('your-encoding', decode('your-encoding', $data)) eq
$data> stands for all characters that are marked as C<|0>. Here is
how to make sure:
=over 4
=item *
Sort your map in Unicode order.
=item *
When you have a duplicate entry, mark either one with '|1' or '|3'.
=item *
And make sure the '|1' or '|3' entry FOLLOWS the '|0' entry.
=back
Here is an example from big5-eten.
<U2550> \xF9\xF9 |0
<U2550> \xA2\xA4 |3
Internally Encoding -> Unicode and Unicode -> Encoding Map looks like
this;
E to U U to E
--------------------------------------
\xF9\xF9 => U2550 U2550 => \xF9\xF9
\xA2\xA4 => U2550
So it is round-trip safe for \xF9\xF9. But if the line above is upside
down, here is what happens.
E to U U to E
--------------------------------------
\xA2\xA4 => U2550 U2550 => \xF9\xF9
(\xF9\xF9 => U2550 is now overwritten!)
The Encode package comes with F<ucmlint>, a crude but sufficient
utility to check the integrity of a UCM file. Check under the
Encode/bin directory for this.
When in doubt, you can use F<ucmsort>, yet another utility under
Encode/bin directory.
=head1 Bookmarks
=over 4
=item *
ICU Home Page
L<http://www.icu-project.org/>
=item *
ICU Character Mapping Tables
L<http://site.icu-project.org/charts/charset>
=item *
ICU:Conversion Data
L<http://www.icu-project.org/userguide/conversion-data.html>
=back
=head1 SEE ALSO
L<Encode>,
L<perlmod>,
L<perlpod>
=cut
# -Q to disable the duplicate codepoint test
# -S make mapping errors fatal
# -q to remove comments written to output files
# -O to enable the (brute force) substring optimiser
# -o <output> to specify the output file name (else it's the first arg)
# -f <inlist> to give a file with a list of input files (else use the args)
# -n <name> to name the encoding (else use the basename of the input file.
With %seen holding array refs:
865.66 real 28.80 user 8.79 sys
7904 maximum resident set size
1356 average shared memory size
18566 average unshared data size
229 average unshared stack size
46080 page reclaims
33373 page faults
With %seen holding simple scalars:
342.16 real 27.11 user 3.54 sys
8388 maximum resident set size
1394 average shared memory size
14969 average unshared data size
236 average unshared stack size
28159 page reclaims
9839 page faults
Yes, 5 minutes is faster than 15. Above is for CP936 in CN. Only difference is
how %seen is storing things its seen. So it is pathalogically bad on a 16M
RAM machine, but it's going to help even on modern machines.
Swapping is bad, m'kay :-)
SILENT KILLER Tool