Home » Uncategorized » python slugify:Python slug 化库 把 unicode 转化为 ASCII 汉字转拼音

python slugify:Python slug 化库 把 unicode 转化为 ASCII 汉字转拼音

Python Slugify
status-image version-image  coverage-image
Overview
A Python slugify application that handles unicode.
How to install
Via pip:
$ pip install python-slugify
Via easy_install:
$ easy_install python-slugify
From sources via git:
$ git clone http://github.com/un33k/python-slugify
$ cd python-slugify
$ python setup.py install
From sources:
$ wget https://github.com/un33k/python-slugify/zipball/master
# unzip the downloaded file
# cd into python-slugify-* directory
$ python setup.py install
Note:
By default python-slugify installs unidecode (GPL) for its decoding needs.
Alternatively python-slugify can install and use text-unidecode (GPL & Perl Artistic) instead. This is done by setting up an environment variable SLUGIFY_USES_TEXT_UNIDECODE=yes prior to installing and/or upgrading python-slugify.
In cases where both unidecode and text-unidecode are installed, python-slugify always defaults to using unidecode regardless of the SLUGIFY_USES_TEXT_UNIDECODE=yes environment variable.
How to use
from slugify import slugify
txt = “This is a test —“
r = slugify(txt)
self.assertEqual(r, “this-is-a-test”)
txt = “___This is a test —“
r = slugify(txt)
self.assertEqual(r, “this-is-a-test”)
txt = “___This is a test___”
r = slugify(txt)
self.assertEqual(r, “this-is-a-test”)
txt = “This — is a ## test —“
r = slugify(txt)
self.assertEqual(r, “this-is-a-test”)
txt = ‘影師嗎
r = slugify(txt)
self.assertEqual(r, “ying-shi-ma”)
txt = ‘C\’est déjà l\’été.’
r = slugify(txt)
self.assertEqual(r, “c-est-deja-l-ete”)
txt = ‘Nín hǎo. Wǒ shì zhōng guó rén’
r = slugify(txt)
self.assertEqual(r, “nin-hao-wo-shi-zhong-guo-ren”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt)
self.assertEqual(r, “jaja-lol-mememeoo-a”)
txt = ‘Компьютер’
r = slugify(txt)
self.assertEqual(r, “kompiuter”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=9)
self.assertEqual(r, “jaja-lol”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=15)
self.assertEqual(r, “jaja-lol-mememe”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=50)
self.assertEqual(r, “jaja-lol-mememeoo-a”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=15, word_boundary=True)
self.assertEqual(r, “jaja-lol-a”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=17, word_boundary=True)
self.assertEqual(r, “jaja-lol-mememeoo”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=18, word_boundary=True)
self.assertEqual(r, “jaja-lol-mememeoo”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=19, word_boundary=True)
self.assertEqual(r, “jaja-lol-mememeoo-a”)
txt = ‘jaja—lol-méméméoo–a’
r = slugify(txt, max_length=20, word_boundary=True, separator=”.”)
self.assertEqual(r, “jaja.lol.mememeoo.a”)
txt = ‘one two three four five’
r = slugify(txt, max_length=13, word_boundary=True, save_order=True)
self.assertEqual(r, “one-two-three”)
txt = ‘one two three four five’
r = slugify(txt, max_length=13, word_boundary=True, save_order=False)
self.assertEqual(r, “one-two-three”)
txt = ‘one two three four five’
r = slugify(txt, max_length=12, word_boundary=True, save_order=False)
self.assertEqual(r, “one-two-four”)
txt = ‘one two three four five’
r = slugify(txt, max_length=12, word_boundary=True, save_order=True)
self.assertEqual(r, “one-two”)
txt = ‘this has a stopword’
r = slugify(txt, stopwords=[‘stopword’] )
self.assertEqual(r, ‘this-has-a’)
txt = ‘the quick brown fox jumps over the lazy dog’
r = slugify(txt, stopwords=[‘the’] )
self.assertEqual(r, ‘quick-brown-fox-jumps-over-lazy-dog’)
txt = ‘Foo A FOO B foo C’
r = slugify(txt, stopwords=[‘foo’] )
self.assertEqual(r, ‘a-b-c’)
txt = ‘Foo A FOO B foo C’
r = slugify(txt, stopwords=[‘FOO’] )
self.assertEqual(r, ‘a-b-c’)
txt = ‘the quick brown fox jumps over the lazy dog in a hurry’
r = slugify(txt, stopwords=[‘the’, ‘in’, ‘a’, ‘hurry’] )
self.assertEqual(r, ‘quick-brown-fox-jumps-over-lazy-dog’)
txt = ‘thIs Has a stopword Stopword’
r = slugify(txt, stopwords=[‘Stopword’] , lowercase=False)
self.assertEqual(r, ‘thIs-Has-a-stopword’)
txt = ‘foo & bar’
r = slugify(txt)
self.assertEqual(r, ‘foo-bar’)
txt = “___This is a test___”
regex_pattern = r'[^-a-z0-9_] +’
r = slugify(txt, regex_pattern=regex_pattern)
self.assertEqual(r, “___this-is-a-test___”)
txt = “___This is a test___”
regex_pattern = r'[^-a-z0-9_] +’
r = slugify(txt, separator=’_’, regex_pattern=regex_pattern)
self.assertNotEqual(r, “_this_is_a_test_”)
For more examples, have a look at the (TEST) file.
Running the tests
To run the tests against the current environment:
python test.py
License
Released under a (MIT) license.
Note:
python-slugify relies on thirdparty API for decoding unicode strings. This dependency is kept at the public API ONLY in order to ensure that python-slugify never becomes a derivative work of any other packages. MIT license holds.
Version
X.Y.Z Version
`MAJOR` version — when you make incompatible API changes,
`MINOR` version — when you add functionality in a backwards-compatible manner, and
`PATCH` version — when you make backwards-compatible bug fixes.

发表评论

电子邮件地址不会被公开。 必填项已用*标注

*
*