How to sort list of String without considering Special characters and with case insensitive

Started by: san <santosh.ssit@gmail.com>
First post: 2012-11-27 09:31 -0800
Last post: 2012-11-27 18:21 +0000
Articles: 3 (3 participants)

Contents

  How to sort list of String without considering Special characters and with case insensitive san <santosh.ssit@gmail.com> - 2012-11-27 09:31 -0800
    Re: How to sort list of String without considering Special characters and with case insensitive MRAB <python@mrabarnett.plus.com> - 2012-11-27 17:54 +0000
    RE: How to sort list of String without considering Special characters and with case insensitive "Prasad, Ramit" <ramit.prasad@jpmorgan.com> - 2012-11-27 18:21 +0000

#33972 — How to sort list of String without considering Special characters and with case insensitive

From: san <santosh.ssit@gmail.com>
Date: 2012-11-27 09:31 -0800
Subject: How to sort list of String without considering Special characters and with case insensitive
Message-ID: <e6a4709c-ded7-4092-8822-e3ac02b66271@googlegroups.com>

Please let me know how to sort a list of strings in either ascending or descending order without considering special characters and case.
Example: list1 = ['test1_two', 'testOne', 'testTwo', 'test_one']
Applying list.sort() or sorted() gives ['test1_two', 'testOne', 'testTwo', 'test_one'],
but ignoring special characters and case it should be
['testOne', 'test_one', 'test1_two', 'testTwo'] OR ['test_one', 'testOne', 'testTwo', 'test1_two'].

list.sort() and sorted() order strings by the ASCII values of their characters. Please let me know how I can get the expected order instead.


#33974

From: MRAB <python@mrabarnett.plus.com>
Date: 2012-11-27 17:54 +0000
Message-ID: <mailman.315.1354039050.29569.python-list@python.org>
In reply to: #33972

On 2012-11-27 17:31, san wrote:
> Please let me know how to sort a list of strings in either ascending or descending order without considering special characters and case.
> Example: list1 = ['test1_two', 'testOne', 'testTwo', 'test_one']
> Applying list.sort() or sorted() gives ['test1_two', 'testOne', 'testTwo', 'test_one'],
> but ignoring special characters and case it should be
> ['testOne', 'test_one', 'test1_two', 'testTwo'] OR ['test_one', 'testOne', 'testTwo', 'test1_two'].
>
> list.sort() and sorted() order strings by the ASCII values of their characters. Please let me know how I can get the expected order instead.
>
(I'm using Python 3.)

The .sort method accepts a 'key' argument, which lets you pass a
function that transforms the value being sorted before comparison:

 >>> def make_key(string):
 ...     return string.replace('_', '').upper()

 >>> list1 = ['test1_two', 'testOne', 'testTwo', 'test_one']
 >>> list1.sort(key=make_key)
 >>> list1
['test1_two', 'testOne', 'test_one', 'testTwo']

I don't know how you define 'special'.

You could remove any characters which are special or keep any
characters which are not special, depending on how many characters are
defined as 'special':

from string import ascii_letters

# Sets are faster for this kind of thing.
ascii_letters = set(ascii_letters)

def make_key(string):
     return ''.join(c for c in string if c in ascii_letters).upper()

list1 = ['test1_two', 'testOne', 'testTwo', 'test_one']
list1.sort(key=make_key)

print(list1)

# Output is: ['testOne', 'test_one', 'test1_two', 'testTwo']
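
The original question also asks about descending order. A minimal sketch, assuming the same letters-only key as above (the names LETTERS and letters_only_key are only illustrative): both list.sort() and sorted() accept reverse=True, and strings whose keys compare equal keep their original relative order in either direction.

```python
from string import ascii_letters

# A frozenset gives fast membership tests without shadowing the imported name.
LETTERS = frozenset(ascii_letters)

def letters_only_key(s):
    """Keep only ASCII letters, then uppercase, so case and other characters are ignored."""
    return ''.join(c for c in s if c in LETTERS).upper()

list1 = ['test1_two', 'testOne', 'testTwo', 'test_one']

# sorted() returns a new list and leaves list1 untouched.
ascending = sorted(list1, key=letters_only_key)
descending = sorted(list1, key=letters_only_key, reverse=True)

print(ascending)   # ['testOne', 'test_one', 'test1_two', 'testTwo']
print(descending)  # ['test1_two', 'testTwo', 'testOne', 'test_one']
```

Note that the descending result is not simply the ascending list reversed: ties such as 'testOne' and 'test_one' stay in their input order because the sort is stable.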


#33978

From: "Prasad, Ramit" <ramit.prasad@jpmorgan.com>
Date: 2012-11-27 18:21 +0000
Message-ID: <mailman.319.1354041785.29569.python-list@python.org>
In reply to: #33972

san wrote:
> 
> Please let me know how to sort a list of strings in either ascending or descending order without considering
> special characters and case.
> Example: list1 = ['test1_two', 'testOne', 'testTwo', 'test_one']
> Applying list.sort() or sorted() gives ['test1_two', 'testOne', 'testTwo', 'test_one'],
> but ignoring special characters and case it should be
> ['testOne', 'test_one', 'test1_two', 'testTwo'] OR ['test_one', 'testOne', 'testTwo', 'test1_two'].
>
> list.sort() and sorted() order strings by the ASCII values of their characters. Please let me know how I can
> get the expected order instead.

You can pass a key function into list.sort() and sorted(). This
allows you to customize the sorting. In the below examples
I use lambda but you can use a non-lambda function (if you need
more complexity).

Case-insensitive comparisons are often done by converting the
strings being compared to the same case. Here I turn
them all uppercase.

lst = ['test1_two', 'testOne', 'testTwo', 'test_one']
lst.sort(key=lambda x: x.upper())
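
One detail worth checking with a case-only key: in ASCII, '_' (95) falls between 'Z' (90) and 'a' (97), so whether you fold to upper or lower case changes where an underscore sorts relative to letters. A small sketch, assuming Python 3.3+ for str.casefold (which is not mentioned in the thread):

```python
lst = ['test1_two', 'testOne', 'testTwo', 'test_one']

# Uppercase key: letters become 'A'-'Z', which sort before '_'.
by_upper = sorted(lst, key=str.upper)

# Casefolded key: letters become 'a'-'z', which sort after '_'.
by_casefold = sorted(lst, key=str.casefold)

print(by_upper)     # ['test1_two', 'testOne', 'testTwo', 'test_one']
print(by_casefold)  # ['test1_two', 'test_one', 'testOne', 'testTwo']
```

Case folding alone still leaves '_' and digits in the key, so those characters continue to influence the order.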

The next version also filters out non-alphanumeric characters
before comparing. You may be able to create and use a
translation table instead.

import string  # string.letters is Python 2 only; ascii_letters works in 2 and 3

lst.sort(key=lambda x: ''.join(c.upper() for c in x
                               if c in string.ascii_letters + string.digits))
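
As a rough sketch of the translation-table idea, assuming Python 3 (where the three-argument form of str.maketrans builds a table that deletes characters) and treating "special" as ASCII punctuation only; DROP_PUNCT and key_without_punct are illustrative names:

```python
import string

# Third argument: characters to delete. string.punctuation includes '_' and '-'.
DROP_PUNCT = str.maketrans('', '', string.punctuation)

def key_without_punct(s):
    """Delete ASCII punctuation, then uppercase for a case-insensitive key."""
    return s.translate(DROP_PUNCT).upper()

lst = ['test1_two', 'testOne', 'testTwo', 'test_one']
lst.sort(key=key_without_punct)

print(lst)  # ['test1_two', 'testOne', 'test_one', 'testTwo']
```

Unlike the alphanumeric filter, this table leaves whitespace and non-ASCII characters in the key, so it only matches that filter when the input is ASCII letters, digits, and punctuation.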


~Ramit



