Changes between Version 21 and Version 22 of SummerofCodeIdeas

Show
Ignore:
Timestamp:
02/05/12 05:36:30 (16 months ago)
Author:
rgommers
Comment:

Update for 2012, add NA idea.

Legend:

Unmodified
Added
Removed
Modified
  • SummerofCodeIdeas

    v21 v22  
    11[[PageOutline]] 
    22 
    3 The SciPy GSoC projects are run under the umbrella of the PSF.  The PSF will be heavily weighting Py3K projects this year (see the following email). 
     3The SciPy GSoC projects are run under the umbrella of the PSF.  The PSF info for 2012 is not yet available, but that of 2011 should help in getting started: http://wiki.python.org/moin/SummerOfCode/2011.  In particular, look at what is expected of students: http://wiki.python.org/moin/SummerOfCode/Expectations.  
    44 
    5 {{{ 
    6 ----- Forwarded message from "C. Titus Brown" ----- 
     5= Summer of Code 2012 Ideas = 
    76 
    8 Hi all, 
     7== Improve missing data (NA) support == 
    98 
    10 it's that time of year again, and Google has decided to run the Google 
    11 Summer of Code again! 
     9Numpy 1.7 includes for the first time support for missing data, implemented in a way similar to how it works in R. There is a lot more to do in this direction, for example:  
    1210 
    13  http://groups.google.com/group/google-summer-of-code-discuss/browse_thread/thread/d839c0b02ac15b3f 
     11  1. Size - that requires bit masks and a decision that masks only take two values. 
     12  2. Speed - that requires support in the ufunc loops. 
     13  3. Functions - isna needs some help, like isanyna(a, axis=1) 
     14  4. More support in current functions. 
     15  5. Implement NA support for relevant projects which depend on numpy (for example pandas). 
    1416 
    15  http://socghop.appspot.com/ 
     17== Ideas from previous years that may still be relevant == 
    1618 
    17 Arc Riley has stepped up to run it for the PSF again this year, and I'm 
    18 backstopping him.  If you are interested in mentoring or kibbitzing on those 
    19 who are, please sign up for the soc2010-mentors mailing list here, 
    20  
    21  http://mail.python.org/mailman/listinfo/soc2010-mentors 
    22  
    23 This year we're proposing to solicit and prioritize applications for 
    24 Python 3.x -- 3K tools, porting old projects, etc.  Python 2.x projects 
    25 will be a distinct second.  There will be no "core" category this year, 
    26 although obviously if someone on one of the core teams wants to push a 
    27 project it'll help! 
    28  
    29 If you have an idea for a project, please send it to the -mentors list and add 
    30 it to the wiki at 
    31  
    32   http://wiki.python.org/moin/SummerOfCode/2010 
    33  
    34 We're also going to change a few things up to make it more useful to the PSF. 
    35 Specifically, 
    36  
    37  - the foundation is going to *require* 1 blog post/wk from each student. 
    38  
    39  - we're going to hire an administrative assistant to monitor the students. 
    40  
    41  - the student application process will be a bit more rigorous and job-app 
    42   like; the Django SF has been doing this for at least one round and they 
    43   claim that it results in much better and more serious students. 
    44  
    45  - we'll be focusing on student quality more than on project egalitarianism. 
    46   If project X can recruit three fantastic students to one fantastic and one 
    47   mediocre student for project Y, then project X gets three and project Y 
    48   gets one. 
    49  
    50 The hope is that this will make the GSoC much more useful for Python than it 
    51 has been in the past. 
    52  
    53 Arc will be posting something to the www.python.org site and python-announce 
    54 soon, too. 
    55  
    56 Followups to soc2010-mentors. 
    57  
    58 cheers, 
    59 --titus 
    60  
    61 ----- End forwarded message ----- 
    62 }}} 
    63  
    64 Given the focus on Py3K-related projects, the most likely proposals to be accepted will be ones that continue work on the Py3K port of NumPy or propose to port SciPy, matplotlib, or ipython to Py3K.  
    65  
    66 = Summer of Code 2010 Ideas = 
    67  
    68 == Ideas with significant existing momentum == 
    69  
    70  * Port !SciPy to Python 3.0 (or continue finishing up the porting of !NumPy) 
    71    * Improve `numpy.f2py` test coverage and fix any issues that crop up on Python 3. 
    72    * Port Scipy to Python 3 
    73      * Set up a build framework that uses 2to3 (see e.g. how it is [http://projects.scipy.org/numpy/browser/trunk/tools/py3tool.py done in Numpy]) 
    74      * After that, modify Scipy so that it works both on Python 2 and Python 3 -- this can be done one submodule at a time, so the work can be partitioned in manageable pieces. Some submodules will be more work than others. 
    75      * Knowledge of C is required in porting some of the submodules. There are, however, also pure-Python submodules. 
    76      * Some information on: [http://projects.scipy.org/numpy/browser/trunk/doc/Py3K.txt changes made in porting Numpy] 
    77  * datetime types - see the [http://projects.scipy.org/numpy/browser/trunk/doc/neps/datetime-proposal.rst NEP] and recent SVN versions of !NumPy 
    7819 * Continue work on [http://conference.scipy.org/abstract?id=19 Fwrap] started by Kurt Smith in GSoC2009 
    79  * integrate Jonathan Taylor's statistical models into scipy.stats - see [http://statsmodels.sourceforge.net/ scikits.statsmodels] (Skipper Seabold, GSoC2009), current task for this is improving scikits.statsmodels 
    80  
    81 ''Partially recycled from previous years' project ideas.'' 
    82  
    8320 * improve datasource and integrate it into all the numpy/scipy io http://projects.scipy.org/scipy/numpy/browser/trunk/numpy/lib/_datasource.py 
    84  * clean up, refactor scipy package structure: scipy.lib, scipy.misc, ''scipy.stsci'' (already gone) - is this still relevant? no discussion on it 
    85  * modernize, clean-up scipy.weave, integrate fast-vectorize 
    8621 * [http://scipy.org/scipy/scipy/milestone/StatisticsCleanup Statistics Cleanup] 
    8722 * '''scipy.ndimage''': Rewrite in Python where possible, port to Cython elsewhere.  Decide on a consistent coordinate framework.  As a bonus, fix boundary issues.